-
Notifications
You must be signed in to change notification settings - Fork 2.4k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[None][infra] Waive 1 failed cases for main in pre-merge 41894
#15089
opened Jun 8, 2026 by
ZhanruiSunCh
Collaborator
Loading…
[https://nvbugs/6255037][fix] Count DSA indexer K-cache correctly as UINT8 in KV cache size estimate
#15088
opened Jun 8, 2026 by
eopXD
Collaborator
Loading…
1 task done
[TRTLLMINF-127][infra]Upgrade dependencies for dlfw 26.05 stack
#15087
opened Jun 8, 2026 by
EmmaQiaoCh
Collaborator
Loading…
1 task done
[None][infra] Waive TestQwen3NextInstruct nvfp4 cases
#15086
opened Jun 8, 2026 by
mzweilz
Collaborator
Loading…
1 task
[None][fix] Fix and unwaive nemotron related bugs
#15085
opened Jun 8, 2026 by
Wanli-Jiang
Collaborator
Loading…
1 task done
[None][test] Move Ultra V3 to official checkpoint
#15084
opened Jun 8, 2026 by
tcherckez-nvidia
Collaborator
Loading…
1 task done
[None][test] Half K25 Agg Multi Round to Solve Timeout Issue
#15083
opened Jun 8, 2026 by
chenfeiz0326
Collaborator
Loading…
1 task done
[https://nvbugs/6212252][fix] Select CUTLASS MoE backend on non-Blackwell SMs in TestQwen3_5_35B_A3B::test_fp8
#15081
opened Jun 8, 2026 by
xxi-nv
Collaborator
Loading…
1 task done
[None][fix] Register Multimodal Placeholders for Qwen3.5 MoE VLM Serving
#15079
opened Jun 8, 2026 by
anurags25
Loading…
4 tasks done
[https://nvbugs/6162940][chore] Unwaive fixed test
#15078
opened Jun 8, 2026 by
longlee0622
Collaborator
Loading…
1 task done
[None][chore] Add Pro aggregate GSM8K CI gate
deepseek-v4
#15076
opened Jun 8, 2026 by
mingyangHao
Collaborator
•
Draft
1 task
[None][feat] Dis-agg content-derived conversation affinity
deepseek-v4
#15074
opened Jun 8, 2026 by
Shixiaowei02
Collaborator
Loading…
1 task done
[None][test] Waive 4 failed cases for main in QA CI
#15071
opened Jun 8, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[https://nvbugs/6262407][test] used to debug v2c bot
#15070
opened Jun 8, 2026 by
xinhe-nv
Collaborator
Loading…
1 task done
[None][fix] Generalize FP8 checkpoint loading for Qwen3.5
#15067
opened Jun 8, 2026 by
amukkara
Collaborator
Loading…
1 task done
[None][fix] Guard Gemma4 backend on unsupported GPUs and fix Gemma3 AutoDeploy use_cache
#15064
opened Jun 7, 2026 by
ssam18
Contributor
Loading…
[#14225][perf] AutoDeploy MTP + ADP enablement and MoE all-to-all optimization
#15063
opened Jun 7, 2026 by
MrGeva
Collaborator
Loading…
1 task done
[None][fix] Disable MegaMoE DeepGEMM fast-math under sparse attention
#15062
opened Jun 7, 2026 by
lishicheng1996-nv
Collaborator
•
Draft
1 task done
[https://nvbugs/6162120][test] Remove 78 closed-bug waive entries for main
#15061
opened Jun 7, 2026 by
tensorrt-cicd
Collaborator
Loading…
[None][fix] fix CppMambaHybridCacheManager to handle dp dummy request
#15054
opened Jun 7, 2026 by
bo-nv
Collaborator
Loading…
1 task done
[TRTLLM-13264][feat] Add native bias epilogue to NVFP4 GEMM
#15053
opened Jun 7, 2026 by
luyiyun1021
Collaborator
Loading…
1 task done
[None][chore] Enable skip memory estimation on kv cache manager v2
#15052
opened Jun 7, 2026 by
HuiGao-NV
Collaborator
Loading…
1 task
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.