Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[None][infra] Waive 1 failed cases for main in pre-merge 41894
#15089 opened Jun 8, 2026 by ZhanruiSunCh Collaborator Loading…
[TRTLLMINF-127][infra]Upgrade dependencies for dlfw 26.05 stack
#15087 opened Jun 8, 2026 by EmmaQiaoCh Collaborator Loading…
1 task done
[None][infra] Waive TestQwen3NextInstruct nvfp4 cases
#15086 opened Jun 8, 2026 by mzweilz Collaborator Loading…
1 task
[None][fix] Fix and unwaive nemotron related bugs
#15085 opened Jun 8, 2026 by Wanli-Jiang Collaborator Loading…
1 task done
[None][test] Move Ultra V3 to official checkpoint
#15084 opened Jun 8, 2026 by tcherckez-nvidia Collaborator Loading…
1 task done
[None][test] Half K25 Agg Multi Round to Solve Timeout Issue
#15083 opened Jun 8, 2026 by chenfeiz0326 Collaborator Loading…
1 task done
[https://nvbugs/6162940][chore] Unwaive fixed test
#15078 opened Jun 8, 2026 by longlee0622 Collaborator Loading…
1 task done
[None][chore] Add Pro aggregate GSM8K CI gate deepseek-v4
#15076 opened Jun 8, 2026 by mingyangHao Collaborator Draft
1 task
[None][fix] report primary KV cache stats
#15075 opened Jun 8, 2026 by yizhang-nv Member Loading…
[None][feat] Dis-agg content-derived conversation affinity deepseek-v4
#15074 opened Jun 8, 2026 by Shixiaowei02 Collaborator Loading…
1 task done
[https://nvbugs/6262407][test] used to debug v2c bot
#15070 opened Jun 8, 2026 by xinhe-nv Collaborator Loading…
1 task done
[None][fix] Generalize FP8 checkpoint loading for Qwen3.5
#15067 opened Jun 8, 2026 by amukkara Collaborator Loading…
1 task done
[None][chore] Fix MAX_UTILIZATION reuse token budget on main
#15066 opened Jun 7, 2026 by brb-nv Collaborator Draft
1 task done
[#14225][perf] AutoDeploy MTP + ADP enablement and MoE all-to-all optimization
#15063 opened Jun 7, 2026 by MrGeva Collaborator Loading…
1 task done
[None][fix] fix CppMambaHybridCacheManager to handle dp dummy request
#15054 opened Jun 7, 2026 by bo-nv Collaborator Loading…
1 task done
[TRTLLM-13264][feat] Add native bias epilogue to NVFP4 GEMM
#15053 opened Jun 7, 2026 by luyiyun1021 Collaborator Loading…
1 task done
[None][chore] Enable skip memory estimation on kv cache manager v2
#15052 opened Jun 7, 2026 by HuiGao-NV Collaborator Loading…
1 task
ProTip! Type g i on any issue or pull request to go back to the issue listing page.