Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[doc] update mtp documents
#5387 opened Jun 20, 2025 by lfr-0531 Loading…
feat: Remove not used padding_idx in models
#5385 opened Jun 20, 2025 by HuiGao-NV Loading…
Detokenize option in /v1/completions request Community Engagement help/insights needed from community Community want to contribute PRs initiated from Community
#5382 opened Jun 20, 2025 by Wokzy Loading…
Feat/unify checkpoints loading
#5372 opened Jun 19, 2025 by shaharmor98 Draft
fix: fix bug of qwen3 + eagle3 + finalize_moe_fusion
#5369 opened Jun 19, 2025 by byshiue Loading…
Mxfp4 moe
#5367 opened Jun 19, 2025 by Tracin Loading…
tests: update benchmark test lists
#5365 opened Jun 19, 2025 by xinhe-nv Loading…
Make moe permute and final as custom op
#5358 opened Jun 19, 2025 by limin2021 Loading…
fix: Fix skip by mpi size fixture
#5355 opened Jun 19, 2025 by yizhang-nv Loading…
feat(openai protocol):support logitbias
#5354 opened Jun 19, 2025 by xq25478 Loading…
feat(eagle):support qwen in eagle1/2
#5352 opened Jun 19, 2025 by xq25478 Loading…
feat(model):support qwen3 dense in trt flow
#5350 opened Jun 19, 2025 by xq25478 Loading…
[fix][test] parametrize deepseek eval
#5341 opened Jun 18, 2025 by omera-nv Loading…
ProTip! Updated in the last three days: updated:>2025-06-17.