Pull requests: NVIDIA/TensorRT-LLM
Detokenize option in /v1/completions request
Labels: Community Engagement (help/insights needed from community), Community want to contribute (PRs initiated from Community)
#5382 opened Jun 20, 2025 by Wokzy
Fix: missing clientId when serialize and deserialize response (cherry-pick #5231)
#5378 opened Jun 20, 2025 by kaiyux
[TRTLLM-5831][feat] Add LoRA support for pytorch backend in trtllm-serve
#5376 opened Jun 19, 2025 by talorabr
Fix permission for local user issues in NGC docker container.
#5373 opened Jun 19, 2025 by MartinMarciniszyn
[TRTLLM-5838][fix] fix max batch size and max tokens in kv cache estimations for Nemotron-H
#5371 opened Jun 19, 2025 by tomeras91
feat: add LLmArgs option to force using dynamic quantization
#5346 opened Jun 19, 2025 by achartier
[fix] Add 1 and draft_token_num to seq_len when overlap scheduling is enabled during memory estimation
#5343 opened Jun 19, 2025 by HuiGao-NV
ProTip! Show only pull requests updated in the last three days with the search qualifier updated:>2025-06-17.
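For reference, a minimal search query that reproduces this filtered view of the list (a sketch using GitHub's issue/PR search syntax; the is:pr and is:open qualifiers are added here for illustration and are not part of the original hint):

```
is:pr is:open updated:>2025-06-17
```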