Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

feat: Add EXAONE-Deep
#3054 opened Mar 25, 2025 by yechank-nvidia Loading…
doc: Update DeepSeekV3 doc
#3052 opened Mar 25, 2025 by xiaoweiw-nv Loading…
chore: upgrade transformers to 4.50.0
#3051 opened Mar 25, 2025 by achartier Loading…
fix: AllReduce CUDA Graph Fix
#3049 opened Mar 25, 2025 by yizhang-nv Draft
feat: Pytorch PP + attention DP support
#3044 opened Mar 24, 2025 by achartier Loading…
chore: Add second possible output for llava
#3043 opened Mar 24, 2025 by amukkara Loading…
feat: Add initial EAGLE-3 implementation
#3035 opened Mar 24, 2025 by mikeiovine Loading…
feat: Unify two versions of allreduce custom op
#3032 opened Mar 24, 2025 by yukunh-nvidia Loading…
test: [TRTLLM-4000] Port multi GPU changes to GitHub CI Any issue relates with CI testing
#3027 opened Mar 24, 2025 by DomBrown Loading…
feat: Draft/lora_modules_support
#3026 opened Mar 24, 2025 by danielafrimi Draft
fix: disable KV cache reuse if using attention sink
#3021 opened Mar 24, 2025 by Funatiq Loading…
Support cos_sin_cache in all cases.
#3020 opened Mar 24, 2025 by yuxianq Loading…
fix: creating output of dataset generator in current directory bug Something isn't working
#3018 opened Mar 24, 2025 by hypdeb Loading…
ProTip! Exclude everything labeled bug with -label:bug.