-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
test: add random image test for llama-3.2-11b-vision
#3055
opened Mar 25, 2025 by
crazydemo
Loading…
fix: Set correct draft_token_nums to dummy requests for torch compilation with MTP
#3053
opened Mar 25, 2025 by
HuiGao-NV
Loading…
feat: Support prequantized fp8 ckpt for nemotron-mini-4b-instruct
#3046
opened Mar 24, 2025 by
brb-nv
Loading…
Feat: Support Linear block scale layout in FP4 quantization
#3045
opened Mar 24, 2025 by
yibinl-nvidia
Loading…
perf: [AutoDeploy] Enable AutoDeploy as a backend in trtllm-bench
#3041
opened Mar 24, 2025 by
suyoggupta
Loading…
infra: [CI] - Only checkout the Git sourcecodes once in the CI pipeline
#3029
opened Mar 24, 2025 by
chzblych
Loading…
test: [TRTLLM-4000] Port multi GPU changes to GitHub
CI
Any issue relates with CI testing
#3027
opened Mar 24, 2025 by
DomBrown
Loading…
refactor: Remove speculative decoding parameters from stateful decoders
#3024
opened Mar 24, 2025 by
Funatiq
Loading…
fix: creating output of dataset generator in current directory
bug
Something isn't working
#3018
opened Mar 24, 2025 by
hypdeb
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.