-
Notifications
You must be signed in to change notification settings - Fork 214
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Minitron pruning refactor [1/2]: Remove num_query_groups pruning and simplify megatron dynamic modules for follow-up PRs
#690
opened Dec 15, 2025 by
kevalmorabia97
Loading…
2 tasks done
Use shared activation hooks component in the puzzle algorithm
#687
opened Dec 13, 2025 by
danielkorzekwa
Loading…
Added support for KV cache quantization for vllm fakequant
#686
opened Dec 13, 2025 by
kinjalpatel27
Loading…
Refactor: Clean up EAGLE training dataset preparation
#684
opened Dec 12, 2025 by
benchislett
Loading…
Registry interface for custom quantization functional backend
#683
opened Dec 12, 2025 by
realAsma
Loading…
[5615343][ONNX] Add support for dynamically linked TensorRT plugins
#675
opened Dec 11, 2025 by
gcunhase
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.