NVIDIA / Model-Optimizer Public

Pull requests: NVIDIA/Model-Optimizer

56 Open 349 Closed

Pull requests list

Minitron pruning refactor [1/2]: Remove num_query_groups pruning and simplify megatron dynamic modules for follow-up PRs

#690 opened Dec 15, 2025 by kevalmorabia97

2 tasks done

Feat: MLA eagle

#689 opened Dec 15, 2025 by h-guo18 • Draft

Use shared activation hooks component in the puzzle algorithm

#687 opened Dec 13, 2025 by danielkorzekwa

Added support for KV cache quantization for vllm fakequant

#686 opened Dec 13, 2025 by kinjalpatel27

Update llm_ptq doc

#685 opened Dec 12, 2025 by cjluo-nv

Refactor: Clean up EAGLE training dataset preparation

#684 opened Dec 12, 2025 by benchislett

Registry interface for custom quantization functional backend

#683 opened Dec 12, 2025 by realAsma

MLM QQD example

#682 opened Dec 12, 2025 by meenchen • Draft

Support Qwen3Next NVFP4 quantization

#681 opened Dec 12, 2025 by cjluo-nv

Update README.md

#678 opened Dec 12, 2025 by omrialmog

Add support for Qwen3-Omni-30B-A3B-Thinking

#677 opened Dec 11, 2025 by ajrasane • Draft

[5615343][ONNX] Add support for dynamically linked TensorRT plugins

#675 opened Dec 11, 2025 by gcunhase

Use kitchen FA in huggingface plugin

#674 opened Dec 11, 2025 by sychen52 • Draft

Write extra state for KV quantizer

#673 opened Dec 10, 2025 by jenchen13

Support KIMI K2 Thinking int4 checkpoint PTQ

#669 opened Dec 9, 2025 by cjluo-nv

Refactor: Eagle data loading

#668 opened Dec 8, 2025 by h-guo18 • Draft

Add Quantizers for Qwen3VLMoeTextDecoderLayer

#666 opened Dec 8, 2025 by soodoshll

Refactor and clean up hf_ptq.py

#665 opened Dec 8, 2025 by shengliangxu

Replace mip package with pulp

#663 opened Dec 8, 2025 by kevalmorabia97

Use gesvd as solver

#661 opened Dec 8, 2025 by andompesta

[5336829][AutoCast] Support subgraphs

#659 opened Dec 7, 2025 by galagam

config file based modelopt config 1/N

#657 opened Dec 6, 2025 by shengliangxu • Draft

Kimi-k2 calib+export

#655 opened Dec 5, 2025 by jingyu-ml

Support model export for int4 wo

#653 opened Dec 5, 2025 by meenchen

[NVBUG: 5701937]Clear GPU cache for 3D weight tensors

#649 opened Dec 4, 2025 by cjluo-nv
