Pull requests: NVIDIA/Model-Optimizer
Pull requests list
Emit VisualGen-compatible sparse_attention_config for diffusion skip-softmax export
#1816
opened Jun 24, 2026 by
jingyu-ml
Contributor
Add NVFP4 Conv3d export for diffusers VAE (Wan 2.2)
#1809
opened Jun 23, 2026 by
jingyu-ml
Contributor
Support FP8 per block (weight + dynamic per token activation) export
#1807
opened Jun 23, 2026 by
sugunav14
Contributor
MiniMax-M3 mixed MXFP8-base + NVFP4-experts PTQ export
#1806
opened Jun 23, 2026 by
chadvoegele
Contributor
Account for CE loss for MTP heads in Megatron KD
#1805
opened Jun 23, 2026 by
AAnoosheh
Contributor
Puzzletron tutorial fixes for runtime optimization
#1803
opened Jun 23, 2026 by
grzegorz-k-karch
Contributor
Remove deprecated examples/llm_autodeploy
#1797
opened Jun 22, 2026 by
Fridah-nv
Contributor
Add VLM pruning and PTQ with image-text calibration (Megatron-Bridge)
#1792
opened Jun 22, 2026 by
kevalmorabia97
Collaborator
Create adding_new_model_tutorial.md
#1784
opened Jun 22, 2026 by
danielkorzekwa
Contributor
Add: support input_shape_profile for trt-rtx ep
#1782
opened Jun 22, 2026 by
haoxiz-nvidia
Contributor
Fix low_memory_mode meta-device crash on fused-MoE models
#1781
opened Jun 21, 2026 by
abatilo
Experimental claude skill for puzzletron algorithm
#1769
opened Jun 18, 2026 by
danielkorzekwa
Contributor
feat(launcher): add Megatron-Bridge quantize/generate/export wrappers
#1767
opened Jun 17, 2026 by
yueshen2016
Contributor
feat(recipes): add nvfp4_mlp_only-novit-kv_fp8 (exclude VL vision tower)
cherry-pick-0.45.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1760
opened Jun 17, 2026 by
Edwardf0t1
Contributor
refactor(examples): rename llm_ptq → hf_ptq (symlink for back-compat)
#1759
opened Jun 17, 2026 by
Edwardf0t1
Contributor
DFlash for MiniMax-M3 (WIP): synthesis thinking-mode mix
#1749
opened Jun 16, 2026 by
yeyu-nvidia
Contributor
Draft