- INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats • 2510.25602 • Published Oct 2025
- Model Merging in Pre-training of Large Language Models • 2505.12082 • Published May 17, 2025
- PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs • 2410.05265 • Published Oct 7, 2024
- EfficientQAT: Efficient Quantization-Aware Training for Large Language Models • 2407.11062 • Published Jul 10, 2024