Yi Cui
onekq
AI & ML interests
Benchmark, Code Generation Model
Recent Activity
new activity
1 day ago
moonshotai/Kimi-K2-Thinking:If we apply PTQ to a QAT model, what will happen
posted
an
update
1 day ago
The reaction on the QAT post is beyond expectations so below is my optimizer post as promised. But I found that I had lots of explanation to do about optimizer itself. So this post is actually a historical recount. The Muon optimizer (used by Kimi) post (coming very soon) can only continue after this.
https://huggingface.co/blog/onekq/adam-optimizer
If you know Adam(W) optimizer already, you can just skip and sorry for the wait. Otherwise, it should be a useful read.
commented on
their
article
1 day ago
๐ณ QAT: The Art of Growing a Bonsai Model