AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning
Q-Palette: Fractional-Bit Quantizers Toward Optimal Bit Allocation for Efficient LLM Deployment
snu
's datasets
None public yet