AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning
Q-Palette: Fractional-Bit Quantizers Toward Optimal Bit Allocation for Efficient LLM Deployment
models
0
None public yet
datasets
0
None public yet