Factorizing Perception and Policy for Interactive Instruction Following
Paper
•
2012.03208
•
Published
None defined yet.
LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning
Q-Palette: Fractional-Bit Quantizers Toward Optimal Bit Allocation for Efficient LLM Deployment