Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought Paper • 2505.15431 • Published May 21 • 1
Tequila: Trapping-free Ternary Quantization for Large Language Models Paper • 2509.23809 • Published Sep 28 • 2
SpecExit: Accelerating Large Reasoning Model via Speculative Exit Paper • 2509.24248 • Published Sep 29 • 1
Deepseek-quant Collection The collection of quantization models of DeepSeek and Deepseek_r1_distill • 14 items • Updated 18 days ago