ygh

yghstill

AI & ML interests

None yet

Recent Activity

authored a paper 17 days ago

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

authored a paper 17 days ago

Tequila: Trapping-free Ternary Quantization for Large Language Models

authored a paper 17 days ago

SpecExit: Accelerating Large Reasoning Model via Speculative Exit

Organizations

authored 3 papers 17 days ago

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Paper • 2505.15431 • Published May 21 • 1

Tequila: Trapping-free Ternary Quantization for Large Language Models

Paper • 2509.23809 • Published Sep 28 • 2

SpecExit: Accelerating Large Reasoning Model via Speculative Exit

Paper • 2509.24248 • Published Sep 29 • 1

updated a collection 18 days ago

Papers

2 items • Updated 18 days ago

upvoted a paper 18 days ago

SpecExit: Accelerating Large Reasoning Model via Speculative Exit

Paper • 2509.24248 • Published Sep 29 • 1

upvoted a paper about 1 month ago

Tequila: Trapping-free Ternary Quantization for Large Language Models

Paper • 2509.23809 • Published Sep 28 • 2

updated a Space 3 months ago

README

updated a collection 3 months ago

Deepseek-quant

A collection of quantized DeepSeek and Deepseek_r1_distill models • 14 items • Updated 18 days ago

updated a model 4 months ago

AngelSlim/Qwen3-32B_fp8_static

33B • Updated Jul 23 • 6

published 11 models 5 months ago

AngelSlim/Qwen2_5-1_5B_instruct_fp8_static

Updated Jul 23 • 18

AngelSlim/Qwen2_5-1_5B_int4_gptq

0.4B • Updated Jul 10 • 4

AngelSlim/Deepseek_r1_distill_qwen-1_5b_int4_awq

0.6B • Updated Jul 10 • 3

AngelSlim/Deepseek_r1_distill_qwen-7b_int4_awq

2B • Updated Jul 10

AngelSlim/Deepseek_r1_distill_qwen-14b_int4_gptq

3B • Updated Jul 10 • 6

AngelSlim/Deepseek_r1_distill_qwen-14b_fp8_static

15B • Updated Jul 23 • 320

AngelSlim/Deepseek_r1_distill_qwen-32b_int4_awq

6B • Updated Jul 10

AngelSlim/Deepseek_r1_distill_qwen-1_5b_int4_gptq

0.6B • Updated Jul 10 • 4

AngelSlim/Deepseek_r1_distill_qwen-7b_int4_gptq

2B • Updated Jul 10 • 2

AngelSlim/Deepseek_r1_distill_qwen-7b_fp8_static

8B • Updated Jul 23 • 2

AngelSlim/Deepseek_r1_distill_qwen-14b_int4_awq

3B • Updated Jul 10 • 3