·
AI & ML interests
LLMSys, LLM, MLSys
Organizations
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-6-experts-test-token-specific
3B
•
Updated
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-6-experts-test-1
3B
•
Updated
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-6-experts-test
3B
•
Updated
•
1
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-3-experts
2B
•
Updated
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-9-experts
3B
•
Updated
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-6-experts
3B
•
Updated
HectorHe/Qwen3-8B-math220k-run5
Text Generation
•
8B
•
Updated
HectorHe/Qwen3-8B-math220k-run4
Text Generation
•
8B
•
Updated
•
1
HectorHe/Qwen3-8B-math220k-run3
Text Generation
•
8B
•
Updated
•
2
HectorHe/Qwen3-8B-math220k-run2
Text Generation
•
8B
•
Updated
•
2
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-sft-codeforces
16B
•
Updated
•
10
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-sft-math220k
16B
•
Updated
•
2
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-sft-dolphin
16B
•
Updated
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-math10k-sft
16B
•
Updated
•
1
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Open-R1-Distill
3B
•
Updated
•
8
HectorHe/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated