MiniCPM-o & MiniCPM-V Collection Multimodal models with leading performance. • 29 items • Updated 9 days ago • 72
Nemotron v3 Pre-Training Collection Large-scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 4 days ago • 9
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 20 items • Updated about 3 hours ago • 72
Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time Article • Published Feb 18, 2025 • 35
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published Jun 16, 2025 • 273
DeepSeek R1 (All Versions) Collection DeepSeek-R1-0528 is here! The most powerful open reasoning LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 4 days ago • 265
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 6 items • Updated 9 days ago • 164
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated Dec 23, 2025 • 309
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models in 5 sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 37 items • Updated 9 days ago • 376
DBRX Collection DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27, 2024 • 96