mltrials

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

We Got Claude to Fine-Tune an Open Source LLM

upvoted a collection 4 months ago

Granite Docling

liked a model 4 months ago

OmniDimen/OmniDimen-4B-Emotion-GGUF-q4_K_M

View all activity

Organizations

None yet

upvoted an article about 1 month ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

575

upvoted a collection 4 months ago

Granite Docling

Collection

5 items • Updated Nov 17, 2025 • 60

liked a model 4 months ago

OmniDimen/OmniDimen-4B-Emotion-GGUF-q4_K_M

Text Generation • 4B • Updated Sep 19, 2025 • 79 • 5

upvoted an article 4 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

Sep 4, 2025

•

268

upvoted 2 papers 5 months ago

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19, 2025 • 48

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6, 2025 • 129

liked a model 6 months ago

osmosis-ai/Osmosis-Apply-1.7B

Text Generation • 2B • Updated Jul 3, 2025 • 79 • 91

upvoted an article 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025

•

747

upvoted a paper 7 months ago

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15, 2025 • 63

updated a model 7 months ago

mltrials/opt-350m-lora

Updated Jun 9, 2025

published a model 7 months ago

mltrials/opt-350m-lora

Updated Jun 9, 2025

liked a model 11 months ago

meta-llama/Meta-Llama-3-8B-Instruct

Text Generation • 8B • Updated Jun 18, 2025 • 1.51M • • 4.35k

liked a model 12 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 349k • • 13k

upvoted 3 papers about 1 year ago

upvoted 4 articles about 1 year ago

Article

🌁#81: Key AI Concepts to Follow in 2025

Dec 23, 2024

•

Article

Fine-tune ModernBERT for text classification using synthetic data

Dec 30, 2024

•

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Jan 2, 2025

•

Article

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

Jan 3, 2025

•

mltrials

AI & ML interests

Recent Activity

Organizations

mltrials's activity

We Got Claude to Fine-Tune an Open Source LLM

Welcome EmbeddingGemma, Google's new efficient embedding model

SmolLM3: smol, multilingual, long-context reasoner

🌁#81: Key AI Concepts to Follow in 2025

Fine-tune ModernBERT for text classification using synthetic data

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Fine-tune a SmolLM on domain-specific synthetic data from a LLM