Dreamer312

Dreamer

AI & ML interests

NLP, CV, LLM, AGENT, RL

Organizations

None yet

Recent Activity

upvoted a paper 9 days ago

Scaling Embeddings Outperforms Scaling Experts in Language Models

Paper • 2601.21204 • Published 11 days ago • 97

upvoted a paper 13 days ago

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published 16 days ago • 175

liked a model 2 months ago

WeiboAI/VibeThinker-1.5B

Text Generation • 2B • Updated Nov 24, 2025 • 1.97k • 512

liked a model 3 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • 170B • Updated 10 days ago • 365k • 1.66k

liked a Space 4 months ago

Robot Learning: A Tutorial

325

Learn about modern robot learning techniques and applications

commented 2 papers 4 months ago

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19

liked a dataset 4 months ago

Agent-Ark/Toucan-1.5M

Viewer • Updated Oct 4, 2025 • 1.65M • 4.78k • 192

commented a paper 8 months ago

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19

commented 3 papers 9 months ago

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19

upvoted a paper 9 months ago

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published May 20, 2025 • 76

upvoted a collection 9 months ago

Llama 4

Collection

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 4 days ago • 55

liked 2 models 9 months ago

unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF

Image-to-Text • 401B • Updated Jun 18, 2025 • 6k • 43

meta-llama/Llama-4-Maverick-17B-128E-Instruct

Image-to-Text • 402B • Updated May 22, 2025 • 22.7k • 458

commented a paper 9 months ago

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19

authored 2 papers 9 months ago

Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation

Paper • 2409.10262 • Published Sep 16, 2024 • 1

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19

commented a paper 9 months ago

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published May 18, 2025 • 19
