In a Training Loop 🔄

5 51 104

Arthur EDMOND

Shumatsurontek

AI & ML interests

LLM & Computer Vision

Recent Activity

liked a model 10 days ago

moonshotai/Kimi-K2.5

upvoted a paper 15 days ago

Agentic Reasoning for Large Language Models

upvoted a paper 17 days ago

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

View all activity

Organizations

upvoted a paper 15 days ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published 19 days ago • 192

upvoted a paper 17 days ago

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

Paper • 2601.11655 • Published 22 days ago • 60

upvoted a paper 25 days ago

GlimpRouter: Efficient Collaborative Inference by Glimpsing One Token of Thoughts

Paper • 2601.05110 • Published 29 days ago • 29

upvoted a paper about 2 months ago

Kling-Omni Technical Report

Paper • 2512.16776 • Published Dec 18, 2025 • 169

upvoted an article about 2 months ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

Apr 18, 2025

•

upvoted an article 2 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

Sep 4, 2025

•

273

upvoted a paper 2 months ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 296

upvoted 4 papers 3 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 187

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 133

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

Paper • 2510.22115 • Published Oct 25, 2025 • 84

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24, 2025 • 101

upvoted 2 papers 4 months ago

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6, 2025 • 119

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1, 2025 • 120

upvoted a collection 4 months ago

Granite 4.0 Language Models

Collection

13 items • Updated Nov 17, 2025 • 206

upvoted 3 papers 4 months ago

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 146

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30, 2025 • 55

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 135

upvoted 3 papers 5 months ago