Motoki Wu

tokestermw

https://motoki.co

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

Qwen/Qwen3.5-35B-A3B

liked a model 4 days ago

Qwen/Qwen3.5-27B

upvoted a collection 6 days ago

Qwen3.5

17 items • Updated about 13 hours ago • 693

upvoted a paper 13 days ago

Experiential Reinforcement Learning

Paper • 2602.13949 • Published 16 days ago • 68

upvoted a paper about 1 month ago

Agentic-R: Learning to Retrieve for Agentic Search

Paper • 2601.11888 • Published Jan 17 • 19

upvoted 2 collections about 1 month ago

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 7 items • Updated 7 days ago • 146

GLM-4.7

3 items • Updated Jan 19 • 64

upvoted a paper 2 months ago

Reinforcement Learning for Self-Improving Agent with Skill Library

Paper • 2512.17102 • Published Dec 18, 2025 • 36

upvoted 4 articles 3 months ago

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

Article • Dec 15, 2025 • 109

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Article • Dec 9, 2025 • 84

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

Article • Dec 8, 2025 • 53

Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks

Article • Nov 21, 2025 • 26

upvoted a collection 4 months ago

PromptMII

Prompt-MII: Meta-Learning Instruction Induction for LLMs. Link to paper: https://arxiv.org/abs/2510.16932 • 4 items • Updated Oct 21, 2025 • 2

upvoted a paper 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 318

upvoted an article 5 months ago

mem-agent: Equipping LLM Agents with Memory Using RL

Article • Oct 9, 2025 • 33

upvoted a paper 5 months ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 129

upvoted a collection 5 months ago

Qwen3-Omni

6 items • Updated Dec 31, 2025 • 183

upvoted 5 papers 6 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 196

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 232

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Paper • 2508.21113 • Published Aug 28, 2025 • 110

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

Paper • 2508.16949 • Published Aug 23, 2025 • 24

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22, 2025 • 160