Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Emmanuel Sugutt's picture

Emmanuel Sugutt

Sugutt

Ameeeee's profile picture

Manlux's profile picture

dvilasuero's profile picture

·

sugutt_
sugutt

AI & ML interests

Reinforcement learning Transformer models

Organizations

Sugutt 's collections 3

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Paper • 2508.14460 • Published Aug 20 • 82
MEML-GRPO: Heterogeneous Multi-Expert Mutual Learning for RLVR Advancement

Paper • 2508.09670 • Published Aug 13
URPO: A Unified Reward & Policy Optimization Framework for Large Language Models

Paper • 2507.17515 • Published Jul 23 • 2

LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark

Paper • 2504.13805 • Published Apr 18 • 11
Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models

Paper • 2503.16734 • Published Mar 20 • 1
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects

Paper • 2504.19838 • Published Apr 28 • 22

Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts

Paper • 2508.07785 • Published Aug 11 • 28
MoBE: Mixture-of-Basis-Experts for Compressing MoE-based LLMs

Paper • 2508.05257 • Published Aug 7 • 13
SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

Paper • 2507.20984 • Published Jul 28 • 56
MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9 • 92

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Paper • 2508.14460 • Published Aug 20 • 82
MEML-GRPO: Heterogeneous Multi-Expert Mutual Learning for RLVR Advancement

Paper • 2508.09670 • Published Aug 13
URPO: A Unified Reward & Policy Optimization Framework for Large Language Models

Paper • 2507.17515 • Published Jul 23 • 2

Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts

Paper • 2508.07785 • Published Aug 11 • 28
MoBE: Mixture-of-Basis-Experts for Compressing MoE-based LLMs

Paper • 2508.05257 • Published Aug 7 • 13
SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

Paper • 2507.20984 • Published Jul 28 • 56
MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9 • 92

LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark

Paper • 2504.13805 • Published Apr 18 • 11
Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models

Paper • 2503.16734 • Published Mar 20 • 1
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects

Paper • 2504.19838 • Published Apr 28 • 22

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs