Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arXiv:2506.10892

Esoteric Language Models

Paper • 2506.01928 • Published Jun 2 • 9
sahoo-diffusion/Eso-LM-B-alpha-1

0.2B • Updated May 29 • 1 • 2
sahoo-diffusion/Eso-LM-B-alpha-0_25

0.2B • Updated Jul 4 • 5 • 1
The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 37

Papers of interest

The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 37

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Paper • 2506.08889 • Published Jun 10 • 23
MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9 • 92
Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262
OpenThoughts: Data Recipes for Reasoning Models

Paper • 2506.04178 • Published Jun 4 • 48

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 139
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development

Paper • 2506.05010 • Published Jun 5 • 79
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

Paper • 2506.05301 • Published Jun 5 • 56
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning

Paper • 2505.16933 • Published May 22 • 34

The Diffusion Duality

s-sahoo/duo

Text Generation • 0.2B • Updated Oct 8 • 411 • 3
s-sahoo/duo-distilled

Text Generation • 0.2B • Updated about 1 month ago • 182 • 1
The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 37

Diffusion Language

d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

Paper • 2504.12216 • Published Apr 16 • 3
Unifying Autoregressive and Diffusion-Based Sequence Generation

Paper • 2504.06416 • Published Apr 8 • 3
The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 37
Anchored Diffusion Language Model

Paper • 2505.18456 • Published May 24 • 1

The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 37

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262
Dreamland: Controllable World Creation with Simulator and Generative Models

Paper • 2506.08006 • Published Jun 9 • 7
The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 37

CoRAG: Collaborative Retrieval-Augmented Generation

Paper • 2504.01883 • Published Apr 2 • 9
SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning

Paper • 2504.08600 • Published Apr 11 • 31
Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL

Paper • 2503.23157 • Published Mar 29 • 10
AI Agents: Evolution, Architecture, and Real-World Applications

Paper • 2503.12687 • Published Mar 16 • 2

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12 • 73
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

Paper • 2505.15045 • Published May 21 • 54
Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding

Paper • 2505.16990 • Published May 22 • 22
D-AR: Diffusion via Autoregressive Models

Paper • 2505.23660 • Published May 29 • 34

Esoteric Language Models

Paper • 2506.01928 • Published Jun 2 • 9
sahoo-diffusion/Eso-LM-B-alpha-1

0.2B • Updated May 29 • 1 • 2
sahoo-diffusion/Eso-LM-B-alpha-0_25

0.2B • Updated Jul 4 • 5 • 1
The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 37

Diffusion Language

d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

Paper • 2504.12216 • Published Apr 16 • 3
Unifying Autoregressive and Diffusion-Based Sequence Generation

Paper • 2504.06416 • Published Apr 8 • 3
The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 37
Anchored Diffusion Language Model

Paper • 2505.18456 • Published May 24 • 1

Papers of interest

The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 37

The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 37

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Paper • 2506.08889 • Published Jun 10 • 23
MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9 • 92
Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262
OpenThoughts: Data Recipes for Reasoning Models

Paper • 2506.04178 • Published Jun 4 • 48

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262
Dreamland: Controllable World Creation with Simulator and Generative Models

Paper • 2506.08006 • Published Jun 9 • 7
The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 37

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 139
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development

Paper • 2506.05010 • Published Jun 5 • 79
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

Paper • 2506.05301 • Published Jun 5 • 56
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning

Paper • 2505.16933 • Published May 22 • 34

CoRAG: Collaborative Retrieval-Augmented Generation

Paper • 2504.01883 • Published Apr 2 • 9
SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning

Paper • 2504.08600 • Published Apr 11 • 31
Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL

Paper • 2503.23157 • Published Mar 29 • 10
AI Agents: Evolution, Architecture, and Real-World Applications

Paper • 2503.12687 • Published Mar 16 • 2

The Diffusion Duality

s-sahoo/duo

Text Generation • 0.2B • Updated Oct 8 • 411 • 3
s-sahoo/duo-distilled

Text Generation • 0.2B • Updated about 1 month ago • 182 • 1
The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 37

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12 • 73
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

Paper • 2505.15045 • Published May 21 • 54
Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding

Paper • 2505.16990 • Published May 22 • 22
D-AR: Diffusion via Autoregressive Models

Paper • 2505.23660 • Published May 29 • 34

Previous
1
2
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs