nicolas's picture

46 49

nicolas

niko91i

·

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

cerebras/MiniMax-M2-REAP-172B-A10B

liked a model 7 days ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4

upvoted a paper 7 days ago

TiDAR: Think in Diffusion, Talk in Autoregression

View all activity

Organizations

None yet

upvoted a paper 7 days ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published 9 days ago • 92

upvoted a paper 8 days ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published 11 days ago • 110

upvoted a paper 18 days ago

Continuous Autoregressive Language Models

Paper • 2510.27688 • Published 20 days ago • 65

upvoted 2 papers 27 days ago

REAP the Experts: Why Pruning Prevails for One-Shot MoE compression

Paper • 2510.13999 • Published Oct 15 • 5

SPARK: Synergistic Policy And Reward Co-Evolving Framework

Paper • 2509.22624 • Published Sep 26 • 17

upvoted 2 papers about 1 month ago

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA

Paper • 2510.04849 • Published Oct 6 • 112

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 484

upvoted 4 papers 3 months ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28 • 115

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 88

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19 • 48

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7 • 127

upvoted 9 papers 4 months ago

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Paper • 2507.22827 • Published Jul 30 • 98

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29 • 132

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

Paper • 2507.20984 • Published Jul 28 • 56

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 156

Deep Researcher with Test-Time Diffusion

Paper • 2507.16075 • Published Jul 21 • 66

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19 • 133

KV Cache Steering for Inducing Reasoning in Small Language Models

Paper • 2507.08799 • Published Jul 11 • 40

NeuralOS: Towards Simulating Operating Systems via Neural Generative Models

Paper • 2507.08800 • Published Jul 11 • 80

Muon is Scalable for LLM Training

Paper • 2502.16982 • Published Feb 24 • 8