ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation Paper • 2510.08551 • Published Oct 9 • 31
Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention Paper • 2510.04212 • Published Oct 5 • 23
ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning Paper • 2510.12693 • Published Oct 14 • 26
ARGenSeg: Image Segmentation with Autoregressive Image Generation Model Paper • 2510.20803 • Published 26 days ago • 9
Baichuan-M2: Scaling Medical Capability with Large Verifier System Paper • 2509.02208 • Published Sep 2 • 41
Reasoning with Sampling: Your Base Model is Smarter Than You Think Paper • 2510.14901 • Published Oct 16 • 47
Search Self-play: Pushing the Frontier of Agent Capability without Supervision Paper • 2510.18821 • Published 28 days ago • 16
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations Paper • 2510.23607 • Published 22 days ago • 172
π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models Paper • 2510.25889 • Published 20 days ago • 62
huihui-ai/Huihui-Qwen3-VL-235B-A22B-Instruct-abliterated-GGUF Image-Text-to-Text • 235B • Updated 17 days ago • 5.62k • 14
tencent/KaLM-Embedding-Gemma3-12B-2511 Sentence Similarity • 12B • Updated about 3 hours ago • 8.63k • 21
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published 10 days ago • 103