-
Nuclear Norm Regularization for Deep Learning
Paper • 2405.14544 • Published • 1 -
Token embeddings violate the manifold hypothesis
Paper • 2504.01002 • Published • 1 -
Approximate Nullspace Augmented Finetuning for Robust Vision Transformers
Paper • 2403.10476 • Published • 1 -
ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning
Paper • 2504.00254 • Published • 1
Collections
Discover the best community collections!
Collections including paper arxiv:2510.26697
-
The Era of Agentic Organization: Learning to Organize with Language Models
Paper • 2510.26658 • Published • 25 -
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
Paper • 2510.25992 • Published • 42 -
The End of Manual Decoding: Towards Truly End-to-End Language Models
Paper • 2510.26697 • Published • 113
-
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 483 -
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
Paper • 2509.26507 • Published • 532 -
LightMem: Lightweight and Efficient Memory-Augmented Generation
Paper • 2510.18866 • Published • 109 -
The End of Manual Decoding: Towards Truly End-to-End Language Models
Paper • 2510.26697 • Published • 113
-
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 483 -
SpikingBrain Technical Report: Spiking Brain-inspired Large Models
Paper • 2509.05276 • Published • 3 -
Self-Adapting Language Models
Paper • 2506.10943 • Published • 6 -
The Art of Scaling Reinforcement Learning Compute for LLMs
Paper • 2510.13786 • Published • 30
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 243 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
MADD: Multi-Agent Drug Discovery Orchestra
Paper • 2511.08217 • Published • 54 -
The Station: An Open-World Environment for AI-Driven Discovery
Paper • 2511.06309 • Published • 34 -
An AI system to help scientists write expert-level empirical software
Paper • 2509.06503 • Published • 6 -
The Era of Agentic Organization: Learning to Organize with Language Models
Paper • 2510.26658 • Published • 25
-
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 483 -
Cache-to-Cache: Direct Semantic Communication Between Large Language Models
Paper • 2510.03215 • Published • 96 -
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
Paper • 2510.07499 • Published • 48 -
StreamingVLM: Real-Time Understanding for Infinite Video Streams
Paper • 2510.09608 • Published • 50
-
Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations
Paper • 2508.09789 • Published • 5 -
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents
Paper • 2508.13186 • Published • 18 -
ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents
Paper • 2508.04038 • Published • 1 -
Prompt Orchestration Markup Language
Paper • 2508.13948 • Published • 48
-
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
Paper • 2410.14059 • Published • 61 -
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
Paper • 2503.05179 • Published • 46 -
Token-Efficient Long Video Understanding for Multimodal LLMs
Paper • 2503.04130 • Published • 96 -
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing
Paper • 2503.10639 • Published • 53
-
Nuclear Norm Regularization for Deep Learning
Paper • 2405.14544 • Published • 1 -
Token embeddings violate the manifold hypothesis
Paper • 2504.01002 • Published • 1 -
Approximate Nullspace Augmented Finetuning for Robust Vision Transformers
Paper • 2403.10476 • Published • 1 -
ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning
Paper • 2504.00254 • Published • 1
-
MADD: Multi-Agent Drug Discovery Orchestra
Paper • 2511.08217 • Published • 54 -
The Station: An Open-World Environment for AI-Driven Discovery
Paper • 2511.06309 • Published • 34 -
An AI system to help scientists write expert-level empirical software
Paper • 2509.06503 • Published • 6 -
The Era of Agentic Organization: Learning to Organize with Language Models
Paper • 2510.26658 • Published • 25
-
The Era of Agentic Organization: Learning to Organize with Language Models
Paper • 2510.26658 • Published • 25 -
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
Paper • 2510.25992 • Published • 42 -
The End of Manual Decoding: Towards Truly End-to-End Language Models
Paper • 2510.26697 • Published • 113
-
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 483 -
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
Paper • 2509.26507 • Published • 532 -
LightMem: Lightweight and Efficient Memory-Augmented Generation
Paper • 2510.18866 • Published • 109 -
The End of Manual Decoding: Towards Truly End-to-End Language Models
Paper • 2510.26697 • Published • 113
-
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 483 -
Cache-to-Cache: Direct Semantic Communication Between Large Language Models
Paper • 2510.03215 • Published • 96 -
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
Paper • 2510.07499 • Published • 48 -
StreamingVLM: Real-Time Understanding for Infinite Video Streams
Paper • 2510.09608 • Published • 50
-
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 483 -
SpikingBrain Technical Report: Spiking Brain-inspired Large Models
Paper • 2509.05276 • Published • 3 -
Self-Adapting Language Models
Paper • 2506.10943 • Published • 6 -
The Art of Scaling Reinforcement Learning Compute for LLMs
Paper • 2510.13786 • Published • 30
-
Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations
Paper • 2508.09789 • Published • 5 -
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents
Paper • 2508.13186 • Published • 18 -
ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents
Paper • 2508.04038 • Published • 1 -
Prompt Orchestration Markup Language
Paper • 2508.13948 • Published • 48
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 243 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
Paper • 2410.14059 • Published • 61 -
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
Paper • 2503.05179 • Published • 46 -
Token-Efficient Long Video Understanding for Multimodal LLMs
Paper • 2503.04130 • Published • 96 -
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing
Paper • 2503.10639 • Published • 53