reading-list - a maroahma Collection

maroahma 's Collections

reading-list

updated 20 days ago

What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT

Paper • 2509.19284 • Published Sep 23, 2025 • 22
Soft Tokens, Hard Truths

Paper • 2509.19170 • Published Sep 23, 2025 • 15
CompLLM: Compression for Long Context Q&A

Paper • 2509.19228 • Published Sep 23, 2025 • 8
Test-Time Scaling in Reasoning Models Is Not Effective for Knowledge-Intensive Tasks Yet

Paper • 2509.06861 • Published Sep 8, 2025 • 8
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 141
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26, 2025 • 79
Running

3.62k

The Ultra-Scale Playbook

🌌

3.62k

The ultimate guide to training LLM on large GPU Clusters
Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs

Paper • 2510.09201 • Published Oct 10, 2025 • 49
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published 27 days ago • 74