BAT: Learning to Reason about Spatial Sounds with Large Language Models Paper • 2402.01591 • Published Feb 2, 2024 • 1
AMO Sampler: Enhancing Text Rendering with Overshooting Paper • 2411.19415 • Published Nov 28, 2024 • 5
On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating Paper • 2505.10860 • Published May 16, 2025 • 1
Memory-Efficient LLM Training with Online Subspace Descent Paper • 2408.12857 • Published Aug 23, 2024 • 16
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations Paper • 2410.10792 • Published Oct 14, 2024 • 31
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17, 2024 • 54