The Trinity of Consistency as a Defining Principle for General World Models Paper • 2602.23152 • Published 13 days ago • 196
DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels Paper • 2602.11715 • Published 27 days ago • 6
DICE Collection A series of diffusion language models tailored for CUDA kernel generation. • 4 items • Updated 27 days ago • 3
OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models Paper • 2511.14582 • Published Nov 18, 2025 • 19
MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding Paper • 2510.23479 • Published Oct 27, 2025 • 18
OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot Paper • 2510.06751 • Published Oct 8, 2025 • 22
HoliTom: Holistic Token Merging for Fast Video Large Language Models Paper • 2505.21334 • Published May 27, 2025 • 21