Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis Paper • 2601.14253 • Published 4 days ago • 5
V-DPM: 4D Video Reconstruction with Dynamic Point Maps Paper • 2601.09499 • Published 10 days ago • 9
UM-Text: A Unified Multimodal Model for Image Understanding Paper • 2601.08321 • Published 12 days ago • 8
ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation Paper • 2601.03955 • Published 17 days ago • 3
FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation Paper • 2512.24724 • Published 25 days ago • 7
Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow Paper • 2512.24766 • Published 24 days ago • 8
What matters for Representation Alignment: Global Information or Spatial Structure? Paper • 2512.10794 • Published Dec 11, 2025 • 9
VUGEN: Visual Understanding priors for GENeration Paper • 2510.06529 • Published Oct 8, 2025 • 1
TV2TV: A Unified Framework for Interleaved Language and Video Generation Paper • 2512.05103 • Published Dec 4, 2025 • 19
ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models Paper • 2512.07843 • Published Nov 24, 2025 • 22
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published Oct 9, 2025 • 37
ARE: Scaling Up Agent Environments and Evaluations Paper • 2509.17158 • Published Sep 21, 2025 • 36
Post 49846 Google drops Gemini 2.0 Flash Thinking, a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more. Now available in anychat, try it out: https://huggingface.co/spaces/akhaliq/anychat
Token-level and sequence-level loss smoothing for RNN language models Paper • 1805.05062 • Published May 14, 2018
Efficient Wait-k Models for Simultaneous Machine Translation Paper • 2005.08595 • Published May 18, 2020
Added Toxicity Mitigation at Inference Time for Multimodal and Massively Multilingual Translation Paper • 2311.06532 • Published Nov 11, 2023
Large Concept Models: Language Modeling in a Sentence Representation Space Paper • 2412.08821 • Published Dec 11, 2024 • 17
Post 48907 QwQ-32B-Preview is now available in anychat. A reasoning model that is competitive with OpenAI o1-mini and o1-preview. Try it out: https://huggingface.co/spaces/akhaliq/anychat
Post 5070 New model drop in anychat: allenai/Llama-3.1-Tulu-3-8B is now available. Try it here: https://huggingface.co/spaces/akhaliq/anychat