Swanish Realm's picture

71

Swanish Realm

swanishrealm

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Emu3.5: Native Multimodal Models are World Learners

upvoted a paper 10 days ago

Continuous Autoregressive Language Models

upvoted a paper 10 days ago

Diffusion Language Models are Super Data Learners

View all activity

Organizations

None yet

upvoted a paper 9 days ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published 17 days ago • 103

upvoted 2 papers 10 days ago

Continuous Autoregressive Language Models

Paper • 2510.27688 • Published 16 days ago • 64

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published 11 days ago • 113

upvoted a paper 15 days ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published 17 days ago • 105

upvoted a paper 20 days ago

olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

Paper • 2502.18443 • Published Feb 25 • 9

upvoted a paper 22 days ago

Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis

Paper • 2411.01156 • Published Nov 2, 2024 • 11

upvoted 3 papers 29 days ago

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published about 1 month ago • 102

BitNet Distillation

Paper • 2510.13998 • Published Oct 15 • 53

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published about 1 month ago • 90

upvoted 8 papers about 1 month ago

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published Oct 9 • 44

LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

Paper • 2509.23661 • Published Sep 28 • 44

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13 • 173

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13 • 162

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7 • 101

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 475

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1 • 88

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30 • 531

upvoted 3 papers about 2 months ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26 • 133

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 131

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26 • 182