Unknown Entity

unknownentity

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

liked a model 2 days ago

moonshotai/Kimi-K2-Thinking

liked a model 4 days ago

ByteDance/BindWeave

Organizations

None yet

upvoted a paper 2 days ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published 3 days ago • 160

upvoted a paper 6 days ago

NeuroAda: Activating Each Neuron's Potential for Parameter-Efficient Fine-Tuning

Paper • 2510.18940 • Published 19 days ago • 8

upvoted 2 papers 7 days ago

Surfer 2: The Next Generation of Cross-Platform Computer Use Agents

Paper • 2510.19949 • Published 18 days ago • 36

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published 12 days ago • 90

upvoted a collection 11 days ago

Emu3.5

Collection

Native Multimodal Models are World Learners 🌍 • 3 items • Updated 10 days ago • 68

upvoted a paper 15 days ago

DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion

Paper • 2510.20766 • Published 17 days ago • 34

upvoted a paper 18 days ago

UltraGen: High-Resolution Video Generation with Hierarchical Attention

Paper • 2510.18775 • Published 19 days ago • 16

upvoted a paper 23 days ago

PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning

Paper • 2510.13809 • Published 25 days ago • 36

upvoted 2 papers 4 months ago

4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published Jul 9 • 104

Lumos-1: On Autoregressive Video Generation from a Unified Model Perspective

Paper • 2507.08801 • Published Jul 11 • 30

upvoted a collection 4 months ago

MedGemma Release

Collection

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated Jul 11 • 340

upvoted an article 5 months ago

Gemma 3n fully available in the open-source ecosystem!

Article • Jun 26 • 120

upvoted a paper 5 months ago

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23 • 77

upvoted a paper 6 months ago

In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer

Paper • 2504.20690 • Published Apr 29 • 19

upvoted a paper 8 months ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12 • 73

upvoted an article 8 months ago

Open R1: Update #3

Article • and 9 others • Mar 11 • 295

upvoted a paper 8 months ago

R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning

Paper • 2503.05379 • Published Mar 7 • 38

upvoted a collection 8 months ago

SkyReels-V1

Collection

SkyReels V1 open models collections • 2 items • Updated Feb 17 • 20

upvoted 2 papers 9 months ago

Enhance-A-Video: Better Generated Video for Free

Paper • 2502.07508 • Published Feb 11 • 21

Magic 1-For-1: Generating One Minute Video Clips within One Minute

Paper • 2502.07701 • Published Feb 11 • 35
