1 20 1

taewoongkang

Keh0t0

keh0t0

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

PHUMA: Physically-Grounded Humanoid Locomotion Dataset

upvoted a paper 1 day ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

submitted a paper 1 day ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

View all activity

Organizations

None yet

upvoted 2 papers 1 day ago

PHUMA: Physically-Grounded Humanoid Locomotion Dataset

Paper • 2510.26236 • Published Oct 30 • 29

EgoX: Egocentric Video Generation from a Single Exocentric Video

Paper • 2512.08269 • Published 8 days ago • 73

submitted a paper to Daily Papers 1 day ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

Paper • 2512.08269 • Published 8 days ago • 73

authored a paper 5 days ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

Paper • 2512.08269 • Published 8 days ago • 73

upvoted 2 papers 5 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 15 days ago • 213

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published 19 days ago • 199

upvoted a paper about 2 months ago

ACG: Action Coherence Guidance for Flow-based VLA models

Paper • 2510.22201 • Published Oct 25 • 36

liked a dataset about 2 months ago

InternRobotics/OmniWorld

Viewer • Updated 8 days ago • 5.54B • 30.5k • 74

upvoted a paper about 2 months ago

RL makes MLLMs see better than SFT

Paper • 2510.16333 • Published Oct 18 • 48

upvoted 3 papers 2 months ago

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published Sep 11 • 240

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 497

Hybrid Architectures for Language Models: Systematic Analysis and Design Insights

Paper • 2510.04800 • Published Oct 6 • 36

upvoted a paper 3 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 193

upvoted a paper 5 months ago

DesignLab: Designing Slides Through Iterative Detection and Correction

Paper • 2507.17202 • Published Jul 23 • 50

upvoted a paper 8 months ago

SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation

Paper • 2504.14396 • Published Apr 19 • 27

upvoted 5 papers 11 months ago

VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control

Paper • 2501.01427 • Published Jan 2 • 54

Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Paper • 2501.01423 • Published Jan 2 • 44

LTX-Video: Realtime Video Latent Diffusion

Paper • 2501.00103 • Published Dec 30, 2024 • 47

GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking

Paper • 2501.02690 • Published Jan 5 • 17

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Paper • 2501.02976 • Published Jan 6 • 56

taewoongkang

AI & ML interests

Recent Activity

Organizations

Keh0t0's activity