xlalex's Collections
encoder
data
svg
video
interleaved
ocr
3d
world model
omni
infra
synthesis
perception
survey
RL
critic
speech full duplex
agent
self-play
world model
Updated 15 days ago
Emu3.5: Native Multimodal Models are World Learners
Paper • 2510.26583 • Published 19 days ago • 103