ZiangZhang's picture

ZiangZhang

Viglong

·

AI & ML interests

None yet

Recent Activity

liked a Space 21 days ago

HuggingFaceFW/blogpost-fineweb-v1

liked a Space 21 days ago

nanotron/ultrascale-playbook

liked a model 21 days ago

HuggingFaceTB/SmolLM3-3B

View all activity

Organizations

None yet

authored 8 papers about 1 month ago

FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion

Paper • 2405.04883 • Published May 8, 2024

OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces

Paper • 2407.11895 • Published Jul 16, 2024 • 7

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 50

MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization

Paper • 2410.12957 • Published Oct 16, 2024 • 9

Depth Anything with Any Prior

Paper • 2505.10565 • Published May 15 • 12

OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup

Paper • 2410.21269 • Published Oct 28, 2024

APO: Enhancing Reasoning Ability of MLLMs via Asymmetric Policy Optimization

Paper • 2506.21655 • Published Jun 26

DSI-Bench: A Benchmark for Dynamic Spatial Intelligence

Paper • 2510.18873 • Published Oct 21 • 8