Full Name PRO

Gatozu35

AI & ML interests

Text-to-Speech, Voice Conversion

Recent Activity

liked a Space 1 day ago

Qwen/Qwen3-TTS

liked a model 3 days ago

frothywater/kanade-12.5hz

liked a model 3 days ago

ayousanz/cosy-voice3-onnx

Organizations

upvoted a paper 12 days ago

Continuous Audio Language Models

Paper • 2509.06926 • Published Sep 8, 2025 • 2

upvoted 2 articles 2 months ago

Article

Fish Speech V1 - New Multilingual Open Source TTS Model

May 3, 2024

Article

Text-to-image Architectural Experiments

Nov 13, 2025

upvoted 11 papers 3 months ago

MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion

Paper • 2502.04235 • Published Feb 6, 2025 • 23

Heptapod: Language Modeling on Visual Signals

Paper • 2510.06673 • Published Oct 8, 2025 • 5

Memory Retrieval and Consolidation in Large Language Models through Function Tokens

Paper • 2510.08203 • Published Oct 9, 2025 • 10

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Paper • 2511.02779 • Published Nov 4, 2025 • 59

upvoted 2 papers 4 months ago

Compressed Convolutional Attention: Efficient Attention in a Compressed Latent Space

Paper • 2510.04476 • Published Oct 6, 2025 • 16

FCPE: A Fast Context-based Pitch Estimation Model

Paper • 2509.15140 • Published Sep 18, 2025 • 8

upvoted a collection 5 months ago

DINOv3

Collection

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21, 2025 • 464

upvoted a collection 6 months ago

Deep Ignorance

Collection

This collection contains the model and data artifacts from O'Brien et al. (2025). https://deepignorance.ai • 44 items • Updated Dec 17, 2025 • 10

upvoted a collection 7 months ago

H-Net

Collection

The family of hierarchical networks (H-Nets) from https://arxiv.org/abs/2507.07955 • 8 items • Updated Jul 11, 2025 • 20

upvoted a paper 9 months ago

RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale

Paper • 2505.03005 • Published May 5, 2025 • 36
