5 17 100

Neko Ayaka

nekomeowww

https://github.com/nekomeowww

AI & ML interests

Multimodal, LLM, VLM, Robotics, and RL

Recent Activity

updated a model 4 days ago

moeru-ai/sherpa-onnx-streaming-zipformer-ar_en_id_ja_ru_th_vi_zh-2025-02-10

published a model 4 days ago

moeru-ai/sherpa-onnx-streaming-zipformer-ar_en_id_ja_ru_th_vi_zh-2025-02-10

upvoted a paper 27 days ago

DFlash: Block Diffusion for Flash Speculative Decoding

View all activity

Organizations

upvoted 2 papers 27 days ago

DFlash: Block Diffusion for Flash Speculative Decoding

Paper • 2602.06036 • Published Feb 5 • 42

ProAct: Agentic Lookahead in Interactive Environments

Paper • 2602.05327 • Published Feb 5 • 25

upvoted a paper about 1 month ago

D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

Paper • 2510.05684 • Published Oct 7, 2025 • 143

upvoted 2 papers about 2 months ago

FrankenMotion: Part-level Human Motion Generation and Composition

Paper • 2601.10909 • Published Jan 15 • 18

RigMo: Unifying Rig and Motion Learning for Generative Animation

Paper • 2601.06378 • Published Jan 10 • 12

upvoted a paper 5 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 176

upvoted 2 articles 7 months ago

Article

Blazingly fast whisper transcriptions with Inference Endpoints

May 13, 2025

•

Article

Vision Language Models (Better, faster, stronger)

May 12, 2025

•

600

upvoted a paper 10 months ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 67

upvoted a collection 10 months ago

Physical AI

Collection

Collection of open, commercial-grade datasets for physical AI developers • 27 items • Updated 3 days ago • 126

upvoted an article 11 months ago

Article

Cohere on Hugging Face Inference Providers 🔥

Apr 16, 2025

•

129

upvoted a collection 11 months ago

Spaces for Audio / Voices

Collection

540 items • Updated 7 days ago • 32

upvoted 3 articles 12 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12, 2025

•

489

Article

SmolVLM2: Bringing Video Understanding to Every Device

Feb 20, 2025

•

330

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Mar 4, 2025

•

upvoted a collection 12 months ago

InternVL2.5

Collection

Better than InternVL 2.0 • 17 items • Updated 7 days ago • 93

upvoted an article about 1 year ago

Article

Making Browser-Based Inference Actually Usable

Mar 1, 2025

•

Neko Ayaka

AI & ML interests

Recent Activity

Organizations

nekomeowww's activity

Blazingly fast whisper transcriptions with Inference Endpoints

Vision Language Models (Better, faster, stronger)

Cohere on Hugging Face Inference Providers 🔥

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

SmolVLM2: Bringing Video Understanding to Every Device

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Making Browser-Based Inference Actually Usable