-
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
Paper • 2402.14905 • Published • 134 -
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 625 -
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Paper • 2403.09611 • Published • 129 -
Jamba: A Hybrid Transformer-Mamba Language Model
Paper • 2403.19887 • Published • 111
박지연
ella0106
AI & ML interests
None yet
Recent Activity
updated
a model
about 1 month ago
ella0106/icp-video
published
a model
about 1 month ago
ella0106/icp-video
updated
a collection
over 1 year ago
interests
Organizations
None yet