view article Article LeMaterial: an open source initiative to accelerate materials discovery and research Dec 10, 2024 โข 54
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature Paper โข 2501.07171 โข Published Jan 13 โข 55
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities Paper โข 2407.14482 โข Published Jul 19, 2024 โข 26
swap-uniba/LLaMAntino-3-ANITA-8B-Inst-DPO-ITA Text Generation โข 8B โข Updated Sep 1 โข 1.16k โข โข 29
Running on CPU Upgrade 80 80 Open Ita Llm Leaderboard ๐ Track, rank and evaluate open LLMs in the italian language!
view post Post 8517 Working on a concept GPT-2 (small) that uses KANs instead of MLPs.The ckpt and training code will be soon on the hub. 6 replies ยท ๐ 31 31 ๐ 14 14 ๐ฅ 11 11 ๐คฏ 4 4 โ 4 4 + Reply
Granite 2.0 Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. โข 23 items โข Updated 10 days ago โข 201
Rethinking Interpretability in the Era of Large Language Models Paper โข 2402.01761 โข Published Jan 30, 2024 โข 23
RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture Paper โข 2401.08406 โข Published Jan 16, 2024 โข 37