Char Level Models My Character level models I trained. Corianas/Microllama_Char_88k_step Text Generation β’ 85.2M β’ Updated Feb 3, 2025 β’ 5 Corianas/Corianas-micro-reactor Text Generation β’ 85.2M β’ Updated Feb 17, 2025 β’ 3 Corianas/Microllama_Char_100k_step Text Generation β’ 85.2M β’ Updated Feb 3, 2025 Corianas/Microllama_Char_300k_step Text Generation β’ 85.2M β’ Updated Feb 3, 2025
Foundational_data TinyGSM: achieving >80% on GSM8k with small language models Paper β’ 2312.09241 β’ Published Dec 14, 2023 β’ 39 TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper β’ 2305.07759 β’ Published May 12, 2023 β’ 38
TinyGSM: achieving >80% on GSM8k with small language models Paper β’ 2312.09241 β’ Published Dec 14, 2023 β’ 39
TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper β’ 2305.07759 β’ Published May 12, 2023 β’ 38
Char Level Models My Character level models I trained. Corianas/Microllama_Char_88k_step Text Generation β’ 85.2M β’ Updated Feb 3, 2025 β’ 5 Corianas/Corianas-micro-reactor Text Generation β’ 85.2M β’ Updated Feb 17, 2025 β’ 3 Corianas/Microllama_Char_100k_step Text Generation β’ 85.2M β’ Updated Feb 3, 2025 Corianas/Microllama_Char_300k_step Text Generation β’ 85.2M β’ Updated Feb 3, 2025
Foundational_data TinyGSM: achieving >80% on GSM8k with small language models Paper β’ 2312.09241 β’ Published Dec 14, 2023 β’ 39 TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper β’ 2305.07759 β’ Published May 12, 2023 β’ 38
TinyGSM: achieving >80% on GSM8k with small language models Paper β’ 2312.09241 β’ Published Dec 14, 2023 β’ 39
TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper β’ 2305.07759 β’ Published May 12, 2023 β’ 38