lm-course-hw3 A collection of models that were finetuned in hw3 of LLM course in HSE. xiryss/llm-course-hw3-lora Text Generation • 0.3B • Updated Mar 29, 2025 • 11 xiryss/llm-course-hw3-dora Text Generation • 0.3B • Updated Mar 29, 2025 • 2 xiryss/llm-course-hw3-tinyllama-qlora Updated Mar 29, 2025
lm-course-hw2 A collection of models that were trained in hw2 of LLM course in HSE. xiryss/llm-course-hw2-reward-model Text Classification • 0.1B • Updated Mar 9, 2025 • 2 xiryss/llm-course-hw2-dpo Text Generation • 0.1B • Updated Mar 9, 2025 • 1 xiryss/llm-course-hw2-ppo Text Generation • 0.1B • Updated Mar 9, 2025 • 1
lm-course-hw3 A collection of models that were finetuned in hw3 of LLM course in HSE. xiryss/llm-course-hw3-lora Text Generation • 0.3B • Updated Mar 29, 2025 • 11 xiryss/llm-course-hw3-dora Text Generation • 0.3B • Updated Mar 29, 2025 • 2 xiryss/llm-course-hw3-tinyllama-qlora Updated Mar 29, 2025
lm-course-hw2 A collection of models that were trained in hw2 of LLM course in HSE. xiryss/llm-course-hw2-reward-model Text Classification • 0.1B • Updated Mar 9, 2025 • 2 xiryss/llm-course-hw2-dpo Text Generation • 0.1B • Updated Mar 9, 2025 • 1 xiryss/llm-course-hw2-ppo Text Generation • 0.1B • Updated Mar 9, 2025 • 1