Collection of models from the third LLM course homework. It containes three LLMs fine-tuned using LoRA, QLoRA, and DoRA.
Sergey Pankevich
spankevich
AI & ML interests
None yet
Organizations
None yet
models
9
spankevich/output
1.26M
•
Updated
•
9
spankevich/llm-course-hw3-tinyllamma-qlora
Updated
spankevich/llm-course-hw3-dora
Text Generation
•
0.3B
•
Updated
•
16
spankevich/llm-course-hw3-lora
Text Generation
•
0.3B
•
Updated
•
16
spankevich/llm-hw-2-ppo
Text Generation
•
0.1B
•
Updated
•
11
spankevich/trainer_output
Text Classification
•
0.1B
•
Updated
•
17
spankevich/llm-hw-2-dpo
Text Generation
•
0.1B
•
Updated
•
7
spankevich/llm-hw-2
Updated
spankevich/llm-course-hw1
Text Generation
•
Updated
•
18
datasets
0
None public yet