PEFT variations
  estnafinema0/llm-course-hw3-dora • Text Generation • 0.3B • Updated Apr 11, 2025 • 1
  estnafinema0/llm-course-hw3-lora • Text Generation • 0.3B • Updated Apr 11, 2025 • 2
  estnafinema0/llm-course-hw3-tinyllama-qlora • Updated Apr 11, 2025
  estnafinema0/llm-course-hw3-tinyllamma-qlora • Updated Apr 11, 2025
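As a minimal sketch of the idea behind the LoRA/DoRA adapters listed above: LoRA keeps the pretrained weight W frozen and learns a low-rank update (alpha/r) * B @ A on top of it. The dimensions and values below are illustrative, not taken from the actual adapters.

```python
import torch

d, r, alpha = 16, 4, 8
W = torch.randn(d, d)          # frozen pretrained weight
A = torch.randn(r, d) * 0.01   # trainable down-projection
B = torch.zeros(d, r)          # trainable up-projection, zero-initialized

x = torch.randn(d)
lora_out = W @ x + (alpha / r) * (B @ (A @ x))

# With B initialized to zero, the adapted layer starts identical to the base layer.
assert torch.allclose(lora_out, W @ x)
```

During fine-tuning only A and B receive gradients, which is why these adapter checkpoints are small relative to the 0.3B base model; DoRA additionally decomposes the weight into magnitude and direction before applying the low-rank update.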
SmolLM Variation: PPO & DPO Fine-Tuning for RLHF
This collection presents fine-tuning of the SmolLM model with two RLHF approaches: DPO and PPO.
  estnafinema0/trainer_output • Text Classification • 0.1B • Updated Mar 30, 2025 • 2
  estnafinema0/smolLM-variation-dpo • Text Generation • 0.1B • Updated Mar 30, 2025 • 2
  estnafinema0/smolLM-variation-ppo • Text Generation • 0.1B • Updated Mar 30, 2025 • 4
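A hedged sketch of the DPO objective behind smolLM-variation-dpo: the loss pushes the policy to increase the margin between chosen and rejected completions relative to a frozen reference model. The log-probabilities below are made-up scalars standing in for per-sequence sums; the actual training used real model outputs.

```python
import torch
import torch.nn.functional as F

beta = 0.1  # illustrative KL-tradeoff strength, not the value used in the collection

# Illustrative per-sequence log-probabilities (policy pi vs. frozen reference).
pi_chosen, pi_rejected = torch.tensor(-10.0), torch.tensor(-14.0)
ref_chosen, ref_rejected = torch.tensor(-11.0), torch.tensor(-13.0)

# DPO loss: -log sigmoid(beta * [(pi_c - ref_c) - (pi_r - ref_r)])
logits = beta * ((pi_chosen - ref_chosen) - (pi_rejected - ref_rejected))
loss = -F.logsigmoid(logits)
```

Unlike PPO, this requires no separate reward model or on-policy sampling loop, which is the main practical difference between the two checkpoints in this collection.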
NER Extraction: An Active Learning Approach
  estnafinema0/active-learning-nerc-models-kfold • Updated Apr 4, 2025
  estnafinema0/nerc-extraction • 0.1B • Updated Apr 4, 2025 • 3
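A minimal sketch of uncertainty-based query selection, one common active-learning strategy for NER annotation; the collection's k-fold repo name suggests an ensemble variant, but the exact selection rule is an assumption here. The probabilities are randomly generated stand-ins for model predictions.

```python
import numpy as np

rng = np.random.default_rng(0)
probs = rng.dirichlet(np.ones(5), size=100)  # fake per-example class probabilities

# Least-confidence strategy: queue the examples the model is least sure about.
confidence = probs.max(axis=1)
query_idx = np.argsort(confidence)[:10]      # 10 most uncertain examples to label
```

Labeling only the queried examples each round, then retraining, is what lets an active-learning loop reach a target NER score with far fewer annotations than labeling the pool uniformly.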