Aleksei Dorkin PRO
adorkin
AI & ML interests
Computational Linguistics
Recent Activity
liked a model about 20 hours ago: zai-org/GLM-OCR
updated a collection about 22 hours ago: Code SFT Datasets
Multilingual Text Embedding Models
tencent/KaLM-Embedding-Gemma3-12B-2511
Sentence Similarity • 12B • Updated • 6.82k • 75
nvidia/llama-embed-nemotron-8b
Feature Extraction • 8B • Updated • 42.8k • 137
Qwen/Qwen3-Embedding-8B
Feature Extraction • 8B • Updated • 1.83M • 574
Qwen/Qwen3-Embedding-4B
Feature Extraction • 4B • Updated • 525k • 221
Code SFT Datasets
spaces
6
NLI Zero Shot Classification 🔍
Running • 1 • Zero-shot classification based on natural language inference
GliLem 🤓
Sleeping • 2 • Lemmatization disambiguation for Estonian with GliNER
SigLIP2 + Clothes 🤔
Running • Text-to-image clothing search using SigLIP2
M-CLIP + Clothes 🦀
Sleeping • 1 • Text-to-image clothing search using multilingual CLIP
Tweet Emoji Predictor 🧐
Sleeping • 1 • Predict an emoji for your tweet (...your X?)
Sõnajaht Demo 🐠
Sleeping • Cross-lingual reverse dictionary
datasets
15
adorkin/tulu-3-sft-mixture
Viewer • Updated • 939k • 7
adorkin/extended_tweet_emojis
Viewer • Updated • 52.7k • 88 • 3
adorkin/cosmopedia-v2-translate-append-instructions-et
Viewer • Updated • 6.85k • 8
adorkin/flan-v2-converted-en
Viewer • Updated • 58.2k • 8
adorkin/mala-bilingual-et-en-scores
Viewer • Updated • 50.9M • 31
adorkin/dclm-sample-13k-en-et-translation
Viewer • Updated • 13.7k • 7
adorkin/nllb-et-en-scores
Viewer • Updated • 22M • 24
adorkin/Magpie-Llama-3.1-Pro-300K-Filtered-18K-sample-et
Viewer • Updated • 36.6k • 3 • 1
adorkin/general-instruction-augmented-corpora
Viewer • Updated • 20M • 79 • 1
adorkin/dbpedia-entity-est
Viewer • Updated • 4.69M • 27