In a Training Loop
38.7 TFLOPS
fahrizalfarid
akahana
11 followers · 49 following
AI & ML interests
NLP
Recent Activity
Updated a model 3 days ago: akahana/indo-psikologi-sft
Reacted to sagar007's post 3 days ago:
I built a Multimodal Vision-Language Model using Gemma-270M + CLIP! I just finished training my multimodal model on the full LLaVA-Instruct-150K dataset (157K samples) and wanted to share the results!

What I Built:
A vision-language model that can understand images and answer questions about them, combining:
- Google Gemma-3-270M (language)
- OpenAI CLIP ViT-Large/14 (vision)
- LoRA fine-tuning for efficiency

Training Stats:
- 157,712 training samples (full LLaVA dataset)
- 3 epochs on A100 40GB
- ~9 hours training time
- Final loss: 1.333 training / 1.430 validation
- Only 18.6M trainable params (3.4% of 539M total)

Benchmark Results:
- VQA Accuracy: 53.8%
- Works great for: animal detection, room identification, scene understanding

**Try it yourself:**
- Model: https://huggingface.co/sagar007/multigemma
- Demo: https://huggingface.co/spaces/sagar007/Multimodal-Gemma
- GitHub: https://github.com/sagar431/multimodal-gemma-270m

Built with PyTorch Lightning + MLflow for experiment tracking. Full MLOps pipeline with CI/CD!

Would love to hear your feedback!

#multimodal #gemma #clip #llava #vision-language #pytorch
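The training figures quoted in the post can be cross-checked with a little arithmetic. The sketch below is not from the post's author; it only plugs the quoted numbers into two hypothetical helper functions (`trainable_fraction` and `samples_per_second`, both named here for illustration). The throughput figure is a rough estimate, since the post gives the training time only as "~9 hours".

```python
# Figures quoted in the post. TRAIN_HOURS is approximate ("~9 hours"),
# so the derived throughput is only a rough estimate.
TRAINABLE_PARAMS = 18.6e6   # LoRA-trainable parameters
TOTAL_PARAMS = 539e6        # total parameters
SAMPLES = 157_712           # training samples per epoch
EPOCHS = 3
TRAIN_HOURS = 9.0


def trainable_fraction(trainable: float, total: float) -> float:
    """Share of all parameters that receive gradient updates."""
    return trainable / total


def samples_per_second(samples: int, epochs: int, hours: float) -> float:
    """Average number of samples processed per second over the whole run."""
    return samples * epochs / (hours * 3600)


fraction = trainable_fraction(TRAINABLE_PARAMS, TOTAL_PARAMS)
throughput = samples_per_second(SAMPLES, EPOCHS, TRAIN_HOURS)

print(f"trainable share: {fraction:.2%}")                # 3.45%
print(f"approx. throughput: {throughput:.1f} samples/s")  # 14.6 samples/s
```

The computed share of about 3.45% is consistent with the "3.4%" stated in the post, and the three epochs amount to 473,136 sample passes in total.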
Published a model 6 days ago: akahana/indo-psikologi-sft
Organizations
None yet
akahana's datasets (55)
akahana/LLaVA-Instruct-150K • Preview • Updated 10 days ago • 9
akahana/wikipedia-full • Viewer • Updated 30 days ago • 61.6M • 38
akahana/Medical-Reasoning-SFT-GPT-OSS-120B • Viewer • Updated 30 days ago • 200k • 5
akahana/alpaca-gpt4-indonesian • Viewer • Updated 30 days ago • 50k • 27 • 1
akahana/tesis • Preview • Updated Dec 19, 2025
akahana/doodle-blip-captions • Viewer • Updated Dec 18, 2025 • 1k • 5
akahana/pokemon-blip-captions • Viewer • Updated Dec 18, 2025 • 833 • 7
akahana/geo • Updated Dec 16, 2025 • 3
akahana/flickr30k • Updated Dec 16, 2025
akahana/english-indonesia-wikimatrix-token • Viewer • Updated Dec 11, 2025 • 1.02M • 5
akahana/english-indonesia-wikimatrix • Viewer • Updated Dec 9, 2025 • 1.02M • 2
akahana/english-indonesia • Viewer • Updated Dec 9, 2025 • 1M • 13
akahana/ubuntu • Updated Nov 27, 2025 • 2
akahana/anti-spoofing-nuaaaa • Viewer • Updated Jun 4, 2025 • 8.6k • 6
akahana/anti-spoofing-casiafasd • Viewer • Updated Jun 4, 2025 • 4.06k • 4
akahana/hifi-gan • Updated Jun 1, 2025 • 26
akahana/Driver-Drowsiness-Dataset • Viewer • Updated May 14, 2025 • 41.8k • 244 • 2
akahana/mpii-face-gaze • Updated May 12, 2025 • 6
akahana/common-voice-11-eng-sample • Updated May 9, 2025 • 5
akahana/children-codes-stories • Updated Mar 19, 2025 • 14
akahana/vlm • Updated Mar 18, 2025 • 2
akahana/medical • Updated Mar 15, 2025 • 9
akahana/llm-opus-ParaCrawl-english-id-v2 • Updated Mar 13, 2025 • 1
akahana/llamacpp • Updated Mar 11, 2025 • 1
akahana/camel-ai-sains • Updated Mar 10, 2025
akahana/big-machine-translations • Updated Mar 9, 2025 • 48
akahana/rocov2-full • Updated Mar 8, 2025 • 1
akahana/dolphin-r1 • Viewer • Updated Feb 3, 2025 • 814k
akahana/OpenThoughts-114k • Viewer • Updated Feb 3, 2025 • 114k • 11
akahana/OpenThoughts-114k-math • Viewer • Updated Feb 3, 2025 • 89.1k • 5