fahrizalfarid's picture

In a Training Loop 🔄

fahrizalfarid

akahana

·

AI & ML interests

NLP

Recent Activity

updated a model 3 days ago

akahana/indo-psikologi-sft

reacted to sagar007's post with 🔥 3 days ago

🚀 I built a Multimodal Vision-Language Model from using Gemma-270M + CLIP! Just finished training my multimodal model on the full LLaVA-Instruct-150K dataset (157K samples) and wanted to share the results! 🔧 What I Built: A vision-language model that can understand images and answer questions about them, combining: - Google Gemma-3-270M (language) - OpenAI CLIP ViT-Large/14 (vision) - LoRA fine-tuning for efficiency 📊 Training Stats: - 157,712 training samples (full LLaVA dataset) - 3 epochs on A100 40GB - ~9 hours training time - Final loss: 1.333 training / 1.430 validation - Only 18.6M trainable params (3.4% of 539M total) 📈 https://huggingface.co/sagar007/multigemma Benchmark Results: - VQA Accuracy: 53.8% - Works great for: animal detection, room identification, scene understanding 🔗 **Try it yourself:** - 🤗 Model: https://huggingface.co/sagar007/multigemma - 🎮 Demo: https://huggingface.co/spaces/sagar007/Multimodal-Gemma - 💻 GitHub: https://github.com/sagar431/multimodal-gemma-270m Built with PyTorch Lightning + MLflow for experiment tracking. Full MLOps pipeline with CI/CD! Would love to hear your feedback! 🙏 #multimodal #gemma #clip #llava #vision-language #pytorch

published a model 6 days ago

akahana/indo-psikologi-sft

View all activity

Organizations

None yet

akahana 's datasets 55

akahana/LLaVA-Instruct-150K

Preview • Updated 10 days ago • 9

akahana/wikipedia-full

Viewer • Updated 30 days ago • 61.6M • 38

akahana/Medical-Reasoning-SFT-GPT-OSS-120B

Viewer • Updated 30 days ago • 200k • 5

akahana/alpaca-gpt4-indonesian

Viewer • Updated 30 days ago • 50k • 27 • 1

akahana/tesis

Preview • Updated Dec 19, 2025

akahana/doodle-blip-captions

Viewer • Updated Dec 18, 2025 • 1k • 5

akahana/pokemon-blip-captions

Viewer • Updated Dec 18, 2025 • 833 • 7

akahana/geo

Updated Dec 16, 2025 • 3

akahana/flickr30k

Updated Dec 16, 2025

akahana/english-indonesia-wikimatrix-token

Viewer • Updated Dec 11, 2025 • 1.02M • 5

akahana/english-indonesia-wikimatrix

Viewer • Updated Dec 9, 2025 • 1.02M • 2

akahana/english-indonesia

Viewer • Updated Dec 9, 2025 • 1M • 13

akahana/ubuntu

Updated Nov 27, 2025 • 2

akahana/anti-spoofing-nuaaaa

Viewer • Updated Jun 4, 2025 • 8.6k • 6

akahana/anti-spoofing-casiafasd

Viewer • Updated Jun 4, 2025 • 4.06k • 4

akahana/hifi-gan

Updated Jun 1, 2025 • 26

akahana/Driver-Drowsiness-Dataset

Viewer • Updated May 14, 2025 • 41.8k • 244 • 2

akahana/mpii-face-gaze

Updated May 12, 2025 • 6

akahana/common-voice-11-eng-sample

Updated May 9, 2025 • 5

akahana/children-codes-stories

Updated Mar 19, 2025 • 14

akahana/vlm

Updated Mar 18, 2025 • 2

akahana/medical

Updated Mar 15, 2025 • 9

akahana/llm-opus-ParaCrawl-english-id-v2

Updated Mar 13, 2025 • 1

akahana/llamacpp

Updated Mar 11, 2025 • 1

akahana/camel-ai-sains

Updated Mar 10, 2025

akahana/big-machine-translations

Updated Mar 9, 2025 • 48

akahana/rocov2-full

Updated Mar 8, 2025 • 1

akahana/dolphin-r1

Viewer • Updated Feb 3, 2025 • 814k

akahana/OpenThoughts-114k

Viewer • Updated Feb 3, 2025 • 114k • 11

akahana/OpenThoughts-114k-math

Viewer • Updated Feb 3, 2025 • 89.1k • 5