AI & ML interests

Waifu and Husbando Research

Recent Activity

lunarfluΒ 
posted an update 10 days ago
lunarfluΒ 
posted an update 10 days ago
view post
Post
353
The new King πŸ‘‘has arrived!

Moonshot AI now the top model on Hugging Face πŸ”₯
moonshotai/Kimi-K2-Thinking
lunarfluΒ 
posted an update 10 days ago
view post
Post
2566
πŸ’ΈπŸ€‘You don’t need 100 GPUs to train something amazing!

Our Smol Training Playbook teaches you a better path to world-class LLMs, for free!

Check out the #1 trending space on πŸ€— :
HuggingFaceTB/smol-training-playbook
narugo1992Β 
posted an update 13 days ago
view post
Post
1032
Org Rate Limits = Free DDoS Invitation? 🀑
One serious question: Is there any way to actually ban clowns abusing this system?
Right now all it takes is one bored script kiddie with a grudge (or too much caffeine) to lawnmower an entire org's API endpoints into the stone age. They get to bathe in 429s while we're sitting here like 🀑 "Gee I wonder whose IP is carpet-bombing us today!"
The kicker? Zero accountability. Zero fingerprints. Just vibesβ„’ and chaos. It’s basically a public invitation to hold entire communities hostage while wearing pajamas.
"Come for the open-source collaboration, stay for the unhinged DDoS piΓ±ata party!" πŸŽ‰
Fix when?
  • 2 replies
Β·
s3nhΒ 
posted an update about 1 month ago
view post
Post
501
Eduhelp with more empathy, based on model finetuned on
psychotheraputic preferences just landed on


Beck-8B as a base model, 13000 steps on educational dataset.
Time to go further and build more πŸ₯°
s3nh/EduHelp_Beck_8B
Thanks to @basilic_ai for computations <3
s3nhΒ 
posted an update about 1 month ago
view post
Post
4072
Just tried to create an educational assistant for younger people who can struggle with visualsation of 'what is this sorcery all about'.
Its first step of my spare time projects, sft on Qwen3-8B,

EduHelper is a child-friendly tutoring assistant fine-tuned from the Qwen3-8B base model using parameter-efficient fine-tuning (PEFT) with LoRA on the ajibawa-2023/Education-Young-Children dataset.

s3nh/EduHelp-8B

Glad to share my work, have a wonderful day!
  • 2 replies
Β·
lunarfluΒ 
posted an update about 1 month ago
view post
Post
2249
Cool stuff these past weeks on huggingface! πŸ€— πŸš€ !
β€’ πŸ“ˆTrackio, local-first W&B alternative
https://github.com/gradio-app/trackio/issues
β€’ 🌍EmbeddingGemma, 300M-param, multilingual embeddings, on-device
https://huggingface.co/blog/embeddinggemma
β€’ πŸ’»Open LLMs in VS Code (Inference Providers)
https://x.com/reach_vb/status/1966185427582497171
β€’ πŸ€–Smol2Operator GUI agents
https://huggingface.co/blog/smol2operator
β€’ πŸ–ΌοΈGradio visible watermarking
https://huggingface.co/blog/watermarking-with-gradio
ajibawa-2023Β 
posted an update 7 months ago
view post
Post
4514
Hi All, I recently released two Audio datasets which are generated using my earlier released dataset: ajibawa-2023/Children-Stories-Collection

First Audio Dataset:https://huggingface.co/datasets/ajibawa-2023/Audio-Children-Stories-Collection-Large has 5600++ stories in .mp3 format.

Second Audio Dataset:https://huggingface.co/datasets/ajibawa-2023/Audio-Children-Stories-Collection has 600 stories in .mp3 format.
Β·
s3nhΒ 
posted an update 11 months ago
view post
Post
2539
Welcome back,

Small Language Models Enthusiasts and GPU Poor oss enjoyers lets connect.
Just created an organization which main target is to have fun with smaller models tuneable on consumer range GPUs, feel free to join and lets have some fun, much love ;3

SmolTuners
Β·
lunarfluΒ 
posted an update 12 months ago
ajibawa-2023Β 
posted an update about 1 year ago
view post
Post
3846
New Dataset: Software-Architecture
Link: ajibawa-2023/Software-Architecture

I am releasing a Large Dataset covering topics related to Software-Architecture. This dataset consists of around 450,000 lines of data in jsonl.

I have included following topics:

Architectural Frameworks

Architectural Patterns for Reliability

Architectural Patterns for Scalability

Architectural Patterns

Architectural Quality Attributes

Architectural Testing

Architectural Views

Architectural Decision-Making

Advanced Research

Cloud-Based Architectures

Component-Based Architecture

Data Architecture

Emerging Trends

Event-Driven Architecture

Evolvability and Maintainability

Microservices and Monolithic

Microservices Architecture

Security Architecture

Service-Oriented Architecture

Software Design Principles

and Many More!

This dataset is useful in LLM development. Also those who are working on developing Software development related LLMs then this dataset can be useful.

This dataset is very useful to Researchers as well.
Β·
lunarfluΒ 
posted an update about 1 year ago
lunarfluΒ 
posted an update over 1 year ago
view post
Post
2009
Cool things this week from @huggingface !

🌎AI math olympiad winner NuminaMath is here!
πŸ€—Announcing New Hugging Face and Keras NLP integration
✨UI overhaul to HF tokens!
🧊 Embed our dataset viewer on any webpage!

https://huggingface.co/blog/winning-aimo-progress-prize
https://huggingface.co/blog/keras-nlp-integration
https://huggingface.co/settings/tokens
https://x.com/julien_c/status/1812099420726456457

Check out the full list on our discord! πŸ‘‡
https://discord.com/invite/JfAtkvEtRb
lunarfluΒ 
posted an update over 1 year ago
view post
Post
2409
By popular demand, HF activity tracker v1.0 is here! πŸ“Š let's build it together!πŸ€—

Lots of things to improve, feel free to open PRs in the community tab!

good PR ideas:
- track more types of actions that include date+time
- bigger plot
- track discord activity too 🀯
- link github? ⚑

https://huggingface.co/spaces/huggingface-projects/LevelBot
  • 2 replies
Β·
lunarfluΒ 
posted an update over 1 year ago
view post
Post
2028
Weekly highlights for the HF ecosystem!

πŸš€ Phi 3
πŸ¦… Falcon VLM
πŸ€— sentence-transformers v3.0 is here! Train and finetune embedding models with multi-GPU training, bf16 support, loss logging, callbacks and more!
πŸ₯³ Gradio launch event 6/6! We're launching 1.0 versions of two new libraries, Python + JS client libraries to programmatically query Gradio apps, and several new features making it easier to use Gradio apps in production!
✨ Tools now available in HuggingChat! Use any AI apps built by the community! πŸ”₯
🧊 ML for 3D Course Unit 3 is here! Covering Gaussian splatting, how it fits in the generative 3D pipeline, and hands-on code to build your own demo!

See the full list here!
https://discord.com/channels/879548962464493619/897387888663232554/1245036889539612764 !
  • 2 replies
Β·
lunarfluΒ 
posted an update over 1 year ago
view post
Post
2116
cooking up something....anyone interested in a daily activity tracker for HF?
Β·
lunarfluΒ 
posted an update over 1 year ago
ajibawa-2023Β 
posted an update over 1 year ago
view post
Post
2208
Thank you very much hf team for accepting me! I was waiting for very long time. Thank you