AI & ML interests

None defined yet.

Shivansh000 
posted an update 9 days ago
view post
Post
276
Just did my research on the latest Kimi K2 thinking launch from Moonshot AI, and I strongly believe that this moment is a major inflection point in open agentic A ecosystem.

The breakthrough lies in test-time scaling, moving us past constrained generation to long-horizon problem solving agents. They’ve shown capacity for 200-300 sequential tool calling, with preservation of context and reasoning finely, alongside self correction over extended computation. And, remember, this is the worst it’s ever going to be.

We have demonstrably entered the phase of deep, structured cognition, and the ability to perform 23 interleaved reasoning steps to solve a phd-level math problem is a great demonstration of this cognitive depth.

Unsurprisingly, the SOTA benchmarks reinforce this reality. More crucially for the industry, this is an open-weights release.

Thanks to Moonshot team, for providing a new anchor point for the open-AI ecosystem.

moonshotai

moonshotai/Kimi-K2-Thinking

Shivansh000 
posted an update 17 days ago
view post
Post
1972
I am dedicating this weekend to practicing/reading the latest b(ook)log from hugging face. It is meant to be a guide for anyone trying to go from “we have a great dataset and GPUs” to “we built a really strong model.” Will share thoughts upon completion.

Thanks for the treat @eliebak @ThomasWolf and HF team!

HuggingFaceTB/smol-training-playbook
Shivansh000 
posted an update over 1 year ago
view post
Post
1594
My 1st post on 🤗 I would love to discuss topics related to bias in LLMs:

1) Are researchers and enterprises concerned about detecting and addressing social bias in the Gen AI applications? If so, what are the existing approaches?

2) Are there trusted and labeled datasets to evaluate bias in LLM generations?