Shivansh Chaudhary

Shivansh000

Recent Activity

posted an update 3 days ago

Just did my research on the latest Kimi K2 Thinking launch from Moonshot AI, and I strongly believe this is a major inflection point for the open agentic AI ecosystem.

The breakthrough lies in test-time scaling, moving us past constrained generation to long-horizon problem-solving agents. They’ve shown a capacity for 200-300 sequential tool calls while carefully preserving context and reasoning, alongside self-correction over extended computation. And, remember, this is the worst it’s ever going to be.

We have demonstrably entered the phase of deep, structured cognition, and the ability to perform 23 interleaved reasoning steps to solve a PhD-level math problem is a great demonstration of this cognitive depth.

Unsurprisingly, the SOTA benchmarks reinforce this reality. More crucially for the industry, this is an open-weights release.

Thanks to the Moonshot team for providing a new anchor point for the open AI ecosystem.

https://huggingface.co/moonshotai
https://huggingface.co/moonshotai/Kimi-K2-Thinking

posted an update 10 days ago

I am dedicating this weekend to reading and practicing with the latest b(ook)log from Hugging Face. It is meant to be a guide for anyone trying to go from “we have a great dataset and GPUs” to “we built a really strong model.” Will share thoughts upon completion. Thanks for the treat, @eliebak, @ThomasWolf, and the HF team! https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook

reacted to their post with 🤗 over 1 year ago

My 1st post on 🤗! I would love to discuss topics related to bias in LLMs: 1) Are researchers and enterprises concerned about detecting and addressing social bias in Gen AI applications? If so, what are the existing approaches? 2) Are there trusted, labeled datasets for evaluating bias in LLM generations?
