In a Training Loop 🔄

Andrey Bochkov
  • E6E831728
  • AVBochkov
  • andreybochkov

AI & ML interests

None yet

Recent Activity

reacted to sergiopaniego's post with 🔥 16 days ago
New REPL environment in OpenEnv available! ✨ Used in the Recursive Language Models (RLM) paper by Alex Zhang. Ready for inference & post-training using trajectories. Handles long contexts:
> Run Python code in a sandbox
> Make recursive calls to LMs
> Explore data programmatically
> Return final result
Docs: https://meta-pytorch.org/OpenEnv/environments/repl/
Inference script: https://github.com/meta-pytorch/OpenEnv/blob/main/examples/repl_oolong_simple.py
upvoted a paper 19 days ago
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
updated a model 20 days ago
Bochkov/growing-transformers-model-frozen-16-bit-baseline-monolyth-181m

Organizations

None yet

authored 2 papers 7 months ago

Growing Transformers: Modular Composition and Layer-wise Expansion on a Frozen Substrate

Paper • 2507.07129 • Published Jul 8, 2025 • 3

Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations

Paper • 2507.04886 • Published Jul 7, 2025 • 3