18 20 52

Alexander Kovrigin

waleko

https://alexkovrigin.me

waleko

AI & ML interests

AI for Code

Recent Activity

liked a dataset 6 days ago

jxie/stl10

upvoted a paper 21 days ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

upvoted a paper about 1 month ago

The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management

View all activity

Organizations

liked a dataset 6 days ago

jxie/stl10

Viewer • Updated Aug 10, 2023 • 123k • 488 • 2

upvoted a paper 21 days ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

Paper • 2510.23393 • Published 22 days ago • 20

upvoted a paper about 1 month ago

The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management

Paper • 2508.21433 • Published Aug 29 • 7

upvoted a collection about 1 month ago

🦫 PIPer

Collection

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1 • 2

liked a model about 1 month ago

agentica-org/DeepSWE-Preview

Text Generation • 33B • Updated Jul 3 • 1.08k • • 186

upvoted a paper about 2 months ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1 • 88

commented a paper about 2 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29 • 36 •

updated 2 datasets about 2 months ago

JetBrains-Research/PIPer-SFT-2500-sharegpt

Viewer • Updated Oct 2 • 2.5k • 45 • 1

JetBrains-Research/PIPer-envbench-zeroshot-rl

Viewer • Updated Oct 2 • 742 • 86 • 1

upvoted a paper about 2 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29 • 36

authored a paper about 2 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29 • 36

liked a model about 2 months ago

JetBrains-Research/PIPer-8B

Text Generation • 8B • Updated Oct 1 • 16 • 2

updated a dataset about 2 months ago

JetBrains-Research/PIPer-eval

Preview • Updated Sep 30 • 44

updated 4 models about 2 months ago

published a model about 2 months ago

waleko/latent-diffusion-autoencoder-128

Updated Sep 27

Alexander Kovrigin

AI & ML interests

Recent Activity

Organizations

waleko's activity