Jared Smith

ThePharmer

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

DeepCode: Open Agentic Coding

upvoted an article 6 days ago

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

liked a dataset 6 days ago

Anthropic/AnthropicInterviewer

Organizations

None yet

upvoted a paper 4 days ago

DeepCode: Open Agentic Coding

Paper • 2512.07921 • Published 7 days ago • 21

upvoted an article 6 days ago

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

Article • 7 days ago

upvoted an article 11 days ago

We Got Claude to Fine-Tune an Open Source LLM

Article • 12 days ago • 499

upvoted 2 articles about 1 month ago

Building for an Open Future - our new partnership with Google Cloud

Article • Nov 13

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Article • Nov 3

upvoted 2 papers about 1 month ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5 • 81

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5 • 124

upvoted a paper about 2 months ago

AgentFold: Long-Horizon Web Agents with Proactive Context Management

Paper • 2510.24699 • Published Oct 28 • 68

upvoted 4 articles 2 months ago

Back to The Future: Evaluating AI Agents on Predicting Future Events

Article • Jul 17

MCP for Research: How to Connect AI to Research Tools

Article • Aug 18

Jupyter Agents: training LLMs to reason with notebooks

Article • Sep 10

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Article • Sep 11 • 168

upvoted 2 papers 2 months ago

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3 • 97

LongCodeZip: Compress Long Context for Code Language Models

Paper • 2510.00446 • Published Oct 1 • 108

upvoted 2 articles 2 months ago

Democratizing AI Safety with RiskRubric.ai

Article • Sep 18

Smol2Operator: Post-Training GUI Agents for Computer Use

Article • Sep 23 • 133

upvoted 4 papers 2 months ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30 • 55

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 193

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10 • 661

Multi-Agent Tool-Integrated Policy Optimization

Paper • 2510.04678 • Published Oct 6 • 30
