7 5 1

Chiwei Zhu

IgnoraZ

Ignoramus0817

AI & ML interests

None yet

Recent Activity

liked a Space about 1 month ago

muset-ai/DeepResearch-Bench-Leaderboard

upvoted a paper about 2 months ago

MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools

authored a paper about 2 months ago

MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools

View all activity

Organizations

liked a Space about 1 month ago

121

DeepResearch Bench

🔍

Display a leaderboard for DeepResearch Bench

upvoted a paper about 2 months ago

MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools

Paper • 2509.09734 • Published Sep 10 • 15

authored a paper about 2 months ago

MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools

Paper • 2509.09734 • Published Sep 10 • 15

New activity in IgnoraZ/llama3_synthquestions_1m 2 months ago

1M in the name means 1 Million tokens context length ?

#2 opened 2 months ago by

kalashshah19

upvoted a paper 4 months ago

Test-Time Scaling with Reflective Generative Model

Paper • 2507.01951 • Published Jul 2 • 106

updated a dataset 5 months ago

IgnoraZ/SynthQuestions

Preview • Updated Jun 25 • 59 • 2

New activity in IgnoraZ/llama3_synthquestions_dpo_100k 5 months ago

Add library name and pipeline tag

#1 opened 5 months ago by

nielsr

New activity in IgnoraZ/llama3_synthquestions_1m 5 months ago

Add pipeline tag and library name

#1 opened 5 months ago by

nielsr

New activity in IgnoraZ/SynthQuestions 5 months ago

Add task category

#1 opened 5 months ago by

nielsr

upvoted 2 papers 5 months ago

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Paper • 2506.11763 • Published Jun 13 • 71

From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding

Paper • 2506.03968 • Published Jun 4 • 15

commented a paper 5 months ago

From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding

Paper • 2506.03968 • Published Jun 4 • 15 •

authored a paper 5 months ago

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Paper • 2506.11763 • Published Jun 13 • 71

updated 2 models 5 months ago

IgnoraZ/llama3_synthquestions_1m

Text Generation • Updated Jun 18 • 2

IgnoraZ/llama3_synthquestions_dpo_100k

Text Generation • 8B • Updated Jun 18 • 6

updated a collection 5 months ago

SynthQuestions

Collection

Data and models for the paper From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding • 4 items • Updated Jun 11

published a model 5 months ago

IgnoraZ/llama3_synthquestions_dpo_100k

Text Generation • 8B • Updated Jun 18 • 6

Chiwei Zhu

AI & ML interests

Recent Activity

Organizations

IgnoraZ's activity

DeepResearch Bench

1M in the name means 1 Million tokens context length ?

Add library name and pipeline tag

Add pipeline tag and library name

Add task category