TreezzZ/Ferret_ParallelSearch_Qwen3-30b-a3b-instruct_ppo
31B
A framework for using reinforcement learning (RL) to train LLM agents with advanced search capabilities: https://github.com/Tree-Shu-Zhao/ferret