A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret
Shu Zhao
TreezzZ
Recent Activity
Updated a collection about 1 month ago: Ferret
Updated a model about 1 month ago: TreezzZ/Ferret_Search-R1_Qwen2.5-14b-instruct_ppo
Published a model about 1 month ago: TreezzZ/Ferret_Search-R1_Qwen2.5-14b-instruct_ppo