AdityaaXD/Multi-Agent_Reinforcement_Learning_Trading_System_Models Reinforcement Learning • Updated 21 days ago • 204 • 4
webxos/microclaw-for-openclaw-version-2026.2.17 Text Generation • Updated about 18 hours ago • 206 • 2
NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos-GGUF Reinforcement Learning • 8B • Updated May 5, 2025 • 45 • 4
JonusNattapong/Reinforcement-Learning-for-Gold-Trading-Model Reinforcement Learning • Updated Dec 23, 2025 • 17 • 4
LightningRodLabs/future-as-label-paper-step160 Reinforcement Learning • 33B • Updated Jan 16 • 58 • 4
NurseCitizenDeveloper/NurseSim-Triage-Llama-3.2-3B Reinforcement Learning • 3B • Updated 11 days ago • 25 • 1