HriDal/agent-2048-game-qwen-7b-2k-ds Reinforcement Learning • 8B • Updated Apr 1, 2025 • 1 • 1