Quantized-FT-Orca-Math Collection Models trained during quantization aware fine-tuning experiments using PyTorch's FSDP. • 8 items • Updated Aug 20, 2024 • 8
Teaching Large Language Models to Reason with Reinforcement Learning Paper • 2403.04642 • Published Mar 7, 2024 • 50