flaitenberger/llama3-8b-logic-cot-finetuned-adapter-grpo_rlvr_logic_cot Text Generation • Updated about 19 hours ago • 21
flaitenberger/llama3-8b-logic-cot-finetuned-posttrained-adapter Text Generation • Updated 2 days ago • 13
flaitenberger/llama3-8b-logic-cot-finetuned-adapter-grpo_validity_correctness_logic_cot Text Generation • Updated 4 days ago • 11