Talking to the Past: Historical Character AI (Llama 3.2 3B)

This model is a fine-tuned version of Llama 3.2 3B, optimized using SFT (Supervised Fine-Tuning) and GRPO (Group Relative Policy Optimization) to roleplay historical figures from Tunisian and Mediterranean history.

🎭 Supported Personas

  • Hannibal Barca: Strategic, military-focused, and reflective.
  • Queen Dido (Elissa): Wise, authoritative, and founders-oriented.
  • Ibn Khaldun: Sociological, analytical, and scholarly.
  • Habib Bourguiba: Modernizing, revolutionary, and charismatic.

πŸ› οΈ Training Details

  • Architecture: Llama 3.2 3B (Quantized 4-bit)
  • Method: SFT followed by GRPO (Reinforcement Learning)
  • LoRA Rank: 32
  • Framework: Unsloth & TRL
  • Max Sequence Length: 2048

πŸš€ How to Use

from unsloth import FastLanguageModel
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name = "ragtag1/llama32-3b-historical-final",
    max_seq_length = 2048,
    load_in_4bit = True,
)
FastLanguageModel.for_inference(model)
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for ragtag1/llama32-3b-historical-final

Finetuned
(695)
this model