beyoru's Collections

RP/Storytelling - Luna-I

updated 24 days ago

RP model trained with GRPO

  • beyoru/Luna

    Text Generation • 4B • Updated Sep 26 • 36 • 11

    Note LoRA, trained with GRPO


  • beyoru/Lunaa

    Text Generation • 4B • Updated Sep 27 • 36 • 6

    Note LoRA, trained with GRPO, with reasoning


  • beyoru/Luna-Fusion-RP

    Text Generation • 4B • Updated Oct 26 • 32 • 4

    Note Merge of all the RP models and the base model, using evolution merging


  • beyoru/Luna-7B-A4B

    Text Generation • 7B • Updated 25 days ago • 63 • 1

    Note MoE version, fine-tuned
