GLM-4.5-Air-CPT-LoRA-v2-HyperSwitch

A LoRA adapter for zai-org/GLM-4.5-Air, fine-tuned on the Hyperswitch Rust codebase. It is intended to improve the base model's handling of payment processing patterns, Hyperswitch architecture, and Rust development practices.

🎯 Model Description

This LoRA adapter was trained on 16,731 samples extracted from the Hyperswitch codebase to enhance code understanding, explanation, and generation within the payment processing domain.

  • Base Model: zai-org/GLM-4.5-Air
  • Training Type: Causal Language Modeling (CLM) with LoRA
  • Domain: Payment Processing, Rust Development
  • Specialization: Hyperswitch codebase patterns and architecture

📊 Training Details

Dataset Composition

  • Total Samples: 16,731
    • File-level samples: 2,120 complete files
    • Granular samples: 14,611 extracted components
      • Functions: 4,121
      • Structs: 5,710
      • Traits: 223
      • Implementations: 4,296
      • Modules: 261
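The counts above can be cross-checked with a few lines of Python. This is only a sanity check on the published figures; the variable names are illustrative and not part of any training script.

```python
# Sample counts copied from the dataset composition listed above.
FILE_LEVEL = 2_120
GRANULAR = {
    "Functions": 4_121,
    "Structs": 5_710,
    "Traits": 223,
    "Implementations": 4_296,
    "Modules": 261,
}

granular_total = sum(GRANULAR.values())
total = FILE_LEVEL + granular_total

# The component counts should add up to the stated subtotals.
assert granular_total == 14_611
assert total == 16_731

# Share of each component type among the granular samples.
for name, count in GRANULAR.items():
    print(f"{name:<16}{count:>6}  {count / granular_total:6.1%}")
```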

LoRA Configuration

r: 64                   # LoRA rank
alpha: 128              # LoRA alpha (2*r)
dropout: 0.05           # LoRA dropout
target_modules:
  - "q_proj"
  - "k_proj"
  - "v_proj"
  - "o_proj"

exclude_modules:
  - "block_sparse_moe"
  - "w1"
  - "w2"
  - "w3"
  - "gate"
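For readers who work with the PEFT library directly, the configuration above might translate roughly as follows. This is a sketch under assumptions: the mapping of `alpha` and `dropout` to PEFT's `lora_alpha` and `lora_dropout` is inferred, the `exclude_modules` argument requires a recent PEFT release, and the helper names and layer dimensions are hypothetical.

```python
# Keyword arguments mirroring the YAML above (key mapping is an assumption).
LORA_KWARGS = {
    "r": 64,
    "lora_alpha": 128,
    "lora_dropout": 0.05,
    "target_modules": ["q_proj", "k_proj", "v_proj", "o_proj"],
    "exclude_modules": ["block_sparse_moe", "w1", "w2", "w3", "gate"],
    "task_type": "CAUSAL_LM",
}


def lora_scaling(r: int, alpha: int) -> float:
    """Standard LoRA scaling factor applied to the low-rank update (alpha / r)."""
    return alpha / r


def lora_param_count(d_in: int, d_out: int, r: int) -> int:
    """Trainable parameters LoRA adds to one linear layer: A is r x d_in, B is d_out x r."""
    return r * (d_in + d_out)


def build_lora_config():
    """Build a PEFT LoraConfig; imported lazily so the rest runs without PEFT installed."""
    from peft import LoraConfig

    return LoraConfig(**LORA_KWARGS)


print(lora_scaling(LORA_KWARGS["r"], LORA_KWARGS["lora_alpha"]))
# Hypothetical 4096 x 4096 projection, chosen only to illustrate the formula.
print(lora_param_count(4096, 4096, LORA_KWARGS["r"]))
```

Restricting the adapter to the attention projections and excluding the mixture-of-experts weights keeps the trainable parameter count small relative to the base model.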

Training Hyperparameters

  • Epochs: 3
  • Learning Rate: 2e-4 (cosine schedule)
  • Hardware: 8 x NVIDIA H200
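A cosine schedule with a 2e-4 peak can be sketched as below. The card does not state warmup steps, the minimum learning rate, or the total step count, so this minimal version assumes no warmup and decay to zero; `total_steps=1000` is a placeholder.

```python
import math


def cosine_lr(step: int, total_steps: int, peak_lr: float = 2e-4, min_lr: float = 0.0) -> float:
    """Cosine decay from peak_lr at step 0 to min_lr at total_steps (no warmup assumed)."""
    progress = min(max(step / total_steps, 0.0), 1.0)
    return min_lr + 0.5 * (peak_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


# Placeholder step count; the real value depends on batch size and sequence packing.
total_steps = 1000
for step in (0, 250, 500, 750, 1000):
    print(step, f"{cosine_lr(step, total_steps):.2e}")
```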

🛠️ Technical Specifications

  • Precision: bfloat16
  • Inference Speed: Optimized with Flash Attention 2
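One way to load the adapter with these settings is sketched below, assuming the `transformers`, `peft`, `accelerate`, and `flash-attn` packages are installed and that enough GPU memory is available for the base model. The `load_model` helper is hypothetical, and the heavy imports sit inside it so the snippet can be read and imported without triggering a download.

```python
BASE_MODEL_ID = "zai-org/GLM-4.5-Air"
ADAPTER_ID = "AdityaNarayan/GLM-4.5-Air-CPT-LoRA-v2-HyperSwitch"


def load_model(base_id: str = BASE_MODEL_ID, adapter_id: str = ADAPTER_ID):
    """Load the base model in bfloat16 with Flash Attention 2 and attach the LoRA adapter."""
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base = AutoModelForCausalLM.from_pretrained(
        base_id,
        torch_dtype=torch.bfloat16,               # precision listed above
        attn_implementation="flash_attention_2",  # needs the flash-attn package
        device_map="auto",                        # needs accelerate; shards across GPUs
    )
    model = PeftModel.from_pretrained(base, adapter_id)
    model.eval()
    return tokenizer, model
```

Calling `tokenizer, model = load_model()` downloads the base weights and the adapter on first use; if Flash Attention 2 is unavailable, dropping the `attn_implementation` argument falls back to the library default.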

🙏 Acknowledgments

  • Zai Team for the excellent GLM-4.5-Air base model
  • Hyperswitch Team for the open-source payment processing platform
  • Hugging Face for the transformers and PEFT libraries

📞 Citation

@misc{hyperswitch-glm-air-lora-v2-2024,
  title={AdityaNarayan/GLM-4.5-Air-CPT-LoRA-v2-HyperSwitch},
  author={Aditya Narayan},
  year={2024},
  publisher={Hugging Face},
  url={https://huggingface.co/AdityaNarayan/GLM-4.5-Air-CPT-LoRA-v2-HyperSwitch}
}