Qwen3-4B-Maid

中文

模型简介

Qwen3-4B-Maid 是基于 Qwen3-4B-Instruct-2507 进行 LoRA 微调的对话模型，使用自制的女仆风格对话数据集训练，使模型在保留原有语言能力的同时，具备温柔、礼貌、带有女仆语气的对话风格。

模型信息

项目	详情
基座模型	Qwen3-4B-Instruct-2507
微调方式	LoRA
微调框架	LLaMA-Factory
训练数据量	1,717 条对话
训练步数	100 steps
学习率	5e-5
最终 Loss	~1.3

训练数据

使用自制的中文女仆风格对话数据集，共 1,717 条，涵盖日常聊天、角色扮演等场景，旨在让模型学习温柔、礼貌、带有「ご主人様」语气的表达方式。

使用方式

推荐使用 transformers 加载（需配合对应的 LoRA adapter）：

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-4B-Instruct-2507")
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-4B-Instruct-2507")
model = PeftModel.from_pretrained(base_model, "your-username/Qwen3-4B-Maid")

效果说明

✅ 模型已学会女仆语气与说话风格
⚠️ 回复有时偏冗长，建议在推理时适当限制 max_new_tokens
本模型为个人实验性项目，效果仅供参考

局限性与注意事项

训练步数较少（100 steps），模型对风格的学习尚不完全稳定
不建议用于需要精确事实输出的场景
本模型不具备安全过滤能力，请使用者自行负责使用场景

开源协议

本模型遵循基座模型 Qwen3 的 Apache 2.0 License。

English

Model Description

Qwen3-4B-Maid is a LoRA fine-tuned conversational model based on Qwen3-4B-Instruct-2507. It was trained on a custom maid-style dialogue dataset to give the model a gentle, polite, and maid-like conversational tone, while retaining the original model's language capabilities.

Model Details

Field	Details
Base Model	Qwen3-4B-Instruct-2507
Fine-tuning Method	LoRA
Fine-tuning Framework	LLaMA-Factory
Training Data Size	1,717 dialogues
Training Steps	100 steps
Learning Rate	5e-5
Final Loss	~1.3

Training Data

A custom Chinese maid-style dialogue dataset with 1,717 entries, covering daily conversation and roleplay scenarios. The goal is to teach the model a gentle, polite tone reminiscent of a maid character.

Usage

Recommended to load via transformers with the LoRA adapter:

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-4B-Instruct-2507")
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-4B-Instruct-2507")
model = PeftModel.from_pretrained(base_model, "your-username/Qwen3-4B-Maid")

Performance Notes

✅ The model has successfully learned the maid speech style and tone
⚠️ Responses can be verbose; consider limiting max_new_tokens during inference
This is a personal experimental project; results are for reference only

Limitations

Limited training (100 steps); style consistency may vary
Not recommended for tasks requiring precise factual output
No safety filtering; users are responsible for appropriate use

License

This model follows the Apache 2.0 License of the base Qwen3 model.

Downloads last month: -

Safetensors

Model size

4B params

Tensor type

F16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Zyi4082/qwen3-4b-maid

Base model

Qwen/Qwen3-4B-Instruct-2507

Adapter

(3101)

this model

Adapters

2 models