---
license: apache-2.0
language:
- multilingual
datasets:
- agentlans/chat-annotated
base_model:
- ibm-granite/granite-embedding-107m-multilingual
pipeline_tag: text-classification
---
# granite-embedding-107m-multilingual-chat-difficulty
A fine-tuned model that estimates the difficulty of multilingual, multi-turn human–AI conversations based on reasoning complexity.
- **Input:** A condensed conversation in the format `<|user|>prompt<|assistant|>reply...`
- **Output:** A normalized difficulty score (lower scores indicate easier conversations)
Based on [ibm-granite/granite-embedding-107m-multilingual](https://huggingface.co/ibm-granite/granite-embedding-107m-multilingual).
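The input format above can be sketched in a few lines of Python. This is a minimal illustration, not the author's preprocessing code. The `role`/`content` message schema, the helper names `condense_conversation` and `score_conversation`, the decision to skip roles other than user and assistant, and the repository ID (inferred from this card's title) are all assumptions. The scoring helper also assumes the checkpoint loads as a sequence-classification model with a single regression output.

```python
from typing import Iterable, Mapping

# Role tags taken from the input format described in this card.
ROLE_TAGS = {"user": "<|user|>", "assistant": "<|assistant|>"}

# Assumption: repository ID inferred from the card title, not stated in the card.
ASSUMED_MODEL_ID = "agentlans/granite-embedding-107m-multilingual-chat-difficulty"


def condense_conversation(messages: Iterable[Mapping[str, str]]) -> str:
    """Join chat turns into the `<|user|>prompt<|assistant|>reply...` form."""
    parts = []
    for message in messages:
        tag = ROLE_TAGS.get(message["role"])
        if tag is None:
            # Assumption: other roles (e.g. system) are not part of the format.
            continue
        parts.append(tag + message["content"])
    return "".join(parts)


def score_conversation(text: str, model_id: str = ASSUMED_MODEL_ID) -> float:
    """Return the difficulty score for one condensed conversation."""
    # Heavy imports stay local so the formatting helper works without them.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id)
    model.eval()
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits
    # Assumption: a single regression logit holds the normalized score.
    return logits.squeeze().item()


chat = [
    {"role": "user", "content": "What is 2 + 2?"},
    {"role": "assistant", "content": "4"},
]
print(condense_conversation(chat))  # <|user|>What is 2 + 2?<|assistant|>4
```

Calling `score_conversation(condense_conversation(chat))` downloads the checkpoint on first use; lower return values would indicate easier conversations.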
**Evaluation results:**
- **Loss:** 0.5663
- **MSE:** 0.5663
- **Tokens processed:** 51,173,120
## Model description
This model maps a multi-turn chat log to a single continuous difficulty score, so conversations can be compared across languages and reasoning styles.
Use cases include:
- Categorizing multilingual chat transcripts by reasoning depth.
- Supporting dataset curation or curriculum design.
- Serving as a difficulty scoring component in evaluation pipelines.
## Intended uses and limitations
**Use cases**
- Estimating reasoning difficulty in multilingual conversations.
- Comparing dialogue complexity across datasets.
- Benchmarking conversational reasoning.
**Limitations**
- Not suitable for assessing factual accuracy, coherence, or sentiment.
- May not generalize well to highly domain-specific data.
- Produces relative difficulty scores for comparing conversations, not absolute measures of difficulty or intelligence.
## Training procedure
### Hyperparameters
| Parameter | Value |
|------------|-------|
| learning_rate | 5e-5 |
| train_batch_size | 8 |
| eval_batch_size | 8 |
| seed | 42 |
| optimizer | AdamW (fused), betas=(0.9, 0.999), epsilon=1e-8 |
| lr_scheduler_type | linear |
| num_epochs | 5.0 |
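As a rough guide, the table above might map onto Hugging Face `TrainingArguments` as follows. This is a sketch, not the author's training script. It assumes a single device (so the batch sizes are per-device values), uses argument names from the Transformers 4.x API (which may differ in the 5.0.0.dev0 build listed in this card), and uses a placeholder output directory.

```python
# Hyperparameters from the table, keyed by Transformers 4.x argument names.
HYPERPARAMETERS = {
    "learning_rate": 5e-5,
    "per_device_train_batch_size": 8,  # assumes single-device training
    "per_device_eval_batch_size": 8,
    "seed": 42,
    "optim": "adamw_torch_fused",      # AdamW (fused)
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-8,
    "lr_scheduler_type": "linear",
    "num_train_epochs": 5.0,
}


def build_training_arguments(output_dir: str = "chat-difficulty-run"):
    """Build TrainingArguments from the table; output_dir is a placeholder."""
    # Imported lazily so the dictionary can be inspected without transformers.
    from transformers import TrainingArguments

    return TrainingArguments(output_dir=output_dir, **HYPERPARAMETERS)
```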
### Results
| Metric | Value |
|---------|-------|
| Evaluation loss | 0.5663 |
| MSE | 0.5663 |
| Tokens processed | 51,173,120 |
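The loss and MSE values are identical, which would be consistent with a mean-squared-error regression objective (an inference, not something this card states). Taking the square root gives the error in the same units as the normalized score:

```python
import math

mse = 0.5663            # MSE reported in this card
rmse = math.sqrt(mse)   # root-mean-square error, in score units

print(f"RMSE = {rmse:.4f}")  # RMSE = 0.7525
```

Under that reading, a predicted score is typically off by about 0.75 score units.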
## Framework versions
- **Transformers:** 5.0.0.dev0
- **PyTorch:** 2.9.1+cu128
- **Datasets:** 4.4.1
- **Tokenizers:** 0.22.1
## See also
[agentlans/bge-small-en-v1.5-prompt-difficulty](https://huggingface.co/agentlans/bge-small-en-v1.5-prompt-difficulty): a related model for single-turn English conversations and prompts.