Update README.md
README.md
CHANGED
@@ -49,6 +49,8 @@ MEL is trained on a **curated corpus** of **5.52 million legal texts (~92.7GB)**
 
 To ensure high-quality text processing, documents were preprocessed by **removing unwanted characters, normalizing spacing, chunking texts, and filtering non-Spanish content**.
 
+**Cutoff date:** February 2024
+
 ### Training Configuration
 - **GPU:** NVIDIA A100 80GB PCIe
 - **Training Time:** 13.9 days (~7 days per epoch, 2 epochs total)