advexon committed · Commit b7acf27 · verified · 1 Parent(s): a1a4186

Fix model configuration for proper Hugging Face compatibility

Files changed (2):
  1. README.md +16 -3
  2. config.json +32 -0
README.md CHANGED

````diff
@@ -6,6 +6,7 @@ tags:
 - transformers
 - pytorch
 - multilingual
+- xlm-roberta
 license: mit
 ---
 
@@ -15,14 +16,18 @@ Multilingual text classification model trained on XLM-RoBERTa base for sentiment
 
 ## Model Description
 
-This is a multilingual text classification model based on XLM-RoBERTa. It has been trained for sentiment analysis across multiple languages and can classify text into positive, negative, and neutral categories.
+This is a multilingual text classification model based on XLM-RoBERTa. It has been fine-tuned for sentiment analysis across multiple languages and can classify text into positive, negative, and neutral categories.
 
 ## Model Details
 
 - **Base Model**: XLM-RoBERTa Base
-- **Number of Labels**: 3 (Positive, Negative, Neutral)
+- **Model Type**: XLMRobertaForSequenceClassification
+- **Number of Labels**: 3 (Negative, Neutral, Positive)
 - **Languages**: Multilingual (English, Russian, Tajik, and others)
 - **Max Sequence Length**: 512 tokens
+- **Hidden Size**: 768
+- **Attention Heads**: 12
+- **Layers**: 12
 
 ## Performance
 
@@ -75,6 +80,14 @@ This model was trained using:
 - **Training Epochs**: 2
 - **Languages**: English, Russian, Tajik
 
+## Model Architecture
+
+The model uses the XLM-RoBERTa architecture with:
+- 12 transformer layers
+- 768 hidden dimensions
+- 12 attention heads
+- 3 classification heads for sentiment analysis
+
 ## Limitations
 
 - The model's performance may vary across different languages
@@ -89,7 +102,7 @@ If you use this model in your research, please cite:
 ```bibtex
 @misc{multilingual-text-classifier,
 title={Multilingual Text Classification Model},
-author={Your Name},
+author={Advexon},
 year={2024},
 publisher={Hugging Face},
 journal={Hugging Face Hub},
````
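The README changes above document a 3-label sentiment head whose index-to-label order is fixed by the `id2label` mapping in `config.json` (0 = negative, 1 = neutral, 2 = positive). As a minimal, dependency-free sketch of how raw classifier logits map onto those labels — the logits below are made up for illustration, not real model output:

```python
import math

# Label order follows the id2label mapping in config.json.
ID2LABEL = {0: "negative", 1: "neutral", 2: "positive"}

def softmax(logits):
    """Convert raw classifier logits into probabilities."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def decode(logits):
    """Pick the highest-probability sentiment label."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return ID2LABEL[best], probs[best]

# Illustrative logits only: index 2 dominates, so "positive" wins.
label, prob = decode([-1.2, 0.3, 2.1])
print(label, round(prob, 3))
```

In practice the `transformers` text-classification pipeline performs this softmax-and-argmax step internally; the sketch only makes the label ordering explicit.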
config.json ADDED

```diff
@@ -0,0 +1,32 @@
+{
+  "model_type": "xlm-roberta",
+  "architectures": [
+    "XLMRobertaForSequenceClassification"
+  ],
+  "attention_probs_dropout_prob": 0.1,
+  "bos_token_id": 0,
+  "eos_token_id": 2,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 768,
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "layer_norm_eps": 1e-12,
+  "max_position_embeddings": 514,
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "pad_token_id": 1,
+  "type_vocab_size": 1,
+  "vocab_size": 250002,
+  "num_labels": 3,
+  "id2label": {
+    "0": "negative",
+    "1": "neutral",
+    "2": "positive"
+  },
+  "label2id": {
+    "negative": 0,
+    "neutral": 1,
+    "positive": 2
+  }
+}
```
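Hugging Face resolves label names through the `id2label`/`label2id` pair this file adds, so a quick sanity check that the two mappings are exact inverses (and agree with `num_labels`) can catch config typos before they surface as mislabeled predictions. A minimal sketch using only the standard library — the `CONFIG` literal below is a trimmed excerpt of the file above, not the full config:

```python
import json

# Trimmed copy of the label-related fields from this commit's config.json;
# the full file also carries the architecture hyperparameters.
CONFIG = json.loads("""
{
  "model_type": "xlm-roberta",
  "num_labels": 3,
  "id2label": {"0": "negative", "1": "neutral", "2": "positive"},
  "label2id": {"negative": 0, "neutral": 1, "positive": 2}
}
""")

def check_label_maps(cfg):
    """Verify id2label and label2id are exact inverses and match num_labels."""
    # JSON object keys are strings, so normalize id2label keys to ints.
    id2label = {int(k): v for k, v in cfg["id2label"].items()}
    label2id = cfg["label2id"]
    assert len(id2label) == cfg["num_labels"], "num_labels mismatch"
    assert {v: k for k, v in id2label.items()} == label2id, "maps not inverses"
    return True

check_label_maps(CONFIG)
```

The int-key normalization matters because JSON forces `id2label` keys to be strings, while `label2id` values are plain integers; comparing without it would always fail.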