---
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:130899
- loss:MultipleNegativesRankingLoss
base_model: chandar-lab/NeoBERT
widget:
- source_sentence: Also, Lou Reed is tough.
sentences:
- Lou Reed is tough.
- The snow was so deep in the field that if you fell, you wouldn't feel it.
- Some organizations don't like change.
- source_sentence: Justice, said the scout.
sentences:
- 'At any moment, there are over 100 people guarding the president. '
- Our three kids did a lot of camping with us.
- The scout called for justice.
- source_sentence: More importantly, I looked accurate.
sentences:
- 'Kal was not the only one whose eyes went out of focus. '
- The Commission interpreted it.
- I looked right.
- source_sentence: no no they're not real hard
sentences:
- I waited eight seconds.
- G.M. has demonstrated it is capable of producing a first-class commercial with
it's Saturn line.
- Hard isn't a word I would use to describe them.
- source_sentence: but there the majority really haven't done anything with their
yards this neighborhood is is four years old
sentences:
- Perret inhabited Saint-Pierre during the 1930s
- Most of them haven't done anything, this neighborhood is four years old.
- Clinton stood to the side and was not in the middle of the attacks.
pipeline_tag: sentence-similarity
library_name: sentence-transformers
---
# SentenceTransformer based on chandar-lab/NeoBERT
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [chandar-lab/NeoBERT](https://huggingface.co/chandar-lab/NeoBERT). It maps sentences & paragraphs to a 1536-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
## Model Details
### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [chandar-lab/NeoBERT](https://huggingface.co/chandar-lab/NeoBERT)
- **Maximum Sequence Length:** None tokens
- **Output Dimensionality:** 1536 dimensions
- **Similarity Function:** Cosine Similarity
### Model Sources
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
### Full Model Architecture
```
SentenceTransformer(
(0): Transformer({'max_seq_length': None, 'do_lower_case': False}) with Transformer model: NeoBERT
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)
```
## Usage
### Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
```bash
pip install -U sentence-transformers
```
Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("ashercn97/neobert-multi-nli")
# Run inference
sentences = [
"but there the majority really haven't done anything with their yards this neighborhood is is four years old",
"Most of them haven't done anything, this neighborhood is four years old.",
'Perret inhabited Saint-Pierre during the 1930s',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 1536]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
```
## Training Details
### Training Dataset
#### Unnamed Dataset
* Size: 130,899 training samples
* Columns: sentence_0 and sentence_1
* Approximate statistics based on the first 1000 samples:
| | sentence_0 | sentence_1 |
|:--------|:-----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|
| type | string | string |
| details |
I can tell you, Hastings, it's making life jolly difficult for us. | This is making life a lot harder for us. |
| The striking thing about workers' comments after the vote was how many of them mentioned the possibility of the company shutting down its operations. | The striking thing about workers' comments after the vote was how many of them mentioned the possibility of their company shutting down its operations. |
| Stephen Hargarten said that screening for alcohol applies not only to the potential for interventions, but also to the patient's overall quality of care, including safety from injury due to alcohol impairment or from alcohol withdrawal during the acute phase of treatment for medical or surgical conditions. | Stephen Hargarten said that screening for alcohol applies to patient's overall quality of care. |
* Loss: [MultipleNegativesRankingLoss](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
```json
{
"scale": 20.0,
"similarity_fct": "cos_sim"
}
```
### Training Hyperparameters
#### Non-Default Hyperparameters
- `per_device_train_batch_size`: 128
- `per_device_eval_batch_size`: 128
- `fp16`: True
- `multi_dataset_batch_sampler`: round_robin
#### All Hyperparameters