lunahr commited on
Commit
34371d8
·
verified ·
1 Parent(s): 7e30823

OpenLLM satisfaction information

Browse files
Files changed (1) hide show
  1. README.md +28 -18
README.md CHANGED
@@ -1,21 +1,21 @@
1
- ---
2
- language:
3
- - en
4
- license: llama3.2
5
- tags:
6
- - text-generation-inference
7
- - transformers
8
- - llama
9
- - trl
10
- - sft
11
- - reasoning
12
- - llama-3
13
- base_model: CreitinGameplays/Llama-3.2-3b-Instruct-uncensored-refinetune
14
- datasets:
15
- - KingNish/reasoning-base-20k
16
- pipeline_tag: text-generation
17
- library_name: transformers
18
- ---
19
 
20
  # Model Description
21
 
@@ -65,8 +65,18 @@ response_output = tokenizer.decode(response_ids[0, response_inputs.input_ids.sha
65
  print("ANSWER: " + response_output)
66
  ```
67
 
 
 
 
 
 
 
 
 
 
68
  - **Trained by:** [Piotr Zalewski](https://huggingface.co/lunahr)
69
  - **License:** llama3.2
 
70
  - **Finetuned from model:** [CreitinGameplays/Llama-3.2-3b-Instruct-uncensored-refinetune](https://huggingface.co/CreitinGameplays/Llama-3.2-3b-Instruct-uncensored-refinetune)
71
  - **Dataset used:** [KingNish/reasoning-base-20k](https://huggingface.co/datasets/KingNish/reasoning-base-20k)
72
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: llama3.2
5
+ tags:
6
+ - text-generation-inference
7
+ - transformers
8
+ - llama
9
+ - trl
10
+ - sft
11
+ - reasoning
12
+ - llama-3
13
+ base_model: CreitinGameplays/Llama-3.2-3b-Instruct-uncensored-refinetune
14
+ datasets:
15
+ - KingNish/reasoning-base-20k
16
+ pipeline_tag: text-generation
17
+ library_name: transformers
18
+ ---
19
 
20
  # Model Description
21
 
 
65
  print("ANSWER: " + response_output)
66
  ```
67
 
68
+ # Intended Use
69
+ This model is intended as an OpenAI o1 replacement for weaker hardware, mimicking o1 in the response formatting.
70
+
71
+ # Limitations
72
+ - There may be a higher chance of getting hallucinations with this model due to its small size.
73
+ - Some questions may be answered incorrectly.
74
+ - This model is uncensored, exercise caution when generating sensitive content.
75
+
76
+
77
  - **Trained by:** [Piotr Zalewski](https://huggingface.co/lunahr)
78
  - **License:** llama3.2
79
+ - **Architecture:**: llama3.2
80
  - **Finetuned from model:** [CreitinGameplays/Llama-3.2-3b-Instruct-uncensored-refinetune](https://huggingface.co/CreitinGameplays/Llama-3.2-3b-Instruct-uncensored-refinetune)
81
  - **Dataset used:** [KingNish/reasoning-base-20k](https://huggingface.co/datasets/KingNish/reasoning-base-20k)
82