Update README.md
Browse files
README.md
CHANGED
|
@@ -7,7 +7,7 @@ This is simply a patched version of the original Qwen-QwQ-32B model.
|
|
| 7 |
Changed Functionality:
|
| 8 |
- This version of the model will "remember thinking" as the conversation progresses. That is to say, all text contained between the \<think\> tags will be preserved in the context window and be utilized during all future response generations.
|
| 9 |
|
| 10 |
-
Pros: It can remember its previous thinking processes,
|
| 11 |
|
| 12 |
Cons: The context window will fill much faster. And coherence will likely drop-off sooner.
|
| 13 |
|
|
|
|
| 7 |
Changed Functionality:
|
| 8 |
- This version of the model will "remember thinking" as the conversation progresses. That is to say, all text contained between the \<think\> tags will be preserved in the context window and be utilized during all future response generations.
|
| 9 |
|
| 10 |
+
Pros: It can remember its previous thinking processes, thus potentially being better at unpacking complex concepts over a series of back-and-forth prompts.
|
| 11 |
|
| 12 |
Cons: The context window will fill much faster. And coherence will likely drop-off sooner.
|
| 13 |
|