theouterspaced
/

Qwen_QwQ-32B-Q8_0_remember-thinking.gguf

Model card Files Files and versions

theouterspaced commited on Jul 21

Commit

23647c1

·

verified ·

1 Parent(s): bb60543

Update README.md

Files changed (1) hide show

README.md +13 -1

README.md CHANGED Viewed

@@ -2,4 +2,16 @@
 base_model:
 - Qwen/QwQ-32B
 ---
-description

 base_model:
 - Qwen/QwQ-32B
 ---
+description
+This is simply a patched version of the original Qwen-QwQ-32B model.
+Changed Functionality:
+ - This version of the model will "remember thinking" as the conversation progresses. That is to say, all text contained between the <think> tags will be preserved in the context window to be utilized during all future response generations.
+Pros: It can remember its previous thinking process, thus potentially be better at unpacking complex concepts over a series of back-and-forth prompts.
+Cons: Context window will fill much faster. And coherence will likely drop-off faster.
+Download GGUF:
+- https://huggingface.co/theouterspaced/Qwen_QwQ-32B-Q8_0_remember-thinking.gguf/resolve/main/Qwen_QwQ-32B-Q8_0%2Bremember-thinking.gguf