--- base_model: - Qwen/QwQ-32B --- This is simply a patched version of the original Qwen-QwQ-32B model. (https://huggingface.co/Qwen/QwQ-32B) Changed Functionality: - This version of the model will "remember thinking" as the conversation progresses. That is to say, all text contained between the \ tags will be preserved in the context window and be utilized during all future response generations. Pros: It can remember its previous thinking processes, thus potentially being better at unpacking complex concepts over a series of back-and-forth prompts. Cons: The context window will fill much faster. And coherence will likely drop-off sooner. Download GGUF: - https://huggingface.co/theouterspaced/Qwen_QwQ-32B-Q8_0_remember-thinking.gguf/resolve/main/Qwen_QwQ-32B-Q8_0%2Bremember-thinking.gguf