ubergarm commited on
Commit
62b4b9a
·
1 Parent(s): 7873ecb

update perplexity graph with full Q8_0 baseline

Browse files
Files changed (2) hide show
  1. README.md +3 -1
  2. images/perplexity.png +2 -2
README.md CHANGED
@@ -377,6 +377,7 @@ numactl -N ${SOCKET} -m ${SOCKET} \
377
  ## Quick Start
378
  You might need to override the template as needed. The original is here: https://huggingface.co/moonshotai/Kimi-K2-Thinking/blob/main/chat_template.jinja
379
  You can do stuff like `--jinja --chat-template-file ./my-custom-template.jinja`.
 
380
 
381
  ```bash
382
  # Example running Hybrid CPU+GPU(s) on ik_llama.cpp
@@ -396,7 +397,8 @@ You can do stuff like `--jinja --chat-template-file ./my-custom-template.jinja`.
396
  --host 127.0.0.1 \
397
  --port 8080 \
398
  --no-mmap \
399
- --jinja
 
400
 
401
  # Example running mainline llama.cpp
402
  # remove `-mla 3` from commands and you should be :gucci:
 
377
  ## Quick Start
378
  You might need to override the template as needed. The original is here: https://huggingface.co/moonshotai/Kimi-K2-Thinking/blob/main/chat_template.jinja
379
  You can do stuff like `--jinja --chat-template-file ./my-custom-template.jinja`.
380
+ You may also need to pass `--special` for it to output `<think>` and` </think>` tags correctly depending on endpoint and client used, thanks [u/Melodic-Network4374](https://www.reddit.com/r/LocalLLaMA/comments/1oqo57j/comment/nnpqxjx/)
381
 
382
  ```bash
383
  # Example running Hybrid CPU+GPU(s) on ik_llama.cpp
 
397
  --host 127.0.0.1 \
398
  --port 8080 \
399
  --no-mmap \
400
+ --jinja \
401
+ --special
402
 
403
  # Example running mainline llama.cpp
404
  # remove `-mla 3` from commands and you should be :gucci:
images/perplexity.png CHANGED

Git LFS Details

  • SHA256: 58266a6751ac90dff2b694b7a8cc807c942c8ad959d88e90a85bd8f71bee870c
  • Pointer size: 131 Bytes
  • Size of remote file: 140 kB

Git LFS Details

  • SHA256: 437a4ad2a70057e0abc5fac9c12b3c559b23c8fe5749a5ddb16271ee6119b6c8
  • Pointer size: 131 Bytes
  • Size of remote file: 147 kB