CUDA error when prompt start processing
#7 opened about 2 months ago by illeniumx (1 reaction, 1 comment)
Custom jinja template and draft model usage
#6 opened 3 months ago by ubergarm (1 reaction, 9 comments)
KL Divergence as Performance Metric
#5 opened 3 months ago by joaquinrfs (1 comment)
IQ2_KL Testing - Runs Great Until The Model The Model The Model (lol)
#4 opened 3 months ago by phakio (1 reaction, 8 comments)
Can you provide some low-precision quantization options?
#3 opened 4 months ago by lingyezhixing (3 reactions, 11 comments)
Good job
#2 opened 4 months ago by huccjj (56 comments)
Works like a charm on ik_llama.cpp server with PR 668
#1 opened 4 months ago by Nexesenex (3 reactions, 11 comments)