Multiple chat template fixes
pinnedπ
β€οΈ
7
14
#2 opened about 2 months ago
by
danielhanchen
Problem with llama.cpp input batch sequence
#11 opened 8 days ago
by
TheFastestPhoton
Perplexity?
#10 opened 8 days ago
by
dxkna
Llama.cpp reasoning_content
3
#8 opened about 2 months ago
by
brianw
Missing tensor 'blk.92.nextn.embed_tokens.weight error
π
3
5
#7 opened about 2 months ago
by
MadManDan
what's the best Q4 quant?
4
#4 opened about 2 months ago
by
SlavikF
Is it the same architecture than GLM 4.5 ?
π
β
2
5
#3 opened about 2 months ago
by
AliceThirty
Fingers crossed for the 4.6-air
β
β€οΈ
6
14
#1 opened about 2 months ago
by
aaron-newsome