ubergarm
/

GLM-4.7-GGUF

Text Generation

Model card Files Files and versions

Resources

View closed (0)

can someone help me with how to ofload tensors

#12 opened 11 days ago by

IQ2_K_L Tuned - Mirostat Settings

#11 opened 26 days ago by

Is this a thinking model?

#10 opened about 1 month ago by

Why does this double my PP and improve TG?

#9 opened about 2 months ago by

anyone running via cpu+gpu+rpc gpu ?

#8 opened about 2 months ago by

EPYC, RTX 5090 vs RTX 6000

#7 opened about 2 months ago by

Testing IQ5_K

#6 opened about 2 months ago by

Stable run on 2x RTX 5090 and 2 Xeon E5 2696 V4 and DDR4 with ik_llama.cpp - 6.1 t/s on IQ4_K and 5.1 t/s on IQ5_K, opencode works with this

#5 opened about 2 months ago by

IQ3_KS is awesome!

#4 opened about 2 months ago by

9.31mb first part Q5?

#3 opened about 2 months ago by

Please make IQ2_KS version 🙏

#2 opened about 2 months ago by

Can't wait for a q4 quant from you

#1 opened about 2 months ago by