
ubergarm
/
GLM-4.7-GGUF

Text Generation
GGUF
English
Chinese
imatrix
conversational
ik_llama.cpp
glm4_moe
Community
12

Can someone help me with how to offload tensors?

3
#12 opened 11 days ago by
theracn

IQ2_K_L Tuned - Mirostat Settings

👍 2
#11 opened 26 days ago by
Hunterx

Is this a thinking model?

3
#10 opened about 1 month ago by
geveent

Why does this double my PP and improve TG?

4
#9 opened about 2 months ago by
gtkunit

anyone running via cpu+gpu+rpc gpu ?

3
#8 opened about 2 months ago by
gopi87

EPYC, RTX 5090 vs RTX 6000

🔥 1
7
#7 opened about 2 months ago by
sousekd

Testing IQ5_K

👍 1
1
#6 opened about 2 months ago by
shewin

Stable run on 2x RTX 5090 and 2 Xeon E5 2696 V4 and DDR4 with ik_llama.cpp - 6.1 t/s on IQ4_K and 5.1 t/s on IQ5_K, opencode works with this

👍 1
17
#5 opened about 2 months ago by
martossien

IQ3_KS is awesome!

🔥 ❤️ 3
#4 opened about 2 months ago by
mtcl

9.31mb first part Q5?

👍 1
2
#3 opened about 2 months ago by
inritwritten

Please make IQ2_KS version 🙏

❀️ 3
2
#2 opened about 2 months ago by
Buridda

Can't wait for a q4 quant from you

🤗 1
5
#1 opened about 2 months ago by
mtcl