-
-
-
-
-
-
Inference Providers
Active filters:
量化修复
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int4
Text Generation
•
4B
•
Updated
•
1.85k
•
1
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
•
9B
•
Updated
•
63
•
6
model-scope/glm-4-9b-chat-GPTQ-Int8
Text Generation
•
9B
•
Updated
•
16
•
2
JunHowie/Qwen3-0.6B-GPTQ-Int4
Text Generation
•
0.6B
•
Updated
•
419
•
1
JunHowie/Qwen3-0.6B-GPTQ-Int8
Text Generation
•
0.6B
•
Updated
•
20
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
451
•
1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
•
2B
•
Updated
•
17
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
•
33B
•
Updated
•
846
•
3
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
•
33B
•
Updated
•
250
•
3
JunHowie/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
•
5B
•
Updated
•
132
•
1
JunHowie/Qwen3-14B-GPTQ-Int8
Text Generation
•
15B
•
Updated
•
83
•
1
JunHowie/Qwen3-14B-GPTQ-Int4
Text Generation
•
15B
•
Updated
•
751
•
4
JunHowie/Qwen3-8B-GPTQ-Int8
Text Generation
•
8B
•
Updated
•
112
JunHowie/Qwen3-8B-GPTQ-Int4
Text Generation
•
8B
•
Updated
•
1.37k
•
4
JunHowie/Qwen3-4B-GPTQ-Int4
Text Generation
•
4B
•
Updated
•
208
•
1
JunHowie/Qwen3-4B-GPTQ-Int8
Text Generation
•
4B
•
Updated
•
9
JunHowie/Qwen3-30B-A3B-GPTQ-Int8
Text Generation
•
8B
•
Updated
•
7.26k
QuantTrio/Qwen3-235B-A22B-GPTQ-Int8
Text Generation
•
235B
•
Updated
•
53
QuantTrio/DeepSeek-R1-0528-Qwen3-8B-GPTQ-Int4-Int8Mix
Text Generation
•
11B
•
Updated
•
87
•
3
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Lite
Text Generation
•
721B
•
Updated
•
11
•
1
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Compact
Text Generation
•
847B
•
Updated
•
8
•
5
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Medium
Text Generation
•
912B
•
Updated
•
43
•
1
koushd/Qwen3-235B-A22B-Instruct-2507-AWQ
Text Generation
•
235B
•
Updated
•
65
•
4
QuantTrio/Qwen3-235B-A22B-Instruct-2507-GPTQ-Int4-Int8Mix
Text Generation
•
248B
•
Updated
•
317
•
2
QuantTrio/Qwen3-235B-A22B-Instruct-2507-AWQ
Text Generation
•
235B
•
Updated
•
2.76k
•
10
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-AWQ
Text Generation
•
480B
•
Updated
•
515
•
8
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-GPTQ-Int4-Int8Mix
Text Generation
•
534B
•
Updated
•
184
•
6
QuantTrio/Qwen3-235B-A22B-Thinking-2507-AWQ
Text Generation
•
235B
•
Updated
•
1.71k
•
5
QuantTrio/Qwen3-235B-A22B-Thinking-2507-GPTQ-Int4-Int8Mix
Text Generation
•
253B
•
Updated
•
128
•
2
QuantTrio/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
•
31B
•
Updated
•
899
•
9