Model Architecture Context

Model Description

This is Ex0bit/GLM-4.7-Flash-PRISM.

PLEASE SUPPORT MY WORK!



Support Donation Options:

  • PRISM VIP Member Sign-Up (all models): ✓ Priority Access
  • One-Time Support (this model)

GLM-4.7-Flash-PRISM: Unrestricted (Zero Over-Refusals and Zero Propaganda) GLM-4.7-Flash Model Access

Access GLM-4.7-Flash-PRISM, an abliterated version of ZAI's efficient 30B-A3B MoE model with over-refusal mechanisms removed.
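This card does not document the PRISM procedure itself, so as a rough illustration of how directional-ablation ("abliteration") methods generally work, here is a minimal sketch that projects an estimated refusal direction out of a weight matrix writing into the residual stream. Every name and shape below is an assumption for illustration, not the actual PRISM implementation:

```python
import torch

def ablate_direction(W: torch.Tensor, r: torch.Tensor) -> torch.Tensor:
    """Project the estimated 'refusal direction' r out of weight matrix W.

    W: a weight writing into the residual stream, shape (d_model, d_in)
    r: estimated refusal direction in the residual stream, shape (d_model,)

    Returns W' = (I - r r^T) W, so the layer can no longer write any
    component along r into the residual stream.
    """
    r = r / r.norm()                   # unit-normalize the direction
    return W - torch.outer(r, r @ W)   # rank-1 orthogonal projection

# The direction is commonly estimated as a difference of mean activations
# on refusal-inducing vs. benign prompts (variable names illustrative):
# r = acts_refused.mean(dim=0) - acts_benign.mean(dim=0)
```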

What You Get:

  • 30B-A3B MoE Architecture: a lightweight yet powerful Mixture-of-Experts model with 30 billion total parameters and ~3 billion active per token for fast, efficient inference (see the routing sketch after this list)
  • PRISM (Projected Refusal Isolation via Subspace Modification): a state-of-the-art abliteration technique that removes over-refusal behaviors while preserving capabilities
  • 128K Context Window: extended context for complex tasks and large codebases
  • Interleaved & Preserved Thinking: multi-turn reasoning that persists across conversations, with per-turn thinking control
  • Strong In-Class Benchmarks: 91.6% AIME 2025, 79.5% τ²-Bench, 59.2% SWE-bench Verified, 75.2% GPQA
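
To make the 30B-total / ~3B-active distinction concrete, below is a minimal top-k expert-routing sketch. It is illustrative only; the shapes, names, and k value are assumptions, not GLM's actual MoE code:

```python
import torch
import torch.nn.functional as F

def moe_layer(x: torch.Tensor, router_w: torch.Tensor, experts: list, k: int = 2) -> torch.Tensor:
    """Sparse MoE forward pass: each token activates only its top-k experts.

    x:        (n_tokens, d_model) token activations
    router_w: (n_experts, d_model) router projection
    experts:  list of callables, each mapping (d_model,) -> (d_model,)
    """
    scores = x @ router_w.T                       # (n_tokens, n_experts)
    top_scores, top_idx = scores.topk(k, dim=-1)  # pick k experts per token
    gates = F.softmax(top_scores, dim=-1)         # renormalize over the chosen k
    out = torch.zeros_like(x)
    for t in range(x.shape[0]):
        for j in range(k):
            # Only the selected experts run; the rest stay idle for this token.
            out[t] += gates[t, j] * experts[top_idx[t, j]](x[t])
    return out
```

Only the routed experts' parameters participate in each token's forward pass, which is how a 30B-parameter model can run with roughly 3B active parameters per token.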
GGUF

  • Model size: 30B params
  • Architecture: deepseek2
  • Available quantizations: 3-bit, 4-bit, 8-bit, 16-bit
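
A minimal way to run one of these quantizations locally is llama-cpp-python; the GGUF filename below is a placeholder, so substitute the actual file from this repo, and pick the bit width that fits your memory (lower bits trade some quality for a smaller footprint):

```python
from llama_cpp import Llama

llm = Llama(
    model_path="GLM-4.7-Flash-PRISM.Q4_K_M.gguf",  # placeholder name for a 4-bit file
    n_ctx=8192,       # raise toward the 128K limit only if memory allows
    n_gpu_layers=-1,  # offload all layers to the GPU when one is available
)

result = llm("Summarize mixture-of-experts routing in two sentences.", max_tokens=128)
print(result["choices"][0]["text"])
```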