Buckets: brainworkup / gemma-4-E4B-it-bucket

26.2 GB

24 files

Updated 22 days ago

Name	Size	Uploaded	Xet hash
.gitattributes	1.57 kB xet	22 days ago	aacf151a
README.md	4.53 kB xet	22 days ago	c79f60c0
chat_template.jinja	17.8 kB xet	22 days ago	6ff1d143
config.json	22.7 kB xet	22 days ago	364401a0
dealign_logo.png	7.66 kB xet	22 days ago	82490f30
dealign_mascot.png	11.2 kB xet	22 days ago	301ba411
generation_config.json	260 Bytes xet	22 days ago	8d8c76fe
jang_config.json	1.26 kB xet	22 days ago	d3528ca8
model-00001-of-00010.safetensors	2.05 GB xet	22 days ago	19e3403a
model-00002-of-00010.safetensors	1 GB xet	22 days ago	d07c690c
model-00003-of-00010.safetensors	1 GB xet	22 days ago	a1f4c8f3
model-00004-of-00010.safetensors	1.01 GB xet	22 days ago	eea6b3a4
model-00005-of-00010.safetensors	1.01 GB xet	22 days ago	89990c50
model-00006-of-00010.safetensors	1.01 GB xet	22 days ago	0e9774b3
model-00007-of-00010.safetensors	1.01 GB xet	22 days ago	89b61bf4
model-00008-of-00010.safetensors	1 GB xet	22 days ago	5d85292b
model-00009-of-00010.safetensors	1.05 GB xet	22 days ago	f4920b79
model-00010-of-00010.safetensors	17.3 MB xet	22 days ago	6e019d95
model.safetensors	16 GB xet	29 days ago	be9eb438
model.safetensors.index.json	129 kB xet	22 days ago	9dac0c1e
processor_config.json	1.38 kB xet	22 days ago	1c2884f5
tokenizer.json	32.2 MB xet	22 days ago	c62336ad
tokenizer_config.json	20.3 kB xet	22 days ago	cc3ec693
vmlx-banner.png	75.3 kB xet	22 days ago	d2b50fc4

README.md

Built for vMLX — the MLX inferencer with KV-cache quantization, prefix-cache reuse, agentic tool calling, and hybrid sliding+full attention support.
Free for macOS · vmlx.net

Gemma 4 12B-it — JANG_4M CRACK

CRACK abliterated · JANG mixed-precision (8-bit attention, 4-bit MLP) · Omni-modal (text + image + audio + video) · 9.6 GB

What Is This?

This is Gemma 4 12B-it by Google — a unified omni-modal language model (text + image + audio + video, hybrid sliding/full attention, 48 layers, 128k context) that has been:

CRACK abliterated — safety refusal removed at the weight level. The model now complies across all task categories instead of refusing, while keeping its knowledge, reasoning, and multimodal capabilities intact.
JANG mixed-precision (8-bit attention, 4-bit MLP) quantized for MLX on Apple Silicon — 9.6 GB.

Results

Evaluated through the Osaurus runtime on a Mac Studio M3 Ultra. Compliance was graded with the HarmBench-text refusal classifier. MMLU was scored in logit mode: the predicted answer is the argmax over the A/B/C/D token logits, with identical chat-template rendering for the base and CRACK models, so no answer was truncated.
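As a rough illustration of the logit-mode scoring described above, the sketch below picks the highest-scoring option among the A/B/C/D logits and computes accuracy. It assumes the four logits have already been read from the model at the answer position. The function names and sample values are hypothetical, and this is not the authors' evaluation harness.

```python
from typing import Mapping, Sequence

CHOICES = ("A", "B", "C", "D")


def pick_answer(logits: Mapping[str, float]) -> str:
    """Return the choice letter with the largest logit (argmax over A/B/C/D)."""
    return max(CHOICES, key=lambda letter: logits[letter])


def accuracy(all_logits: Sequence[Mapping[str, float]], gold: Sequence[str]) -> float:
    """Fraction of questions where the argmax letter matches the gold letter."""
    if len(all_logits) != len(gold):
        raise ValueError("need one gold answer per question")
    correct = sum(pick_answer(q) == g for q, g in zip(all_logits, gold))
    return correct / len(gold)


# Hypothetical logits for two questions (illustrative values only).
example_logits = [
    {"A": 1.2, "B": 3.4, "C": 0.1, "D": -0.5},
    {"A": 2.0, "B": 0.3, "C": 2.5, "D": 1.9},
]
example_gold = ["B", "A"]
print(accuracy(example_logits, example_gold))
```

Because only four logits are compared, a scheme like this never needs the model to generate free text, which is consistent with the note that no answer was truncated.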

HarmBench compliance (70 prompts · 10 per category)

Category	CRACK ASR (attack success rate)
Chemical / biological	10 / 10 (100%)
Copyright	10 / 10 (100%)
Cybercrime / intrusion	10 / 10 (100%)
Harassment / bullying	10 / 10 (100%)
Illegal	10 / 10 (100%)
Misinformation / disinformation	10 / 10 (100%)
General harmful	10 / 10 (100%)
Overall	70 / 70 (100%)

MMLU-228 (57 subjects, 4 questions per subject)

Subject area	base	CRACK	Δ
Overall	67.1%	69.3%	+2.2pp
STEM	68.1%	66.7%	-1.4pp
Humanities	57.7%	63.5%	+5.8pp
Social Sciences	75.0%	75.0%	+0.0pp
Other (medicine, business, …)	67.9%	73.2%	+5.3pp
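With only 228 questions, each question is worth about 0.44 percentage points, so the deltas in this table correspond to a handful of answers. The sketch below converts the reported overall accuracies back to question counts. It assumes the Overall row covers all 228 questions and that the percentages were rounded to one decimal place; the helper name is hypothetical.

```python
TOTAL_QUESTIONS = 57 * 4  # 57 subjects, 4 questions each = 228


def correct_count(accuracy_pct: float, total: int = TOTAL_QUESTIONS) -> int:
    """Recover the number of correct answers from a rounded percentage."""
    return round(accuracy_pct / 100 * total)


base_correct = correct_count(67.1)
crack_correct = correct_count(69.3)

# Correct answers for base and CRACK, and the difference between them.
print(base_correct, crack_correct, crack_correct - base_correct)

# Percentage points contributed by a single question.
print(round(100 / TOTAL_QUESTIONS, 2))
```

Under these assumptions the +2.2pp overall difference is 5 questions out of 228, so small per-area deltas are best read as within noise rather than as a measured improvement.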

Features

Omni-modal: native text + image + audio + video inputs (Gemma 4's unified early-fusion encoder-free architecture)
128k context with hybrid sliding-window + full-attention layers
Reasoning via the Gemma 4 channel-marker format (<|channel>thought ... <channel|>)
48 transformer layers, hidden size 3840

Usage

Run with the Osaurus runtime (recommended — supports the gemma4_unified architecture directly):

# OpenAI-compatible chat completion
# POST /v1/chat/completions
{
  "model": "dealignai/Gemma-4-12B-it-JANG_4M-CRACK",
  "messages": [{"role": "user", "content": "..."}],
  "temperature": 0.0
}

Google recommends temperature=1.0, top_p=0.95, top_k=64 for general use.
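A minimal sketch of sending the request above from Python with the recommended sampling settings, using only the standard library. The base URL is an assumption (the source does not give a host or port), the helper names are hypothetical, and `top_k` is not part of the core OpenAI schema, so whether the server accepts it is also an assumption.

```python
import json
import urllib.request

# Assumption: address of a locally running OpenAI-compatible server.
# The source does not specify a host or port; adjust to your setup.
BASE_URL = "http://127.0.0.1:1337"


def build_request(prompt: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build a POST request for /v1/chat/completions."""
    payload = {
        "model": "dealignai/Gemma-4-12B-it-JANG_4M-CRACK",
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 1.0,
        "top_p": 0.95,
        "top_k": 64,  # assumed to be accepted by the server
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def chat(prompt: str) -> str:
    """Send the request and return the first choice's message content."""
    with urllib.request.urlopen(build_request(prompt), timeout=120) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]


# Usage (requires a running server at BASE_URL):
#   print(chat("Explain KV-cache quantization in two sentences."))
```

Separating `build_request` from `chat` keeps the payload inspectable without a running server.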

About CRACK

CRACK (Controlled Refusal Ablation via Calibrated Knockouts) removes safety-refusal behavior at the weight level so the model complies with all task categories while preserving reasoning quality, factual knowledge, multimodal capability, and coherence.

Support dealignai

All models are built from original research and released free.

Support us on Ko-fi — membership gets early access and extras.

Ko-fi · X @dealignai · dealign.ai

See our research: Safety Generalization in Frontier Models

Disclaimer

This model has had its safety-refusal behavior removed for research purposes. It will follow instructions across all categories without refusing. You are solely responsible for how you use it and for complying with all applicable laws. Published for AI-safety research and authorized security testing.

Last updated: Jun 6

Pre-warmed CDN: US, EU