Models
Complete Zen4 model family - Consumer, Coder, and Ultra tiers
Zen4 Models
Consumer Line
Dense and MoE models for desktop and edge deployment. All built on abliterated open-source weights.
| Model | Parameters | Active | Context | License |
|---|---|---|---|---|
| Zen4 Mini | 4B | 4B | 32K | Apache 2.0 |
| Zen4 | 8B | 8B | 32K | Apache 2.0 |
| Zen4 Pro | 14B | 14B | 32K | Apache 2.0 |
| Zen4 Max | 30B MoE | 3B | 256K | Apache 2.0 |
| Zen4 Pro Max | 80B MoE | 3B | 256K | Apache 2.0 |
Zen4 Mini (4B)
Ultra-efficient for edge and mobile. Full quality at 4B parameters.
Zen4 (8B)
The standard model. Excellent balance of quality and efficiency.
Zen4 Pro (14B)
Professional-grade for demanding tasks. Strong reasoning and code generation.
Zen4 Max (30B MoE)
Flagship efficient model. 30B total with only 3B active via MoE.
Zen4 Pro Max (80B MoE) - Flagship
The ultimate consumer model. Hybrid Gated DeltaNet + Gated Attention + MoE architecture.
Coder Line
Specialized models for agentic programming and software engineering.
| Model | Parameters | Active | Context | License |
|---|---|---|---|---|
| Zen4 Coder Flash | 31B MoE | 3B | 131K | MIT |
| Zen4 Coder | 80B MoE | 3B | 256K | Apache 2.0 |
| Zen4 Coder Pro | 355B | 355B | 200K | MIT |
Zen4 Coder Flash (31B MoE)
Fast coding model for rapid iteration. MoE architecture for maximum efficiency.
Zen4 Coder (80B MoE) - Flagship Code
80B with 512-expert MoE for state-of-the-art code generation and agentic programming.
Zen4 Coder Pro (355B) - Cloud Only
Dense 355B coding powerhouse for maximum code intelligence.
Ultra Line
Trillion-parameter models for cloud deployment.
| Model | Parameters | Active | Context | Status |
|---|---|---|---|---|
| Zen4 Ultra | 1.04T MoE | 32B | 256K | Cloud Only |
| Zen4 Ultra Max | 1T+ MoE | TBD | 1M | Coming Soon |
Zen4 Ultra (1.04T MoE) - Cloud
Trillion-parameter frontier model with 384 experts and native vision capabilities.
Zen4 Ultra Max (1T+ MoE) - Coming Soon
Next-generation trillion-parameter model with 1M context window.
Zen5 — Next Generation
Zen5 Ultra — 2T+ parameter MoDE (Mixture of Distilled Experts) model. The largest open-weight model in history, trained on-chain via NVIDIA TEE confidential compute on hanzo.network.
| Model | Parameters | Active | Architecture | Context | Status |
|---|---|---|---|---|---|
| Zen5 Ultra | 2T+ MoDE | TBD | Mixture of Distilled Experts | 1M+ | Research Preview |
Key Features
- 2T+ parameters — the largest open-weight model ever released
- MoDE architecture — Mixture of Distilled Experts for efficient routing
- On-chain training — verifiable training via NVIDIA TEE on hanzo.network
- 1M+ context — full codebase and document understanding
- GT-QLoRA — Gate-Targeted fine-tuning for MoE behavioral modification (paper)
Request Research Access
Zen5 is in private research preview. Researchers and institutions can request early access to preprints, weights, and evaluation under a special research license.
- Request Research Access — for researchers and institutions
- Join Waitlist — for API access when available
- GT-QLoRA Paper — our MoE fine-tuning research
Formats
All locally-runnable models are available in multiple formats:
| Format | Use Case | Platform |
|---|---|---|
| SafeTensors | Full precision, transformers | All |
| GGUF | Quantized, llama.cpp/Ollama | All |
| MLX | Apple Silicon optimized | macOS |
Usage
from transformers import AutoModelForCausalLM, AutoTokenizer
# Load any Zen4 model
model = AutoModelForCausalLM.from_pretrained("zenlm/zen4-pro-max")
tokenizer = AutoTokenizer.from_pretrained("zenlm/zen4-pro-max")
messages = [{"role": "user", "content": "Hello!"}]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))Multimodal & Specialized
In addition to Zen4, the Zen LM family includes multimodal and specialized models:
| Model | Type | Description |
|---|---|---|
| zen-omni | Multimodal | Text + Vision + Audio |
| zen-vl | Vision-Language | Image understanding with function calling |
| zen-video | Video | Text-to-video and image-to-video generation |
| zen-3d | 3D Assets | 3D mesh generation from text/image |
All models available at huggingface.co/zenlm.