modelstop.top
Back to models
zai-orgmodelOpen Source

GLM-5-FP8

Open-source GLM-5-FP8 model from zai-org — available for download and self-hosting on Hugging Face.

Best for

General ChatSummarisationContent Generation
ollama run zai:org:glm:5:fp8
Run locally
Context Window
Input Cost
Free
Output Cost
Latency p50

Pricing Details

Standard PricingFree tier
Input (per 1M tokens)
$0.00
Output (per 1M tokens)
$0.00

Run it locally

Local run command

pip install transformers accelerate && python -c "from transformers import AutoModelForCausalLM, AutoTokenizer; tok = AutoTokenizer.from_pretrained('zai-org/GLM-5-FP8'); model = AutoModelForCausalLM.from_pretrained('zai-org/GLM-5-FP8'); print('Loaded zai-org/GLM-5-FP8')"

Presumptive Specs:

GPU: 12+ GB VRAM; System RAM: 16+ GB; Disk: 20+ GB free

Hallucination Score™ (est.)

Community reliability estimate · not official

Not yet rated

About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.

Price History

Not enough historical data yet. Check back after the next pricing sync.

Provider

zai-org

Community Prompts

Proven prompts shared by the community for this model

Loading prompts…
GLM-5-FP8 — modelstop.top