zai-org · model · Open Source
GLM-5-FP8
Open-source GLM-5-FP8 model from zai-org, available for download and self-hosting on Hugging Face.
Best for
General Chat · Summarisation · Content Generation
ollama run zai:org:glm:5:fp8
Context Window
—
Input Cost
Free
Output Cost
—
Latency p50
—
Pricing Details
Standard Pricing · Free tier
Input (per 1M tokens)
$0.00
Output (per 1M tokens)
$0.00
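As a rough illustration of how the per-1M-token prices above translate into a bill, the sketch below multiplies token counts by the listed rates. The function name `estimate_cost_usd` and its parameters are hypothetical, chosen for this example; the default rates are the $0.00 values shown on this page and do not include the hardware or hosting cost of running the model yourself.

```python
def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float = 0.0,   # listed input price per 1M tokens
    output_price_per_m: float = 0.0,  # listed output price per 1M tokens
) -> float:
    """Return the token cost in USD for one workload (hypothetical helper)."""
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost + output_cost


if __name__ == "__main__":
    # With the listed free-tier rates, any workload costs nothing in token fees.
    print(estimate_cost_usd(2_000_000, 500_000))
```

Passing non-zero rates lets the same helper compare this listing with a paid model.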
Run it locally
Local run command
pip install transformers accelerate && python -c "from transformers import AutoModelForCausalLM, AutoTokenizer; tok = AutoTokenizer.from_pretrained('zai-org/GLM-5-FP8'); model = AutoModelForCausalLM.from_pretrained('zai-org/GLM-5-FP8'); print('Loaded zai-org/GLM-5-FP8')"
Presumptive Specs:
GPU: 12+ GB VRAM; System RAM: 16+ GB; Disk: 20+ GB free
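Before starting a large download, it can help to compare a machine against the presumptive specs listed above. The following is a minimal sketch using only the Python standard library. The thresholds are copied from this page and are estimates, not verified requirements. The helper names `free_disk_gb` and `unmet_specs` are hypothetical.

```python
import shutil
from typing import List

# Thresholds copied from the presumptive specs on this page (estimates only).
MIN_VRAM_GB = 12
MIN_RAM_GB = 16
MIN_DISK_GB = 20


def free_disk_gb(path: str = ".") -> float:
    """Return free disk space at `path` in gigabytes (10**9 bytes)."""
    return shutil.disk_usage(path).free / 1e9


def unmet_specs(vram_gb: float, ram_gb: float, disk_gb: float) -> List[str]:
    """List which presumptive requirements the given machine falls short of."""
    missing = []
    if vram_gb < MIN_VRAM_GB:
        missing.append("GPU VRAM")
    if ram_gb < MIN_RAM_GB:
        missing.append("System RAM")
    if disk_gb < MIN_DISK_GB:
        missing.append("Disk")
    return missing


if __name__ == "__main__":
    # VRAM and RAM are supplied by hand here; detecting them automatically
    # would need third-party packages, which this sketch avoids.
    problems = unmet_specs(vram_gb=12, ram_gb=16, disk_gb=free_disk_gb())
    print("OK" if not problems else "Below presumptive specs: " + ", ".join(problems))
```

An empty list means the machine meets the listed estimates. It does not guarantee that the model will load.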
Hallucination Score™ (est.)
Community reliability estimate · not official
—
Not yet rated
About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.
Price History
Not enough historical data yet. Check back after the next pricing sync.
Provider
zai-org
Community Prompts
Proven prompts shared by the community for this model
