granite-4.0-h-micro
ibm-granite
Granite 4.0 instruct models deliver strong performance across benchmarks, achieving industry-leading results in key agentic tasks like instruction following and function calling. This makes the models well suited to a wide range of use cases such as retrieval-augmented generation (RAG), multi-agent workflows, and edge deployments.
- Context window: 131,000 tokens
- Input cost: $0.02 / 1M tokens
- Output cost: $0.11 / 1M tokens
- Latency (p50): —
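
The listed prices can be turned into a rough per-request estimate. The sketch below is illustrative only: the function names `estimate_cost_usd` and `fits_context` are hypothetical, the prices and context window are copied from the list above, and it assumes that prompt and completion tokens share the same 131,000-token window, which the listing does not state.

```python
from decimal import Decimal

# Values copied from the listing above (USD per 1M tokens, and tokens).
INPUT_USD_PER_MTOK = Decimal("0.02")
OUTPUT_USD_PER_MTOK = Decimal("0.11")
CONTEXT_WINDOW_TOKENS = 131_000

TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request in USD from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MTOK
        + output_tokens * OUTPUT_USD_PER_MTOK
    )
    return total / TOKENS_PER_MTOK


def fits_context(input_tokens: int, output_tokens: int) -> bool:
    """Check a request against the context window.

    Assumption (not stated in the listing): input and output tokens
    both count toward the same window.
    """
    return input_tokens + output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    # Hypothetical RAG-style request: long prompt, short answer.
    print(estimate_cost_usd(100_000, 2_000))
    print(fits_context(100_000, 2_000))
```

For a request with 100,000 input tokens and 2,000 output tokens, this works out to 100,000 × $0.02 / 1M + 2,000 × $0.11 / 1M = $0.00222. `Decimal` is used instead of floats so that small per-token prices do not pick up rounding noise.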
