modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

granite-4.0-h-micro

ibm-granite

Granite 4.0 instruct models deliver strong performance across benchmarks, achieving industry-leading results in key agentic tasks like instruction following and function calling. These efficiencies make the models well-suited for a wide range of use cases like retrieval-augmented generation (RAG), multi-agent workflows, and edge deployments.

Context window
131,000 tokens
Input cost
$0.02 / 1M
Output cost
$0.11 / 1M
Latency (p50)