modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

Goliath 120B

alpindale

A large LLM created by combining two fine-tuned Llama 70B models into one 120B model. Combines Xwin and Euryale. Credits to - [@chargoddard](https://huggingface.co/chargoddard) for developing the framework used to merge...

Context window
6,144 tokens
Input cost
$3.75 / 1M
Output cost
$7.50 / 1M
Latency (p50)