modelstop.top

Compare Models

Run side-by-side checks for pricing, context window, and latency.

bge-reranker-base

baai

Different from embedding model, reranker uses question and document as input and directly output similarity instead of embedding. You can get a relevance score by inputting query and passage to the reranker. And the score can be mapped to a float value in [0,1] by sigmoid function.

Context window
tokens
Input cost
$0.00 / 1M
Output cost
$0.00 / 1M
Latency (p50)