modelstop.top — Every AI Model, One Place

whisper-tiny-en

openai

Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. Trained on 680k hours of labelled data, Whisper models demonstrate a strong ability to generalize to many datasets and domains without the need for fine-tuning. This is the English-only version of the Whisper Tiny model which was trained on the task of speech recognition.

audiofree

InputFree

Output$0.0000/1M

⚡81msp50

Explore specs and pricingView details →

whisper-large-v3-turbo

openai

Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation.

audiofree

InputFree

Output$0.0000/1M

⚡70msp50

Explore specs and pricingView details →

whisper

openai

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.

audiomultilingualfree

InputFree

Output$0.0000/1M

⚡50msp50

Explore specs and pricingView details →

whisper-medium

openai

Open-source whisper-medium model from openai — available for download and self-hosting on Hugging Face.

audiofree

Run locally

InputFree

Explore specs and pricingView details →

whisper-tiny

openai

Open-source whisper-tiny model from openai — available for download and self-hosting on Hugging Face.

audiofree

InputFree

Explore specs and pricingView details →

whisper-large-v3

openai

Open-source whisper-large-v3 model from openai — available for download and self-hosting on Hugging Face.

audiofree

Run locally

InputFree

⚡172msp50

Explore specs and pricingView details →

whisper-large-v3-turbo

openai

Open-source whisper-large-v3-turbo model from openai — available for download and self-hosting on Hugging Face.

audiofree

Run locally

InputFree

Explore specs and pricingView details →

whisper-base

openai

Open-source whisper-base model from openai — available for download and self-hosting on Hugging Face.

audiofree

Run locally

InputFree

Explore specs and pricingView details →

whisper-small

openai

Open-source whisper-small model from openai — available for download and self-hosting on Hugging Face.

audiofree

Run locally

InputFree

Explore specs and pricingView details →

gpt-audio-mini-2025-12-15

openai

textaudiofree

InputFree

⚡35msp50

Explore specs and pricingView details →

gpt-audio-1.5

openai

textaudiofree

InputFree

⚡36msp50

Explore specs and pricingView details →

⭐Top Rated

gpt-4o-mini-audio-preview

openai

textaudiofree

InputFree

⭐1270.0%score

Explore specs and pricingView details →

gpt-4o-mini-audio-preview-2024-12-17

openai

textaudiofree

InputFree

Explore specs and pricingView details →

⭐Top Rated

gpt-4o-audio-preview-2024-12-17

openai

textaudiofree

InputFree

⭐1270.0%score

Explore specs and pricingView details →

gpt-audio-2025-08-28

openai

textaudiofree

InputFree

⚡50msp50

Explore specs and pricingView details →

gpt-audio-mini-2025-10-06

openai

textaudiofree

InputFree

⚡38msp50

Explore specs and pricingView details →

gpt-4o-audio-preview-2025-06-03

openai

textaudiofree

InputFree

Explore specs and pricingView details →

OpenAI: GPT-4o Audio

openai

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs...

textaudiolong-context

Input$2.5000/1M

Output$10.0000/1M

📏128kcontext

Explore specs and pricingView details →

OpenAI: GPT Audio Mini

openai

A cost-efficient version of GPT Audio. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Input is priced at $0.60 per million...

textaudiocheap

Run locally

Input$0.6000/1M

Output$2.4000/1M

📏128kcontext

⚡39msp50

Explore specs and pricingView details →

OpenAI: GPT Audio

openai

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced...

textaudiolong-context

Run locally

Input$2.5000/1M

Output$10.0000/1M

📏128kcontext

⚡36msp50

Explore specs and pricingView details →

AI Model Catalogue

whisper-tiny-en

whisper-large-v3-turbo

whisper

whisper-medium

whisper-tiny

whisper-large-v3

whisper-large-v3-turbo

whisper-base

whisper-small

gpt-audio-mini-2025-12-15

gpt-audio-1.5

gpt-4o-mini-audio-preview

gpt-4o-mini-audio-preview-2024-12-17

gpt-4o-audio-preview-2024-12-17

gpt-audio-2025-08-28

gpt-audio-mini-2025-10-06

gpt-4o-audio-preview-2025-06-03

OpenAI: GPT-4o Audio

OpenAI: GPT Audio Mini

OpenAI: GPT Audio