modelstop.top
Back to models
qwenmodel

Qwen: Qwen3 VL 30B A3B Instruct

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

Best for

Image UnderstandingVisual Q&AOCRMultimodal Tasks
Context Window
262K tokens ≈ 583 pages of text
Input Cost
$0.13/1M
Output Cost
$0.52/1M
Latency p50

Pricing Details

Standard Pricing
Input (per 1M tokens)
$0.13
Output (per 1M tokens)
$0.52

Run it locally

Local run command

# Refer to provider docs: 

Presumptive Specs:

Typical GPU

12–24 GB VRAM

System RAM

16+ GB

Disk

20+ GB free

Runtime

Docker / local runtime

Hallucination Score™ (est.)

Community reliability estimate · not official

Not yet rated

About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.

Price History

Not enough historical data yet. Check back after the next pricing sync.

Provider

qwen

Community Prompts

Proven prompts shared by the community for this model

Loading prompts…