modelstop.top
Back to models
deepinframodel

Qwen/Qwen3-VL-30B-A3B-Instruct

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

Best for

Image UnderstandingVisual Q&AOCRInstruction Following
Context Window
131K tokens ≈ 291 pages of text
Input Cost
Free
Output Cost
Latency p50

Pricing Details

No pricing data. Model may be free or requires direct access.

Hallucination Score™ (est.)

Community reliability estimate · not official

Not yet rated

About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.

Price History

Not enough historical data yet. Check back after the next pricing sync.

Provider

deepinfra

Community Prompts

Proven prompts shared by the community for this model

Loading prompts…
Qwen/Qwen3-VL-30B-A3B-Instruct — modelstop.top