uform-gen2-qwen-500m

UForm-Gen is a small generative vision-language model designed primarily for image captioning and visual question answering. The model was pre-trained on an internal image captioning dataset and fine-tuned on public instruction datasets: SVIT, LVIS, and VQA datasets.

Best for

Image Understanding, Visual Q&A, OCR, Multimodal Tasks
Context Window: not listed
Input Cost: Free
Output Cost: not listed
Latency p50: not listed

Pricing Details

Standard Pricing (Free tier)
Input (per 1M tokens): $0.00
Output (per 1M tokens): $0.00
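
Because prices are quoted per 1M tokens, the cost of a request is the token count divided by one million, multiplied by the listed rate. The sketch below illustrates that arithmetic only. The function name `estimate_cost_usd` and its parameters are hypothetical and not part of any provider API; the defaults mirror the $0.00 rates listed above.

```python
def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_million: float = 0.00,   # listed input rate (USD per 1M tokens)
    output_price_per_million: float = 0.00,  # listed output rate (USD per 1M tokens)
) -> float:
    """Estimate request cost in USD from per-1M-token rates (illustrative helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = (input_tokens / 1_000_000) * input_price_per_million
    output_cost = (output_tokens / 1_000_000) * output_price_per_million
    return input_cost + output_cost


# At the listed free-tier rates, any request costs nothing.
free_tier_cost = estimate_cost_usd(input_tokens=250_000, output_tokens=50_000)
```

Passing different rates shows how the same helper would apply if the listed pricing changed.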

Hallucination Score™ (est.)

Community reliability estimate · not official

Not yet rated

About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.

Provider

unum
