Back to models
unummodel
uform-gen2-qwen-500m
UForm-Gen is a small generative vision-language model primarily designed for Image Captioning and Visual Question Answering. The model was pre-trained on the internal image captioning dataset and fine-tuned on public instructions datasets: SVIT, LVIS, VQAs datasets.
Best for
Image UnderstandingVisual Q&AOCRMultimodal Tasks
Context Window
—
Input Cost
Free
Output Cost
—
Latency p50
—
Pricing Details
Standard PricingFree tier
Input (per 1M tokens)
$0.00
Output (per 1M tokens)
$0.00
Hallucination Score™ (est.)
Community reliability estimate · not official
—
Not yet rated
About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.
Price History
Not enough historical data yet. Check back after the next pricing sync.
Provider
unum
Community Prompts
Proven prompts shared by the community for this model
Loading prompts…
