unummodel

uform-gen2-qwen-500m

UForm-Gen is a small generative vision-language model primarily designed for Image Captioning and Visual Question Answering. The model was pre-trained on the internal image captioning dataset and fine-tuned on public instructions datasets: SVIT, LVIS, VQAs datasets.

text vision image multimodal free

Best for

Image UnderstandingVisual Q&AOCRMultimodal Tasks

Try in Playground Compare →Alternatives →

Context Window

—

Input Cost

Free

Output Cost

—

Latency p50

—

Pricing Details

Standard PricingFree tier

Input (per 1M tokens)

$0.00

Output (per 1M tokens)

$0.00

Hallucination Score™ (est.)

Community reliability estimate · not official

—

Not yet rated

About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.

Price History

Not enough historical data yet. Check back after the next pricing sync.

Provider

unum

Usage & Examples

Learn how to use uform-gen2-qwen-500m

✓Complex reasoning and analysis

✓Customer support automation

✓Content generation and editing

✓Code review and debugging

✓Research summarization

💡 Tip: Start with the sample prompts above to see how uform-gen2-qwen-500m works best.

Community Prompts

Proven prompts shared by the community for this model

Loading prompts…