Back to modelsRun locally
huggingfacetbmodelOpen Source
SmolLM2-135M
Open-source SmolLM2-135M model from huggingfacetb — available for download and self-hosting on Hugging Face.
Best for
General ChatSummarisationContent Generation
ollama run huggingfacetb:smollm2:135mContext Window
—
Input Cost
Free
Output Cost
—
Latency p50
—
Pricing Details
Standard PricingFree tier
Input (per 1M tokens)
$0.00
Output (per 1M tokens)
$0.00
Run it locally
Local run command
pip install transformers accelerate && python -c "from transformers import AutoModelForCausalLM, AutoTokenizer; tok = AutoTokenizer.from_pretrained('huggingfacetb/SmolLM2-135M'); model = AutoModelForCausalLM.from_pretrained('huggingfacetb/SmolLM2-135M'); print('Loaded huggingfacetb/SmolLM2-135M')"Presumptive Specs:
GPU: 12+ GB VRAM; System RAM: 16+ GB; Disk: 20+ GB free
Hallucination Score™ (est.)
Community reliability estimate · not official
—
Not yet rated
About this score: Community-estimated based on user reports and publicly available benchmark data (e.g. TruthfulQA). This is not an official score from the model provider. Scores may be inaccurate — always verify with the official leaderboard before making production decisions.
Price History
Not enough historical data yet. Check back after the next pricing sync.
Provider
huggingfacetb
Community Prompts
Proven prompts shared by the community for this model
Loading prompts…
