SmolVLM
smolvlm-65368af4·3 events·first seen 28d agoAliases: SmolVLM
Co-occurring entities
More like this (12)
Recent events (3)
SmolVLM - Small Yet Mighty Vision Language Model
Hugging Face introduces SmolVLM, a compact vision-language model designed to deliver strong multimodal performance at small parameter counts. The model targets edge and resource-constrained deployment scenarios while maintaining competitive capabilities relative to its size. The announcement highlights efficiency improvements in both training and inference for small-scale VLMs.
SmolVLM Grows Smaller – Introducing the 256M & 500M Models
Hugging Face has released two new ultra-compact vision-language models, SmolVLM-256M and SmolVLM-500M, extending the SmolVLM family to sub-billion parameter sizes. These models are designed for on-device and resource-constrained deployment scenarios. The release continues the trend of pushing capable multimodal models into smaller footprints suitable for edge inference.
SmolVLM2: Bringing Video Understanding to Every Device
Hugging Face introduces SmolVLM2, a family of compact vision-language models designed for video understanding on resource-constrained devices. The models extend the SmolVLM line with video comprehension capabilities while maintaining small footprints suitable for edge and on-device deployment. The release targets democratizing multimodal video understanding beyond cloud-only inference.