Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B
This Hugging Face blog post demonstrates a Text-to-SQL pipeline combining the Hugging Face Dataset Viewer API with MotherDuck's DuckDB-NSQL-7B model, a 7-billion parameter model fine-tuned for natural language to SQL translation. The post walks through using the model to query datasets stored on Hugging Face via DuckDB. It represents a practical integration of a domain-specialized open-weights model with a data infrastructure tool.
Related guides (3)
Related events (8)
DuckDB Integration for Analyzing 50,000+ Datasets on Hugging Face Hub
Hugging Face announced a DuckDB integration enabling direct SQL-based analysis of over 50,000 datasets hosted on the Hub without downloading them. The integration allows users to query dataset metadata, statistics, and contents using DuckDB's in-process analytical engine. This lowers the barrier to dataset discovery and exploration at scale across the Hugging Face ecosystem.
Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models
This Hugging Face blog post provides a technical guide for fine-tuning Microsoft's Florence-2 vision-language models. Florence-2 is a compact yet capable multimodal model supporting tasks like captioning, object detection, and OCR. The post covers practical implementation details for adapting the model to custom datasets using the Hugging Face ecosystem.
DeepSeek releases DeepSeek-OCR-2 vision-language model on Hugging Face
DeepSeek has released DeepSeek-OCR-2, a multilingual image-text-to-text model on Hugging Face, built on the DeepSeek-VL-v2 architecture and tagged for OCR and vision-language tasks. The model has accumulated over 1.8 million downloads and 980 likes, indicating substantial community uptake. It extends DeepSeek's multimodal model lineup with a specialized document/OCR capability.
DeepSeek releases DeepSeek-OCR vision-language model on Hugging Face
DeepSeek has released DeepSeek-OCR, a multilingual image-text-to-text model on Hugging Face, built on the DeepSeek-VL-v2 architecture. The model targets OCR and image feature extraction tasks and has accumulated over 2.4 million downloads and 3,275 likes, indicating significant community uptake. This represents an open-weights multimodal release from a major Chinese AI lab.
Qwen releases Qwen3.5-9B-Base multimodal model on Hugging Face
Qwen has released Qwen3.5-9B-Base, a 9-billion-parameter image-text-to-text base model on Hugging Face. The model supports conversational use and is compatible with the transformers library and inference endpoints. With over 153,000 downloads, it has seen substantial early adoption.
Qwen releases Qwen3.6-27B multimodal model on Hugging Face
Qwen published Qwen3.6-27B, a 27-billion-parameter image-text-to-text model, on Hugging Face. The model supports conversational use and is compatible with Azure deployment endpoints. With over 5.4 million downloads and 1,619 likes, it has seen substantial community uptake.
Databricks + Hugging Face Integration Achieves Up to 40% Faster LLM Training and Tuning
Databricks and Hugging Face have published a case study describing their integration that delivers up to 40% faster training and fine-tuning of large language models. The collaboration leverages Databricks' distributed compute infrastructure alongside Hugging Face's model hub and training libraries. This represents a practical infrastructure optimization for enterprise teams running LLM workloads on Databricks.
Deploying Speech-to-Speech on Hugging Face
Hugging Face published a guide on deploying speech-to-speech (S2S) pipelines using their Inference Endpoints infrastructure. The post covers the technical setup for combining speech recognition, language model inference, and text-to-speech components into a unified real-time pipeline. This represents a practical deployment pattern for voice-based AI applications on managed cloud infrastructure.


