4Hugging Face Blog·1mo ago

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

This Hugging Face blog post demonstrates a Text-to-SQL pipeline combining the Hugging Face Dataset Viewer API with MotherDuck's DuckDB-NSQL-7B model, a 7-billion parameter model fine-tuned for natural language to SQL translation. The post walks through using the model to query datasets stored on Hugging Face via DuckDB. It represents a practical integration of a domain-specialized open-weights model with a data infrastructure tool.

Enterprise Deployment Patterns Agent and Tool Ecosystem DuckDB Hugging Face DuckDB-NSQL-7B MotherDuck Hugging Face Dataset Viewer API

Related guides (3)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

Enterprise Deployment PatternsTopic guide

Enterprise Deployment Patterns: From AI Demo to Production Reality

Read asBeginner In-depth

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How AI Is Learning to Act, Not Just Answer

Read asBeginner In-depth

Related events (8)

4Hugging Face Blog·1mo ago·source ↗

DuckDB Integration for Analyzing 50,000+ Datasets on Hugging Face Hub

Hugging Face announced a DuckDB integration enabling direct SQL-based analysis of over 50,000 datasets hosted on the Hub without downloading them. The integration allows users to query dataset metadata, statistics, and contents using DuckDB's in-process analytical engine. This lowers the barrier to dataset discovery and exploration at scale across the Hugging Face ecosystem.

Enterprise Deployment Patterns Agent and Tool Ecosystem DuckDB Hugging Face

5Hugging Face Blog·1mo ago·source ↗

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

This Hugging Face blog post provides a technical guide for fine-tuning Microsoft's Florence-2 vision-language models. Florence-2 is a compact yet capable multimodal model supporting tasks like captioning, object detection, and OCR. The post covers practical implementation details for adapting the model to custom datasets using the Hugging Face ecosystem.

Enterprise Deployment Patterns Agent and Tool Ecosystem Microsoft Hugging Face Florence-2 +1 more

6Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-OCR-2 vision-language model on Hugging Face

DeepSeek has released DeepSeek-OCR-2, a multilingual image-text-to-text model on Hugging Face, built on the DeepSeek-VL-v2 architecture and tagged for OCR and vision-language tasks. The model has accumulated over 1.8 million downloads and 980 likes, indicating substantial community uptake. It extends DeepSeek's multimodal model lineup with a specialized document/OCR capability.

Open Weights Progress Multimodal Progress DeepSeek-OCR-2 DeepSeek V4 Hugging Face

6Deepseek·11d ago·source ↗

DeepSeek releases DeepSeek-OCR vision-language model on Hugging Face

DeepSeek has released DeepSeek-OCR, a multilingual image-text-to-text model on Hugging Face, built on the DeepSeek-VL-v2 architecture. The model targets OCR and image feature extraction tasks and has accumulated over 2.4 million downloads and 3,275 likes, indicating significant community uptake. This represents an open-weights multimodal release from a major Chinese AI lab.

Open Weights Progress Multimodal Progress DeepSeek-OCR-2 DeepSeek V4

6Qwen·15d ago·source ↗

Qwen releases Qwen3.5-9B-Base multimodal model on Hugging Face

Qwen has released Qwen3.5-9B-Base, a 9-billion-parameter image-text-to-text base model on Hugging Face. The model supports conversational use and is compatible with the transformers library and inference endpoints. With over 153,000 downloads, it has seen substantial early adoption.

Frontier Model Releases Open Weights Progress Qwen3.5-2B-Base Qwen +1 more

6Qwen·15d ago·source ↗

Qwen releases Qwen3.6-27B multimodal model on Hugging Face

Qwen published Qwen3.6-27B, a 27-billion-parameter image-text-to-text model, on Hugging Face. The model supports conversational use and is compatible with Azure deployment endpoints. With over 5.4 million downloads and 1,619 likes, it has seen substantial community uptake.

Frontier Model Releases Open Weights Progress Qwen3.6-27B Qwen Hugging Face +1 more

5Hugging Face Blog·1mo ago·source ↗

Databricks + Hugging Face Integration Achieves Up to 40% Faster LLM Training and Tuning

Databricks and Hugging Face have published a case study describing their integration that delivers up to 40% faster training and fine-tuning of large language models. The collaboration leverages Databricks' distributed compute infrastructure alongside Hugging Face's model hub and training libraries. This represents a practical infrastructure optimization for enterprise teams running LLM workloads on Databricks.

Training Infrastructure Enterprise Deployment Patterns Databricks Hugging Face

4Hugging Face Blog·1mo ago·source ↗

Deploying Speech-to-Speech on Hugging Face

Hugging Face published a guide on deploying speech-to-speech (S2S) pipelines using their Inference Endpoints infrastructure. The post covers the technical setup for combining speech recognition, language model inference, and text-to-speech components into a unified real-time pipeline. This represents a practical deployment pattern for voice-based AI applications on managed cloud infrastructure.

Inference Economics Enterprise Deployment Patterns Hugging Face Inference Endpoints Speech-to-Speech Hugging Face +1 more