6Hugging Face Blog·1mo ago

Falcon LLM Integrated into Hugging Face Ecosystem

Hugging Face announced the integration of the Falcon language models (Falcon-7B and Falcon-40B) into its ecosystem, including model hosting, inference APIs, and tooling support. Falcon, developed by the Technology Innovation Institute (TII), had recently topped the Open LLM Leaderboard at the time of release. The post covers usage patterns, fine-tuning guidance, and deployment options within the Hugging Face stack.

Open Weights Progress Inference Economics Agent and Tool Ecosystem Falcon-7B Open LLM Leaderboard Falcon-40B Hugging Face Technology Innovation Institute

Related guides (4)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

Open Weights ProgressTopic guide

Open Weights Progress: How Freely Available AI Models Caught Up to the Frontier

Read asBeginner In-depth

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How the Infrastructure Layer Around LLMs Is Consolidating

Read asIn-depth

Inference EconomicsTopic guide

Inference Economics: The Cost Structure of Running AI Models in Production

Read asIn-depth

Related events (8)

6Hugging Face Blog·1mo ago·source ↗

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

TII UAE has released Falcon-H1, a new family of hybrid-head language models combining attention and state-space mechanisms to improve efficiency and performance. The models are published on Hugging Face and represent TII's latest iteration in the Falcon series. The hybrid architecture targets better inference economics and competitive benchmark results relative to model size.

Frontier Model Releases Open Weights Progress Hugging Face Hybrid-Head Architecture Falcon-H1 +2 more

5Hugging Face Blog·1mo ago·source ↗

Falcon Perception: TII Announces Multimodal Perception Capabilities for Falcon

TII (Technology Innovation Institute) has published a blog post on Hugging Face introducing Falcon Perception, a multimodal extension of the Falcon model family. The post appears to detail perception capabilities added to the Falcon series, likely covering vision-language or other sensory modalities. As the body content is empty, specific technical details about architecture, benchmarks, or release scope are unavailable from this source.

Open Weights Progress Multimodal Progress Falcon Falcon Perception Hugging Face +1 more

6Hugging Face Blog·1mo ago·source ↗

Welcome to the Falcon 3 Family of Open Models

The Technology Innovation Institute (TII) has released the Falcon 3 family of open-weights models, announced via the Hugging Face blog. The release continues TII's Falcon lineage of open models and is positioned as a significant update to the series. Details on model sizes, capabilities, and training methodology are covered in the announcement.

Frontier Model Releases Open Weights Progress Falcon Hugging Face Technology Innovation Institute

5Hugging Face Blog·1mo ago·source ↗

Accelerate a World of LLMs on Hugging Face with NVIDIA NIM

NVIDIA NIM microservices are being integrated with Hugging Face to enable optimized inference deployment for a broad range of LLMs hosted on the Hub. The partnership allows developers to deploy Hugging Face models via NIM's containerized inference stack, leveraging NVIDIA's TensorRT-LLM and other optimizations. This expands the ecosystem of models accessible through NIM beyond NVIDIA's own catalog to the wider Hugging Face model repository.

Inference Economics Enterprise Deployment Patterns NVIDIA NIM NVIDIA TensorRT-LLM +2 more

4Hugging Face Blog·1mo ago·source ↗

Hugging Face and FriendliAI Partner to Supercharge Model Deployment on the Hub

Hugging Face and FriendliAI have announced a partnership to integrate FriendliAI's inference infrastructure directly into the Hugging Face Hub. The collaboration aims to simplify and accelerate model deployment for developers accessing models through the Hub. This expands the ecosystem of inference providers available on Hugging Face's platform.

Inference Economics Enterprise Deployment Patterns FriendliAI Hugging Face +1 more

6Hugging Face Blog·1mo ago·source ↗

Falcon 2: 11B Parameter Pretrained LLM and VLM Trained on 5T+ Tokens Across 11 Languages

Technology Innovation Institute (TII) has released Falcon 2, an 11B parameter language model pretrained on over 5 trillion tokens spanning 11 languages. The release includes both a base language model and a vision-language model (VLM) variant. This represents a significant update to the Falcon model family, expanding multilingual and multimodal capabilities.

Frontier Model Releases Open Weights Progress Falcon Hugging Face Falcon 2 VLM +2 more

8Hugging Face Blog·1mo ago·source ↗

Falcon 180B Released: New Open-Weights Frontier Model

Technology Innovation Institute (TII) has released Falcon 180B, a 180-billion parameter open-weights language model announced via Hugging Face. At the time of release, it was positioned as the largest publicly available open-weights model, trained on 3.5 trillion tokens. The model is available on Hugging Face Hub for research and commercial use under a custom license.

Frontier Model Releases Open Weights Progress Hugging Face Technology Innovation Institute Falcon 180B +1 more

4Hugging Face Blog·1mo ago·source ↗

Deploy LLMs with Hugging Face Inference Endpoints

Hugging Face published a guide on deploying large language models using their Inference Endpoints service. The post covers how to set up scalable, production-ready LLM deployments with minimal infrastructure overhead. It targets developers looking to move from experimentation to hosted inference without managing raw compute.

Inference Economics Enterprise Deployment Patterns Hugging Face Inference Endpoints Hugging Face