4Hugging Face Blog·1mo ago

Deploy Meta Llama 3.1 405B on Google Cloud Vertex AI

Hugging Face published a guide detailing how to deploy Meta's Llama 3.1 405B model on Google Cloud Vertex AI. The post covers infrastructure setup, serving configuration, and integration patterns for running the large open-weights model in a managed cloud environment. This reflects the growing ecosystem of tooling and cloud partnerships enabling enterprise deployment of frontier open-weights models.

Open Weights Progress Inference Economics Enterprise Deployment Patterns Meta Llama 3.1 405B Google Cloud Vertex AI Hugging Face Meta

Related guides (4)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

Open Weights ProgressTopic guide

Open Weights Progress: How Freely Available AI Models Caught Up to the Frontier

Read asBeginner In-depth

Enterprise Deployment PatternsTopic guide

Enterprise Deployment Patterns: From LLM Demo to Production Reality

Read asIn-depth

Inference EconomicsTopic guide

Inference Economics: The Cost Structure of Running AI Models in Production

Read asIn-depth

Related events (8)

8Hugging Face Blog·1mo ago·source ↗

Welcome Llama 3 - Meta's new open LLM

Hugging Face published a blog post welcoming Meta's Llama 3 release, covering the new open-weights large language models. Llama 3 represents a significant update to Meta's open model family, with improved capabilities over Llama 2. The post covers integration and availability on the Hugging Face platform.

Frontier Model Releases Open Weights Progress Llama 2 Llama 3 Hugging Face +2 more

8Hugging Face Blog·1mo ago·source ↗

Llama 3.2 Multimodal and Edge Models Launch on Hugging Face

Meta released Llama 3.2, introducing vision-capable multimodal models alongside lightweight models optimized for on-device inference. Hugging Face published a blog post covering integration support, model availability, and deployment options across the ecosystem. The release marks Meta's first open-weights multimodal Llama models, adding image understanding to the Llama family. Smaller 1B and 3B parameter variants target edge and mobile deployment scenarios.

Frontier Model Releases Open Weights Progress Llama 3.2 Hugging Face Meta +3 more

6Hugging Face Blog·1mo ago·source ↗

Making thousands of open LLMs bloom in the Vertex AI Model Garden

Hugging Face and Google Cloud announced an integration bringing thousands of open-source LLMs from the Hugging Face Hub into Vertex AI Model Garden. This partnership allows developers to deploy open-weight models directly through Google Cloud's managed infrastructure. The collaboration represents a significant expansion of enterprise-accessible open model deployment options on a major cloud platform.

Open Weights Progress Inference Economics Google Cloud Vertex AI Model Garden Hugging Face +1 more

9Hugging Face Blog·1mo ago·source ↗

Llama 3.1 Released: 405B, 70B & 8B Models with Multilinguality and Long Context

Meta released Llama 3.1, a family of open-weights models at three scales (405B, 70B, 8B) featuring multilingual support and extended context windows. The 405B model represents Meta's largest open-weights release to date, positioning it as a frontier-class open model. Hugging Face published a blog post covering the release, integration details, and deployment options across the ecosystem.

Long Context Evolution Frontier Model Releases Llama 3.1 70B Meta Llama 3.1 405B Hugging Face +5 more

7Meta Llama·11d ago·source ↗

Meta releases Llama 3.2 90B Vision multimodal model on Hugging Face

Meta released Llama 3.2 90B Vision, a large multimodal model supporting image-text-to-text tasks, published on Hugging Face under the meta-llama organization. The model is part of the Llama 3.2 family and supports English, German, and French. This is a significant open-weights multimodal release from Meta, extending the Llama 3 series with vision capabilities at the 90B parameter scale.

Frontier Model Releases Open Weights Progress Llama 3.2 90B Vision Hugging Face Meta +1 more

7Meta Llama·11d ago·source ↗

Meta releases Llama 3.2 90B Vision-Instruct multimodal model

Meta released Llama 3.2 90B Vision-Instruct on Hugging Face, a large multimodal model supporting image-text-to-text tasks. The model is part of the Llama 3.2 family and supports English and German. With 858 downloads and 358 likes, it represents Meta's open-weights push into vision-language capabilities at the 90B parameter scale.

Frontier Model Releases Open Weights Progress Hugging Face Meta Llama 3.2 90B Vision-Instruct +1 more

6Meta Llama·11d ago·source ↗

Meta releases Llama 3.2-3B open-weights text generation model

Meta released Llama 3.2-3B, a 3-billion parameter open-weights language model, on Hugging Face under the meta-llama organization. The model supports multiple languages including English, German, French, and Italian, and uses the standard transformers/safetensors format. With over 900K downloads and 800+ likes, it has seen substantial community adoption.

Frontier Model Releases Open Weights Progress Llama 3.2 Meta

7Meta Llama·11d ago·source ↗

Meta releases Llama 3.2 11B Vision multimodal model on Hugging Face

Meta released Llama 3.2 11B Vision, an open-weights image-text-to-text model, on Hugging Face. The model is part of the Llama 3.2 family and supports multiple languages including English, German, and French. This represents Meta's entry into open-weights multimodal models at the 11B parameter scale.

Open Weights Progress Multimodal Progress Llama 3.2 11B Vision Hugging Face Meta