6Hugging Face Blog·1mo ago

Ettin Suite: SoTA Paired Encoders and Decoders

Hugging Face introduces the Ettin Suite, a collection of paired encoder and decoder models claiming state-of-the-art performance. The suite appears to offer jointly trained or architecturally matched encoder-decoder pairs, potentially useful for tasks requiring both embedding and generation capabilities. The blog post is published on the Hugging Face platform, positioning it as a notable open-weights or open-access model release.

Frontier Model Releases Evaluation and Benchmarking Open Weights Progress Ettin Suite Hugging Face

Related guides (3)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

Open Weights ProgressTopic guide

Open Weights Progress: How Freely Available AI Models Caught Up to the Frontier

Read asBeginner In-depth

Related events (8)

5Hugging Face Blog·1mo ago·source ↗

Introducing the Ettin Reranker Family

Hugging Face introduces the Ettin Reranker Family, a new set of reranking models designed to improve retrieval quality in information retrieval and RAG pipelines. The models appear to be purpose-built for reranking tasks, likely targeting enterprise and research use cases where retrieval precision matters. As a Hugging Face blog post, this represents a tooling/model release in the retrieval-augmented generation ecosystem.

Enterprise Deployment Patterns Agent and Tool Ecosystem Ettin Reranker Hugging Face

4Hugging Face Blog·1mo ago·source ↗

Training and Finetuning Embedding Models with Sentence Transformers

Hugging Face published a tutorial blog post on training and fine-tuning embedding models using the Sentence Transformers library. The post covers the workflow for customizing embedding models for downstream tasks such as semantic search and retrieval. As a tier-2 source with commentary depth, this serves as practical guidance for practitioners working with text embeddings.

Enterprise Deployment Patterns Agent and Tool Ecosystem embedding models Hugging Face Sentence Transformers

5Hugging Face Blog·1mo ago·source ↗

Sentence Transformers Joins Hugging Face

Sentence Transformers, a widely-used library for generating sentence embeddings and semantic similarity, is officially joining Hugging Face. This integration brings the popular embedding framework under the Hugging Face ecosystem, likely enabling tighter integration with the Hub, datasets, and other HF tooling. The move consolidates a key component of the NLP/embedding pipeline within the dominant open-source AI platform.

Enterprise Deployment Patterns Agent and Tool Ecosystem Hugging Face Sentence Transformers

5Hugging Face Blog·1mo ago·source ↗

Training and Finetuning Sparse Embedding Models with Sentence Transformers

Hugging Face published a tutorial on training and fine-tuning sparse embedding models using the Sentence Transformers library. Sparse embeddings offer an alternative to dense vector representations for retrieval tasks, potentially improving interpretability and efficiency. The post covers the tooling and workflows available in Sentence Transformers for producing sparse encoders suitable for search and RAG pipelines.

Inference Economics Agent and Tool Ecosystem Sparse Embedding Models Hugging Face Sentence Transformers

5Hugging Face Blog·1mo ago·source ↗

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Hugging Face published a blog post detailing how to train and finetune multimodal embedding and reranker models using the Sentence Transformers library. The post covers techniques for building models that can jointly embed text and images for retrieval and reranking tasks. This represents an extension of the Sentence Transformers ecosystem into multimodal territory, enabling practitioners to build cross-modal search and ranking systems.

Agent and Tool Ecosystem Multimodal Progress reranker models multimodal embedding Hugging Face +1 more

5Hugging Face Blog·1mo ago·source ↗

Train 400x Faster Static Embedding Models with Sentence Transformers

Hugging Face's Sentence Transformers library introduces support for static embedding models that train up to 400x faster than transformer-based alternatives. Static embeddings use fixed token-level representations averaged or pooled without attention layers, dramatically reducing compute requirements. The post covers training methodology, trade-offs in embedding quality versus speed, and practical use cases where inference latency and training cost matter more than peak accuracy.

Inference Economics Agent and Tool Ecosystem Hugging Face Sentence Transformers static embeddings

4Hugging Face Blog·1mo ago·source ↗

Deploy Embedding Models with Hugging Face Inference Endpoints

Hugging Face published a guide on deploying embedding models using their Inference Endpoints service. The post covers how to set up dedicated endpoints for embedding models, enabling scalable vector generation for downstream tasks like semantic search and retrieval-augmented generation. This is part of Hugging Face's broader push to make production deployment of specialized model types more accessible.

Inference Economics Enterprise Deployment Patterns Hugging Face Inference Endpoints embedding models Hugging Face +2 more

5Hugging Face Blog·1mo ago·source ↗

Multimodal Embedding & Reranker Models with Sentence Transformers

Hugging Face's Sentence Transformers library has added support for multimodal embedding and reranking models, enabling joint text-image (and potentially other modality) representations within a unified framework. The update extends the library's existing text-focused embedding capabilities to handle cross-modal retrieval and reranking tasks. This lowers the barrier for practitioners building multimodal search and RAG pipelines using open-weights models.

Inference Economics Agent and Tool Ecosystem multimodal embedding reranking Hugging Face +2 more