Almanac
← Events
6Hugging Face Blog·1mo ago

Ettin Suite: SoTA Paired Encoders and Decoders

Hugging Face introduces the Ettin Suite, a collection of paired encoder and decoder models claiming state-of-the-art performance. The suite appears to offer jointly trained or architecturally matched encoder-decoder pairs, potentially useful for tasks requiring both embedding and generation capabilities. The blog post is published on the Hugging Face platform, positioning it as a notable open-weights or open-access model release.

Related guides (3)

Related events (8)

5Hugging Face Blog·1mo ago·source ↗

Introducing the Ettin Reranker Family

Hugging Face introduces the Ettin Reranker Family, a new set of reranking models designed to improve retrieval quality in information retrieval and RAG pipelines. The models appear to be purpose-built for reranking tasks, likely targeting enterprise and research use cases where retrieval precision matters. As a Hugging Face blog post, this represents a tooling/model release in the retrieval-augmented generation ecosystem.

4Hugging Face Blog·1mo ago·source ↗

Training and Finetuning Embedding Models with Sentence Transformers

Hugging Face published a tutorial blog post on training and fine-tuning embedding models using the Sentence Transformers library. The post covers the workflow for customizing embedding models for downstream tasks such as semantic search and retrieval. As a tier-2 source with commentary depth, this serves as practical guidance for practitioners working with text embeddings.

5Hugging Face Blog·1mo ago·source ↗

Sentence Transformers Joins Hugging Face

Sentence Transformers, a widely-used library for generating sentence embeddings and semantic similarity, is officially joining Hugging Face. This integration brings the popular embedding framework under the Hugging Face ecosystem, likely enabling tighter integration with the Hub, datasets, and other HF tooling. The move consolidates a key component of the NLP/embedding pipeline within the dominant open-source AI platform.

5Hugging Face Blog·1mo ago·source ↗

Training and Finetuning Sparse Embedding Models with Sentence Transformers

Hugging Face published a tutorial on training and fine-tuning sparse embedding models using the Sentence Transformers library. Sparse embeddings offer an alternative to dense vector representations for retrieval tasks, potentially improving interpretability and efficiency. The post covers the tooling and workflows available in Sentence Transformers for producing sparse encoders suitable for search and RAG pipelines.

5Hugging Face Blog·1mo ago·source ↗

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Hugging Face published a blog post detailing how to train and finetune multimodal embedding and reranker models using the Sentence Transformers library. The post covers techniques for building models that can jointly embed text and images for retrieval and reranking tasks. This represents an extension of the Sentence Transformers ecosystem into multimodal territory, enabling practitioners to build cross-modal search and ranking systems.

5Hugging Face Blog·1mo ago·source ↗

Train 400x Faster Static Embedding Models with Sentence Transformers

Hugging Face's Sentence Transformers library introduces support for static embedding models that train up to 400x faster than transformer-based alternatives. Static embeddings use fixed token-level representations averaged or pooled without attention layers, dramatically reducing compute requirements. The post covers training methodology, trade-offs in embedding quality versus speed, and practical use cases where inference latency and training cost matter more than peak accuracy.

4Hugging Face Blog·1mo ago·source ↗

Deploy Embedding Models with Hugging Face Inference Endpoints

Hugging Face published a guide on deploying embedding models using their Inference Endpoints service. The post covers how to set up dedicated endpoints for embedding models, enabling scalable vector generation for downstream tasks like semantic search and retrieval-augmented generation. This is part of Hugging Face's broader push to make production deployment of specialized model types more accessible.

5Hugging Face Blog·1mo ago·source ↗

Multimodal Embedding & Reranker Models with Sentence Transformers

Hugging Face's Sentence Transformers library has added support for multimodal embedding and reranking models, enabling joint text-image (and potentially other modality) representations within a unified framework. The update extends the library's existing text-focused embedding capabilities to handle cross-modal retrieval and reranking tasks. This lowers the barrier for practitioners building multimodal search and RAG pipelines using open-weights models.