Introducing the Ettin Reranker Family
Hugging Face introduces the Ettin Reranker Family, a new set of reranking models designed to improve retrieval quality in information retrieval and RAG pipelines. The models appear to be purpose-built for reranking tasks, likely targeting enterprise and research use cases where retrieval precision matters. As a Hugging Face blog post, this represents a tooling/model release in the retrieval-augmented generation ecosystem.
Related guides (3)
Related events (8)
Training and Finetuning Reranker Models with Sentence Transformers
Hugging Face published a tutorial on training and fine-tuning reranker models using the Sentence Transformers library. Rerankers are cross-encoder models used in retrieval-augmented generation (RAG) and search pipelines to re-score candidate documents for improved relevance. The post covers dataset preparation, loss functions, and training configurations specific to reranking tasks.
Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers
Hugging Face published a blog post detailing how to train and finetune multimodal embedding and reranker models using the Sentence Transformers library. The post covers techniques for building models that can jointly embed text and images for retrieval and reranking tasks. This represents an extension of the Sentence Transformers ecosystem into multimodal territory, enabling practitioners to build cross-modal search and ranking systems.
Ettin Suite: SoTA Paired Encoders and Decoders
Hugging Face introduces the Ettin Suite, a collection of paired encoder and decoder models claiming state-of-the-art performance. The suite appears to offer jointly trained or architecturally matched encoder-decoder pairs, potentially useful for tasks requiring both embedding and generation capabilities. The blog post is published on the Hugging Face platform, positioning it as a notable open-weights or open-access model release.
Multimodal Embedding & Reranker Models with Sentence Transformers
Hugging Face's Sentence Transformers library has added support for multimodal embedding and reranking models, enabling joint text-image (and potentially other modality) representations within a unified framework. The update extends the library's existing text-focused embedding capabilities to handle cross-modal retrieval and reranking tasks. This lowers the barrier for practitioners building multimodal search and RAG pipelines using open-weights models.
Introducing RTEB: A New Standard for Retrieval Evaluation
Hugging Face introduces RTEB (Retrieval Text Embedding Benchmark), a new benchmark designed to standardize evaluation of retrieval systems and text embeddings. The benchmark aims to address gaps in existing evaluation frameworks by providing more comprehensive and realistic retrieval tasks. This represents an effort to improve how the community measures progress in retrieval-augmented generation and semantic search systems.
Introducing HUGS - Scale your AI with Open Models
Hugging Face announced HUGS (Hugging Face Generative Services), a new product aimed at helping enterprises scale AI deployments using open models. The service appears to target production inference infrastructure for open-weight models, positioning Hugging Face as a managed deployment layer. This is a product launch in the enterprise AI infrastructure space, competing with managed inference offerings from other providers.
Hugging Face open reproduction of DeepSeek-R1
Hugging Face has published an open reproduction of DeepSeek-R1, the reasoning-focused language model, on GitHub. The project aims to replicate DeepSeek-R1's training methodology and capabilities in an open-weights setting. This contributes to the broader effort to make frontier reasoning model techniques accessible to the research community.
Deploy Embedding Models with Hugging Face Inference Endpoints
Hugging Face published a guide on deploying embedding models using their Inference Endpoints service. The post covers how to set up dedicated endpoints for embedding models, enabling scalable vector generation for downstream tasks like semantic search and retrieval-augmented generation. This is part of Hugging Face's broader push to make production deployment of specialized model types more accessible.


