
NDCG

benchmark · active · provisional · ndcg-16d136c0 · 2 events · first seen 15d ago

Aliases: NDCG

Recent events (2)

6 · Mistral AI News · 15d ago · source ↗

Mistral Releases Search Toolkit: Open-Source Composable Framework for Production RAG and Enterprise Search Pipelines

Mistral AI has launched Search Toolkit in public preview, an open-source framework that unifies document ingestion, retrieval, and evaluation into a single composable pipeline for AI applications. The toolkit ships with BM25 sparse retrieval, dense embedding-based retrieval, hybrid configurations, and built-in metrics (recall, precision, MRR, NDCG), targeting enterprise RAG workflows, domain-specific retrieval, and agentic systems. It integrates with MCP-based Connectors for live data access from CRMs, code repositories, and productivity tools. CMA CGM is cited as a production user, combining Search Toolkit with Voxtral for real-time fake news detection across audio sources.
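
For readers unfamiliar with the metric this page tracks, here is a minimal sketch of NDCG@k in plain Python. It is not taken from Search Toolkit and does not use its API. The function names `dcg_at_k` and `ndcg_at_k` are hypothetical, and the sketch assumes linear gain (the raw relevance grade) and an ideal ordering built only from the grades of the retrieved list.

```python
import math


def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the top-k graded relevances.

    Assumption: linear gain (the grade itself) with a log2(rank + 1)
    discount, where rank is 1-based.
    """
    return sum(
        rel / math.log2(position + 2)  # position is 0-based, so rank + 1 = position + 2
        for position, rel in enumerate(relevances[:k])
    )


def ndcg_at_k(relevances, k):
    """DCG normalized by the DCG of the best possible ordering of the same grades."""
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    if ideal == 0:
        return 0.0  # no relevant items at all: define NDCG as 0
    return dcg_at_k(relevances, k) / ideal


# Relevance grades of results in the order a retriever returned them (made-up data).
ranked_grades = [0, 1]
score = ndcg_at_k(ranked_grades, k=2)
```

A perfectly ordered list scores 1.0, and pushing a relevant result down the ranking lowers the score through the logarithmic discount.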

3 · arXiv · cs.LG · 9d ago · source ↗

Bradley-Terry model proposed for fairer ranking of recommendation algorithms across dataset types

A new arXiv preprint introduces a Bradley-Terry (BT) model-based methodology for ranking recommendation algorithms in a way that accounts for dataset characteristics such as sparsity, sequential structure, and scale. The authors argue that naive metric aggregation (e.g., averaging NDCG) produces misleading rankings and propose BT trees and covariate-extended BT models as alternatives. The framework also enables ranking predictions on unseen datasets without running the models, and includes a new metric for ranking consistency.
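
To illustrate the underlying idea, the following is a minimal sketch of fitting a plain Bradley-Terry model from pairwise win counts using the standard minorization-maximization update. It is not the preprint's method: it omits BT trees and covariates entirely. The function name `fit_bradley_terry` and the win matrix are hypothetical; in practice a "win" might mean one algorithm achieving a higher NDCG than another on a given dataset.

```python
def fit_bradley_terry(wins, iterations=200):
    """Estimate Bradley-Terry strengths from a square matrix of win counts.

    wins[i][j] is the number of comparisons in which item i beat item j.
    Returns strengths normalized to sum to 1. Under the model,
    P(i beats j) = p[i] / (p[i] + p[j]).
    """
    n = len(wins)
    p = [1.0 / n] * n
    for _ in range(iterations):
        updated = []
        for i in range(n):
            total_wins = sum(wins[i][j] for j in range(n) if j != i)
            # Each pair contributes (number of comparisons) / (p_i + p_j).
            denom = sum(
                (wins[i][j] + wins[j][i]) / (p[i] + p[j])
                for j in range(n)
                if j != i
            )
            updated.append(total_wins / denom if denom > 0 else p[i])
        norm = sum(updated)
        p = [value / norm for value in updated]
    return p


# Hypothetical comparison counts among three algorithms A, B, C.
wins = [
    [0, 3, 4],  # A beat B 3 times and C 4 times
    [1, 0, 3],  # B beat A once and C 3 times
    [0, 1, 0],  # C beat B once
]
strengths = fit_bradley_terry(wins)
```

Unlike averaging a metric across datasets, the fitted strengths depend only on who outperformed whom in each comparison, so a single dataset with unusually large score gaps does not dominate the ranking.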