HippoRAG: RAG framework combining knowledge graphs and Personalized PageRank for continuous knowledge integration
HippoRAG is an open-source RAG framework published at NeurIPS 2024 by the OSU NLP Group that draws on models of human long-term memory to enable LLMs to continuously integrate knowledge across external documents. It combines retrieval-augmented generation with knowledge graphs and Personalized PageRank to improve multi-hop and associative retrieval. The repository has accumulated 3,742 GitHub stars with ongoing community traction.
Related guides (2)
Related events (8)
HistoRAG: A RAG framework embedding historiographical methodology for historical research
Researchers introduce HistoRAG, a Retrieval-Augmented Generation framework that adapts RAG architecture to the epistemological requirements of historical scholarship. Key interventions include separated retrieval and generation, temporal windowing to ensure balanced source representation across time periods, and LLM-as-judge evaluation for transparent relevance judgments. The framework is evaluated on SPIEGELragged, a corpus of 102,189 Der Spiegel articles from 1950–1979, revealing concrete deficiencies in standard RAG for historical work (e.g., era-specific vocabulary failures, weak correlation between vector similarity and LLM-assessed relevance). The paper also introduces the concept of 'Zwischentexte' as a framework for responsible integration of LLM-generated text into scholarly practice.
RAGFlow open-source RAG engine with agent capabilities trending on GitHub
RAGFlow is an open-source Retrieval-Augmented Generation engine that combines RAG with agent capabilities, positioned as a context layer for LLMs. The project has accumulated over 83,000 GitHub stars with 111 new stars today, indicating sustained community interest. It is maintained by Infiniflow and represents a notable open-source tooling option in the RAG/agent ecosystem.
Anthropic introduces Contextual Retrieval to reduce RAG retrieval failures by up to 67%
Anthropic published a technical method called Contextual Retrieval that combines Contextual Embeddings and Contextual BM25 to address the context-loss problem in traditional RAG pipelines. The approach prepends chunk-level context before encoding, reducing failed retrievals by 49% standalone and 67% when combined with reranking. The post also highlights prompt caching as a simpler alternative for knowledge bases under 200K tokens, and provides a cookbook for deployment with Claude.
HKVM-RAG: Hypergraph key-value separation improves multi-hop retrieval-augmented generation
A new arXiv preprint introduces HKVM-RAG, an evidence-organization layer for multi-hop RAG that uses weighted hyperedges as retrieval keys while retaining passage text as answer values. Under a fixed-substrate protocol controlling for tuple cache, reader, and evaluation budget, the hypergraph key-value approach improves over KG-PPR by +3.4 F1 on 2WikiMultiHopQA and +3.6 F1 on MuSiQue. A dense-aware controller combining frozen ColBERTv2 with HKVM features reaches 88.8, 65.1, and 85.8 F1 on three benchmarks, outperforming ColBERTv2 alone by 5–11 F1 points. The work positions hypergraph organization as a reusable evidence-control mechanism rather than a dense-retrieval replacement.
FastGPT: open-source knowledge-base platform with RAG and visual workflow orchestration
FastGPT is an open-source TypeScript platform for building knowledge-based question-answering systems on top of LLMs, featuring data processing pipelines, RAG retrieval, and a visual AI workflow editor. The project has accumulated 28,533 GitHub stars with modest daily growth (+65), indicating steady community traction. It targets developers who want to deploy RAG-based QA systems without extensive configuration.
UMG-RAG: Training-free hybrid retrieval with uncertainty-aware granularity fusion for long-document RAG
Researchers propose Uncertainty-aware Multi-Granularity RAG (UMG-RAG), a training-free hybrid retrieval framework that addresses the tension between large and fine-grained retrieval chunks in RAG pipelines. The system converts dense and sparse retriever scores across multiple chunk granularities into evidence distributions, estimates reliability via entropy, and fuses candidates using query-specific confidence signals. A variant called UMGP-RAG uses fine-grained hits to locate evidence while returning broader parent chunks for coherence. Experiments on QA benchmarks show improved generation quality with no changes to the underlying retriever or generator.
DocTrace: Structure-Aware On-Demand Hypergraph Memory for Long-Document QA
Researchers introduce DocTrace, a multi-agent RAG framework for long-document question answering that uses query-triggered knowledge organization rather than costly query-agnostic preprocessing. The system combines a lightweight document structural tree index, on-demand hypergraph working memory, and a graph-structured experience memory that stores successful reasoning plans for reuse. Evaluated on four long-document QA datasets, DocTrace outperforms the strongest baseline (ComoRAG) by up to 8.85% F1 and 4.40% EM while reducing computational cost by 53.32%.
Expert Support Case Study: Bolstering a RAG App with LLM-as-a-Judge
Hugging Face published a case study describing how Digital Green used an LLM-as-a-Judge approach to evaluate and improve a retrieval-augmented generation (RAG) application. The post covers the methodology for using LLMs to score and validate RAG outputs, providing a practical deployment pattern for quality assurance in production AI systems. It serves as a concrete example of enterprise-grade evaluation pipelines built on top of RAG architectures.

