dataset
ChatDoctor
datasetactiveprovisional
chatdoctor-1a6cac9b·1 events·first seen 25h agoAliases: ChatDoctor
Co-occurring entities
More like this (12)
Recent events (1)
Multi-agent semantic rewriting framework for privacy-preserving RAG
A new arXiv preprint proposes a three-agent framework for sanitizing retrieved content in RAG pipelines by performing privacy extraction, semantic analysis, and reconstruction as an offline preprocessing step. Evaluated on ChatDoctor and Wiki-PII datasets across six LLMs, the approach reduces targeted information exposure in LLaMA-3-8B from 144 baseline instances to 1, while maintaining contextual fidelity (BLEU-1 of 0.122 vs. SAGE's 0.117). The framework introduces no additional online inference latency since rewriting is done offline. Source code is publicly released.