Entity · benchmark

AUPRC

benchmarkactiveauprc-4d9aa7c3·3 events·first seen May 28, 2026

Aliases: AUPRC

Co-occurring entities

AUROC When Does Synthetic Data Augmentation Improve Score-Based Imbalanced Classification?LLM-augmented clinical NLP pipeline evidence extraction Australian Emergency Department triage notes Reverse Probing delta energy clinical text summarization Uncertainty Quantification

More like this (12)

AuRA UMAP AB-UPT UCL ARC AUROC GP-UCB AUC USACO UBP2 APS-RAG UR-VC LPU

Recent events (3)

4arXiv · cs.LG·Jun 25, 2026·source ↗

Theoretical framework for when synthetic data augmentation improves imbalanced classification metrics

A new arXiv preprint develops a theoretical framework characterizing when synthetic minority-class augmentation improves score-based metrics (AUROC, AUPRC, balanced accuracy, F1) under class imbalance. The authors show that under well-specified score models, augmentation provides no fundamental population-level improvement and may introduce bias, while under model misspecification it can correct ranking errors by shifting effective class balance. Minimax lower bounds confirm the raw estimator is already optimal in the well-specified regime, and simulation studies corroborate the theory.

Evaluation and Benchmarking AUPRC AUROC When Does Synthetic Data Augmentation Improve Score-Based Imbalanced Classification?

4arXiv · cs.CL·Jun 2, 2026·source ↗

Evidence-Augmented ML for Self-Harm Surveillance in Emergency Department Triage Notes

Researchers developed a three-stage pipeline combining traditional machine learning with LLM-based screening and evidence extraction to detect self-harm in Australian emergency department triage notes. The system achieved AUPRCs around 0.88 in both internal and external validation, and transferred to two external hospital sites without site-specific retraining. A notable capability is identifying the primary self-harm method with 95% accuracy, enabling more granular public health surveillance beyond binary classification.

Enterprise Deployment Patterns AUPRC LLM-augmented clinical NLP pipeline evidence extraction +1 more

5arXiv · cs.AI·May 28, 2026·source ↗

Reverse Probing: Supervised Token-level Uncertainty Quantification for LLMs in Clinical Text

The paper introduces Reverse Probing, a novel uncertainty quantification framework designed specifically for clinical text summarization that estimates token-level uncertainty from pre-existing labeled summaries rather than sampling new outputs. It extracts uncertainty signals from four categories of internal model activations, treating text as a probe into the model's internal state. Evaluated on two expert-annotated clinical datasets, it outperforms eight adapted baselines on all metrics, achieving up to 4× higher AUPRC while reducing inference time and compute. Feature analysis identifies delta energy and neighborhood context as the most consistent predictors of uncertainty across models.

Evaluation and Benchmarking AI Safety Research Reverse Probing delta energy AUPRC +3 more