Entity · benchmark

ValueEval

benchmarkactivevalueeval-67726494·1 events·first seen May 22, 2026

Aliases: ValueEval

Co-occurring entities

TouchéValueML DeBERTa-v3 Retrieval-Augmented Generation Schwartz Value Theory

More like this (12)

ParaEval CharacterEval L-Eval Every Eval Ever DeepEval T-Eval G-Eval SummEval HypoEval TweetEval UniEval IFEval

Recent events (1)

4arXiv · cs.CL·May 22, 2026·source ↗

Systematic Study of Schwartz Value Detection in Political Texts: Context, Scale, and Moral Knowledge

This paper investigates when additional context, larger models, or retrieved moral knowledge improve detection of Schwartz human values in political text using the ValueEval benchmark format. Key findings show that full-document context helps supervised DeBERTa encoders (+3.8–4.8 macro-F1) but not zero-shot LLMs, while RAG with a curated moral knowledge base consistently benefits all model families under early fusion. Scaling model size does not guarantee gains, and simple early fusion outperforms more complex RAG variants. The study recommends jointly evaluating context, knowledge, and model family rather than assuming larger inputs or models universally improve value-sensitive NLP.

Evaluation and Benchmarking Agent and Tool Ecosystem TouchéValueML DeBERTa-v3 Retrieval-Augmented Generation +2 more