benchmark

human alignment benchmarks (perceptual similarity, gloss, robustness, shape-texture)

benchmark · active · provisional · human-alignment-benchmarks-perceptual-similarity-gloss-robustness-shape-texture--532705c2 · 1 event · first seen 23d ago

Aliases: human alignment benchmarks (perceptual similarity, gloss, robustness, shape-texture)

Co-occurring entities

Joint Energy-Based Models (JEMs) · generative-discriminative continuum

More like this (12)

human alignment (neural/behavioral) · Human-Vehicle Interaction Benchmark · contrastive semantic alignment · human uncertainty alignment · Representational Similarity Analysis · AI alignment · Creative Quality Alignment (CQA) · diff hunk taxonomy benchmark · harness-level benchmarks · Human Label Variation (HLV) · post-training alignment · MedAlign

Recent events (1)

6 · arXiv · cs.AI · 23d ago

Joint Energy-Based Models Reveal a Generative-Discriminative Sweet Spot for Human-Aligned Vision

Researchers use Joint Energy-Based Models (JEMs) to isolate the effect of the learning objective, independent of architecture, scale, and data, on human alignment in visual representations. They vary a single mixing coefficient between discriminative and generative training and evaluate the resulting models on six human-alignment benchmarks, finding that alignment peaks at intermediate points on the generative-discriminative continuum rather than at either extreme. The results suggest that hybrid objectives, which combine categorical structure from discriminative learning with input-structure sensitivity from generative learning, yield the most human-like visual behavior. This challenges the framing of generative vs. discriminative as a binary choice for building human-aligned vision systems.
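To make the idea of a single mixing coefficient concrete, here is a minimal sketch of a hybrid objective in the JEM style. It is an illustration under stated assumptions, not the paper's implementation: the summary does not give the exact loss, so the name `alpha`, the convention that `alpha = 1` is purely discriminative, and the contrastive-divergence-style surrogate for the generative term are all hypothetical. In a real JEM the negative samples would typically come from a sampler such as SGLD; here their logits are simply passed in.

```python
import math


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def cross_entropy(logits, label):
    """Discriminative term: -log softmax(logits)[label]."""
    return logsumexp(logits) - logits[label]


def energy(logits):
    """JEM-style energy of an input, E(x) = -logsumexp_y f(x)[y]."""
    return -logsumexp(logits)


def hybrid_loss(data_logits, label, sample_logits, alpha):
    """Mix discriminative and generative terms with one coefficient.

    alpha = 1.0 -> purely discriminative, alpha = 0.0 -> purely generative.
    This convention is an assumption made for the sketch.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    discriminative = cross_entropy(data_logits, label)
    # Assumed surrogate for -log p(x): lower the energy of real data and
    # raise the energy of model samples (contrastive-divergence style).
    generative = energy(data_logits) - energy(sample_logits)
    return alpha * discriminative + (1.0 - alpha) * generative


if __name__ == "__main__":
    real = [2.0, 0.0]   # hypothetical classifier logits for a real input
    fake = [0.5, 0.5]   # hypothetical logits for a sampled (negative) input
    for alpha in (0.0, 0.25, 0.5, 0.75, 1.0):
        print(alpha, hybrid_loss(real, 0, fake, alpha))
```

Sweeping `alpha` over a grid, as in the loop above, mirrors the kind of comparison the summary describes: one model per point on the generative-discriminative continuum, each then scored on the alignment benchmarks.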

Evaluation and Benchmarking · Alignment and RLHF · human alignment benchmarks (perceptual similarity, gloss, robustness, shape-texture) · Joint Energy-Based Models (JEMs) · generative-discriminative continuum