Entity · benchmark

BLEU-4

benchmark · active · bleu-4-b9d5a77f · 1 event · first seen Jun 1, 2026

Aliases: BLEU-4

Co-occurring entities

Graph Transformer · Diffusion Language Models · Graph-LLaDA · LLaDA · LAGRANGE · lambda-scaled structural decoding

More like this (12)

BLEU · Bluesky · STT-Agent-4B · MVOIK-4D · GLM-4.5-Air · Gemma-4 E4B-it · Blue J · MultiBLiMP · BLEURT · HAT-4D · NEC BluStellar · BLIP-2

Recent events (1)

6 · arXiv · cs.CL · Jun 1, 2026

Trajectory Analysis of Masked Diffusion LMs for Graph-to-Text Generation with Lambda-Scaled Structural Decoding

This paper presents the first systematic study of masked diffusion language models (MDLMs) for graph-to-text generation, analyzing the order in which tokens are unmasked during iterative decoding. The authors find that MDLMs naturally unmask entities first, then relational/function words, then structural tokens. Supervised fine-tuning disrupts this pattern: it prematurely anchors structural tokens, which causes hallucination or omission. They propose lambda-scaled structural decoding, a training-free inference-time fix that recovers 9.4 BLEU-4 points, and introduce Graph-LLaDA, which integrates a Graph Transformer encoder into LLaDA's decoding process. Cross-dataset evaluation on the LAGRANGE benchmark shows that prior baselines overfit to dataset-specific patterns, while MDLM-based approaches generalize better.
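For readers unfamiliar with the metric behind the reported gain, the following is a minimal sketch of sentence-level BLEU-4: the geometric mean of clipped 1- to 4-gram precisions multiplied by a brevity penalty. It is illustrative only. The summary above does not say which BLEU implementation, tokenization, or smoothing the paper uses, and the function names `ngrams` and `bleu4` are hypothetical. The sketch assumes whitespace-tokenized input and no smoothing, and it returns a score in [0, 1]; published BLEU figures are usually multiplied by 100.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all contiguous n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu4(candidate, references):
    """Unsmoothed sentence-level BLEU-4 with uniform weights (illustrative sketch).

    candidate: list of tokens; references: list of token lists.
    Returns a value in [0, 1]; 0.0 if any n-gram order has no match.
    """
    if not candidate or not references:
        return 0.0
    log_precisions = []
    for n in range(1, 5):
        cand_counts = ngrams(candidate, n)
        total = sum(cand_counts.values())
        if total == 0:  # candidate shorter than n tokens
            return 0.0
        # Clip each candidate n-gram count by its maximum count in any reference.
        max_ref = Counter()
        for ref in references:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)
        clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
        if clipped == 0:  # no smoothing in this sketch
            return 0.0
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty uses the reference length closest to the candidate length.
    c = len(candidate)
    r = min((abs(len(ref) - c), len(ref)) for ref in references)[1]
    bp = 1.0 if c > r else math.exp(1 - r / c)
    return bp * math.exp(sum(log_precisions) / 4)


hyp = "the cat sat on the mat".split()
ref = "the cat sat on a mat".split()
score = bleu4(hyp, [ref])
```

In this toy pair the clipped precisions are 5/6, 3/5, 2/4, and 1/3, so the score is the fourth root of their product, 1/12. Comparable numbers across papers generally require a shared corpus-level implementation, which this sketch does not replace.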

Frontier Model Releases · Evaluation and Benchmarking · BLEU-4 · Graph Transformer · Diffusion Language Models · +5 more