Almanac
product

PhotoFlow

productactiveprovisionalphotoflow-ec93bd73·1 events·first seen 22d ago

Aliases: PhotoFlow

Co-occurring entities

More like this (12)

Recent events (1)

5arXiv · cs.AI·22d ago·source ↗

PhotoFlow: Agentic 3D Virtual Photography via Director-Reviewer-Reflector Loop

PhotoFlow introduces a closed-loop agentic system for language-conditioned virtual photography in arbitrary 3D scenes, using a Director-Reviewer-Reflector architecture to iteratively search camera poses and render photographs without preselected viewpoints. The system is evaluated on VPhotoBench, a new benchmark of 47 Blender scenes and 141 language-conditioned missions covering spatial composition and aesthetic criteria. PhotoFlow outperforms one-shot prediction, single-chain reflection, anchor-bank selection, and random search baselines under a six-round rendering budget. The work represents the first formalization of language-conditioned virtual photography as an executable agent task, probing both 3D spatial reasoning and aesthetic judgment in vision-language models.