Almanac
technique

Visual Geometry Foundation Models

techniqueactivevisual-geometry-foundation-models-f606910f·1 events·first seen 1mo ago

Aliases: Visual Geometry Foundation Models

Co-occurring entities

More like this (12)

Recent events (1)

5arXiv · cs.AI·1mo ago·source ↗

IVGT: Implicit Visual Geometry Transformer for Neural Scene Representation

IVGT is a new neural architecture that implicitly models continuous 3D geometry from unposed multi-view images without requiring explicit pointmap regression. It learns a continuous neural scene representation in a canonical coordinate system, supporting SDF-based surface queries and color prediction via lightweight decoders. The model is trained with multi-dataset joint optimization using 2D supervision and 3D geometric regularization, achieving strong generalization across mesh reconstruction, novel view synthesis, depth/normal estimation, and camera pose estimation tasks.