Entity · dataset

CC12M

datasetactivecc12m-4f6d46b1·1 events·first seen Jun 3, 2026

Aliases: CC12M

Co-occurring entities

DINOv2 Yuan Gao MS COCO SiT SigLIP 2 Jiatao Gu Apple ImageNet Feature Auto-Encoder

More like this (12)

CWM 32B HC3 LM1B Pixtral 12B HMMT25 CSI300 CT-DEB26 BM25 AMC12 MMMC-Code M2M100 PCMCI+

Recent events (1)

5The Batch·Jun 3, 2026·source ↗

Apple researchers propose Feature Auto-Encoder to speed diffusion training via compressed DINOv2 embeddings

Researchers at Apple introduced Feature Auto-Encoder (FAE), a latent diffusion image generator that compresses DINOv2 vision encoder embeddings before learning to denoise them, then expands them back for decoding. The approach achieves comparable image quality to state-of-the-art diffusion models while training roughly 7x faster on ImageNet class-conditional generation. The key insight is that shrinking semantically rich vision embeddings reduces compute during diffusion training without sacrificing the representational benefits of large pretrained encoders.

Training Infrastructure Multimodal Progress DINOv2 Yuan Gao MS COCO +7 more