Entity · technique

Variational Autoencoder (VAE)

techniqueactivevariational-autoencoder-vae--5214bda6·3 events·first seen May 19, 2026

Aliases: Variational Autoencoder (VAE), Variational Autoencoders

Co-occurring entities

Diffusion Models Bayesian Optimization Multimodal Learning active learning Inverse Materials Design Reinforcement Learning normalizing flows Multimodal Large Language Models Dual Layer Aggregation (DLA)Subject-driven Image Generation VAE-based identity conditioning Hugging Face Remote VAE Inference Endpoints

More like this (12)

conditional variational autoencoder VAE Encoder Sparse Autoencoder Remote VAE Sparse Autoencoders (SAEs)VideoVAE+Natural Language Autoencoders Feature Auto-Encoder Sparse Autoencoders AudioVAE2 VQ-VAE amortized variational inference

Recent events (3)

5arXiv · cs.LG·Jun 2, 2026·source ↗

Review: Generative Models, Multimodal Learning, and Closed-Loop Workflows in Inverse Materials Design

This arxiv review surveys recent advances in generative modeling for inverse materials design, covering variational autoencoders, normalizing flows, autoregressive models, and diffusion models applied to crystalline solid discovery. It examines how multimodal learning fuses crystal structures, thermodynamic data, spectroscopy, microscopy, and scientific text into transferable chemical-space representations. The paper also reviews closed-loop design pipelines integrating conditional generation with Bayesian optimization, reinforcement learning, and active learning, and identifies recurring failure modes including surrogate exploitation, diversity collapse, and the stability-synthesizability gap.

Evaluation and Benchmarking Agent and Tool Ecosystem Bayesian Optimization Multimodal Learning active learning +6 more

5arXiv · cs.LG·May 26, 2026·source ↗

Squeezing Capacity from MLLMs for Subject-driven Image Generation via Dual Layer Aggregation

This paper proposes conditioning diffusion models on Multimodal Large Language Models (MLLMs) that jointly encode text and reference images, augmented with VAE-based identity conditioning to address copy-paste artifacts and identity preservation failures in subject-driven image generation. A Dual Layer Aggregation (DLA) module aggregates multi-level MLLM features, and a multi-stage denoising strategy progressively balances semantic and fine-detail identity signals during inference. Experiments show improved human preference scores on subject-driven generation benchmarks compared to prior approaches that encode text and reference images separately.

Agent and Tool Ecosystem Multimodal Progress Multimodal Large Language Models Dual Layer Aggregation (DLA)Subject-driven Image Generation +3 more

4Hugging Face Blog·May 19, 2026·source ↗

Remote VAEs for Decoding with Hugging Face Inference Endpoints

Hugging Face introduces Remote VAEs, a feature for Inference Endpoints that offloads the VAE decoding step of diffusion models to a separate remote service. This approach reduces GPU memory pressure on the primary inference host by decoupling the computationally expensive decoding stage. The pattern is relevant for large latent diffusion models where VAE decoding can be a significant memory and compute bottleneck.

Inference Economics Enterprise Deployment Patterns Hugging Face Variational Autoencoder (VAE)Remote VAE +1 more