Almanac
← Events
4OpenAI Blog·1mo ago

Implicit Generation and Generalization Methods for Energy-Based Models

OpenAI published research on stable and scalable training of energy-based models (EBMs), achieving sample quality competitive with GANs at low temperatures while retaining mode coverage guarantees of likelihood-based models. The approach uses iterative compute during generation to continually refine outputs. This work positions EBMs as a promising alternative generative modeling paradigm bridging GANs and likelihood-based models.

Related guides (2)

Related events (8)

5Openai Blog·1mo ago·source ↗

Improved Techniques for Training Consistency Models

OpenAI presents improved training techniques for consistency models, a class of generative models capable of producing high-quality samples in a single step without adversarial training. The work advances a nascent alternative to diffusion-based generation that trades multi-step sampling for single-step inference. The post originates from OpenAI's research blog, indicating continued investment in efficient generative modeling.

2Openai Blog·1mo ago·source ↗

OpenAI: Generative Models Overview (2016)

A 2016 OpenAI blog post describing four research projects centered on generative models as a branch of unsupervised learning. The post explains what generative models are, their importance, and potential future directions. This is an archival piece predating modern large language models and diffusion systems, representing early foundational work at OpenAI.

6arXiv · cs.AI·26d ago·source ↗

Joint Energy-Based Models Reveal a Generative-Discriminative Sweet Spot for Human-Aligned Vision

Researchers use Joint Energy-Based Models (JEMs) to isolate the effect of learning objective—independent of architecture, scale, and data—on human alignment in visual representations. By varying a single mixing coefficient between discriminative and generative training, they evaluate models across six human-alignment benchmarks and find that alignment peaks at intermediate points on the generative-discriminative continuum rather than at either extreme. The results suggest that hybrid objectives combining categorical structure from discriminative learning with input-structure sensitivity from generative learning yield the most human-like visual behavior. This challenges the framing of generative vs. discriminative as a binary choice for building human-aligned vision systems.

4Openai Blog·1mo ago·source ↗

Learning Concepts with Energy Functions

OpenAI presents an energy-based model capable of learning abstract spatial concepts—such as 'near,' 'above,' and 'between'—from only five demonstrations using sets of 2D points. The model generalizes across domains, transferring concepts learned in a 2D particle environment to control tasks in a 3D physics-based robot simulation. The work demonstrates few-shot concept acquisition and cross-domain transfer via energy function representations.

6The Batch·19d ago·source ↗

GLM-5.1 Open-Weights Model Targets Long-Running Agentic Tasks; Andrew Ng on Coding Agent Acceleration by Software Domain

Z.ai released GLM-5.1, an open-weights mixture-of-experts LLM (754B total / 40B active parameters) designed for sustained agentic coding tasks lasting up to eight hours, featuring iterative planning-execution-evaluation loops with thousands of tool calls. The model claims top open-weights performance on Artificial Analysis Intelligence Index and SWE-Bench Pro, available under MIT license via HuggingFace. The accompanying editorial by Andrew Ng offers a tiered framework for how much coding agents accelerate different software work categories—frontend most, then backend, infrastructure, and research least—with practical implications for team organization. A secondary item references data-center opposition and LLM helpfulness failure modes.

6Openai Blog·1mo ago·source ↗

Image GPT: Transformer Models Applied to Pixel Sequences for Image Generation and Classification

OpenAI demonstrates that a large transformer model trained autoregressively on pixel sequences can generate coherent image completions and samples, analogous to text generation. The work establishes a correlation between generative sample quality and downstream image classification accuracy. The best generative model achieves features competitive with top convolutional networks in the unsupervised setting, suggesting shared representational principles across modalities.

5Openai Blog·1mo ago·source ↗

OpenAI Releases Procgen Benchmark for RL Generalization

OpenAI released Procgen Benchmark, a suite of 16 procedurally-generated environments designed to measure how quickly reinforcement learning agents learn generalizable skills. The benchmark targets a core challenge in RL: distinguishing memorization of specific environments from genuine skill generalization. Its procedural generation ensures agents cannot overfit to fixed level layouts.

6Openai Blog·1mo ago·source ↗

Consistency Models

OpenAI introduces Consistency Models, a new generative modeling framework designed to address the slow iterative sampling process inherent in diffusion models. The approach aims to enable faster single-step or few-step generation for image, audio, and video synthesis. The post appears to be a research announcement or blog summary of the underlying technique.