Almanac
technique

Model Distillation

techniqueactivemodel-distillation-ec8aed50·2 events·first seen 28d ago

Aliases: Model Distillation, diffusion model distillation

Co-occurring entities

More like this (12)

Recent events (2)

6Openai Blog·28d ago·source ↗

Model Distillation in the API

OpenAI has launched a model distillation feature within its API platform, enabling developers to fine-tune smaller, cost-efficient models using outputs generated by large frontier models. The workflow is entirely contained within the OpenAI platform. This lowers the barrier to deploying capable but cheaper models by leveraging knowledge transfer from frontier systems like GPT-4o.

5arXiv · cs.AI·26d ago·source ↗

CARV: Compute-Aware Variance Reduction for Diffusion Teacher Gradient Estimation

CARV is a hierarchical Monte Carlo estimation framework that reduces gradient variance when using frozen pretrained diffusion models as teachers in downstream pipelines such as text-to-3D distillation and data attribution. The approach amortizes expensive upstream computation (rendering, simulation, encoding) over cheap diffusion-noise resamples, augmented by timestep importance sampling and stratified-inverse-CDF construction. In text-to-3D experiments, CARV delivers 2–3× effective compute multipliers; in single-step distillation, it cuts gradient variance by an order of magnitude but does not improve FID, revealing that MC variance is not the bottleneck in that regime.