6 · OpenAI Blog · 1mo ago

Trading Inference-Time Compute for Adversarial Robustness

OpenAI published research exploring the trade-off between inference-time compute and adversarial robustness. The work investigates whether allocating more compute at inference time can improve a model's resistance to adversarial attacks. This connects to the broader trend of using test-time compute scaling as a lever for capability and safety improvements.

Evaluation and Benchmarking · Inference Economics · AI Safety Research · adversarial robustness · inference-time compute scaling · OpenAI

Related guides (4)

OpenAI

OpenAI: The Lab That Made AI a Household Word


AI Safety Research · Topic guide

AI Safety Research: From Lab Policies to Real-World Flashpoints


Inference Economics · Topic guide

Inference Economics: The Cost Structure of Running AI Models in Production


Evaluation and Benchmarking · Topic guide

Evaluation and Benchmarking: The Shifting Yardstick of AI Capability


Related events (8)

4 · OpenAI Blog · 1mo ago

Computational limitations in robust classification and win-win results

OpenAI published research examining computational limitations in robust classification, exploring theoretical bounds on adversarially robust machine learning. The work investigates so-called 'win-win' results where both standard and robust accuracy can be achieved simultaneously. This is a foundational safety and robustness research contribution from 2019, addressing hardness results in adversarial ML.

Evaluation and Benchmarking · AI Safety Research · adversarial robustness · Robust Classification · OpenAI

4 · OpenAI Blog · 1mo ago

Testing Robustness Against Unforeseen Adversaries

OpenAI published a method to evaluate whether neural network classifiers can defend against adversarial attacks not encountered during training. The approach introduces a new metric called UAR (Unforeseen Attack Robustness) to quantify a model's resilience to unanticipated attacks. The work argues for measuring robustness across a broader, more diverse set of attack types rather than only those seen in training.

Evaluation and Benchmarking · AI Safety Research · adversarial robustness · OpenAI · UAR (Unforeseen Attack Robustness)

4 · OpenAI Blog · 1mo ago

Transfer of Adversarial Robustness Between Perturbation Types

OpenAI published research examining whether adversarial robustness trained against one type of perturbation (e.g., L-infinity) transfers to other perturbation types (e.g., L2, L1). The work investigates the generalization properties of adversarial training across different threat models. This is an early safety and robustness research contribution from OpenAI predating the modern LLM era.
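The perturbation types named above differ in how they measure the size of a change to the input. The sketch below is only an illustration of that geometry, not the method from the research: it computes the L1, L2, and L-infinity norms of two hypothetical perturbations. The input dimension (784) and the budget `eps` (0.03) are assumptions chosen for the example.

```python
import math

def lp_norms(delta):
    """Return the (L1, L2, L-infinity) norms of a perturbation vector."""
    l1 = sum(abs(x) for x in delta)
    l2 = math.sqrt(sum(x * x for x in delta))
    linf = max(abs(x) for x in delta)
    return l1, l2, linf

# Hypothetical setup: a 784-dimensional input and a small per-coordinate budget.
dim = 784
eps = 0.03

# Dense perturbation: every coordinate moves by eps (small in L-infinity).
dense = [eps] * dim

# Sparse perturbation: one coordinate moves by 1.0 (small in L1, large in L-infinity).
sparse = [1.0] + [0.0] * (dim - 1)

dense_l1, dense_l2, dense_linf = lp_norms(dense)
sparse_l1, sparse_l2, sparse_linf = lp_norms(sparse)

print(f"dense : L1={dense_l1:.2f}  L2={dense_l2:.2f}  Linf={dense_linf:.2f}")
print(f"sparse: L1={sparse_l1:.2f}  L2={sparse_l2:.2f}  Linf={sparse_linf:.2f}")
```

The dense perturbation stays inside an L-infinity ball of radius `eps` but has a large L1 norm, while the sparse one has a small L1 norm but falls far outside that L-infinity ball. A defense tuned to one ball therefore does not automatically cover the other, which is why transfer between perturbation types is an open empirical question.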

Evaluation and Benchmarking · AI Safety Research · adversarial robustness · L-infinity perturbation · adversarial training · +2 more

7 · OpenAI Blog · 1mo ago

AI and Compute: OpenAI Analysis of Exponential Growth in Training Compute Since 2012

OpenAI published an analysis in May 2018 showing that compute used in the largest AI training runs has been doubling every 3.4 months since 2012, far outpacing Moore's Law's 2-year doubling period. Over the 2012–2018 period, this metric grew by more than 300,000x. The analysis frames compute scaling as a key driver of AI progress and argues for preparing for systems with capabilities well beyond those of the time.
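The figures in this summary can be sanity-checked with a few lines of arithmetic. The sketch below takes the 3.4-month doubling time and the 300,000x growth factor from the summary and assumes clean exponential growth throughout, which is a simplification; the variable names are illustrative.

```python
import math

DOUBLING_MONTHS_AI = 3.4       # doubling time quoted in the summary
DOUBLING_MONTHS_MOORE = 24.0   # Moore's Law, taken as a 2-year doubling period
GROWTH_FACTOR = 300_000        # "more than 300,000x" from the summary

# Number of doublings needed to reach the quoted growth factor.
doublings = math.log2(GROWTH_FACTOR)

# Time implied by that many doublings at one doubling every 3.4 months.
months_elapsed = doublings * DOUBLING_MONTHS_AI

# Growth a 2-year doubling period would produce over the same span.
moore_growth = 2 ** (months_elapsed / DOUBLING_MONTHS_MOORE)

print(f"doublings needed: {doublings:.1f}")
print(f"implied span    : {months_elapsed:.0f} months ({months_elapsed / 12:.1f} years)")
print(f"Moore's Law over the same span: about {moore_growth:.0f}x")
```

Under these assumptions, 300,000x corresponds to roughly 18 doublings over a little more than five years, consistent with the 2012–2018 window, while a 2-year doubling period would yield only about a 6x increase over the same span.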

Training Infrastructure · Frontier Model Releases · Moore's Law · OpenAI · AI and Compute · +1 more

3 · Hugging Face Blog · 1mo ago

How to Train Your Model Dynamically Using Adversarial Data

This Hugging Face blog post describes a methodology for dynamically training models using adversarial data, likely in the context of improving robustness against adversarial examples. The post covers techniques for generating and incorporating adversarial inputs during the training loop to improve model resilience. Published in mid-2022, it targets practitioners looking to harden ML models against distribution shift and adversarial attacks.

AI Safety Research · MNIST · Hugging Face · adversarial training

5 · arXiv · cs.AI · 47h ago

Distributionally robust optimization framework for probabilistic runtime verification of AI agents

A new arXiv preprint introduces a sound and efficient framework for verifying probabilistic security policies for AI agents operating in complex digital environments, addressing limitations of prior Datalog-based approaches that assumed deterministic policies or predicate independence. The method uses distributionally robust optimization to compute sound upper bounds on policy violation probability without requiring independence assumptions between predicates. Evaluated on benchmarks for terminal and tool-calling agents, the approach outperforms prior art on the security-utility trade-off.

AI Safety Research · Agent and Tool Ecosystem · Datalog · Efficient and Sound Probabilistic Verification for AI Agents · distributionally robust optimization

5 · arXiv · cs.CL · 8d ago

Adaptive asymmetric token compression accelerates time series language models up to 7.68×

A new arXiv preprint proposes an adaptive token budgeting framework for time series (TS) language models that compresses TS tokens using frequency-domain structure and progressively prunes prompt tokens across model layers. The authors demonstrate up to 7.68× inference acceleration with performance improvements in 78% of evaluated settings across forecasting, classification, imputation, and anomaly detection tasks. The work is motivated by the observation that TS tokens have uneven spectral contributions and prompt-token influence attenuates with model depth, making uniform token processing wasteful.

Long Context Evolution · Inference Economics · Beyond Uniform Tokens: Adaptive Compression for Time Series Language Models

4 · Latent Space · 1mo ago

[AINews] The Inference Inflection

A Latent Space commentary piece reflecting on the broader implications of the 'inference age' in AI. The piece appears to be a daily AI news digest framing inference-time compute as a significant structural shift. Published during a relatively quiet news day, it offers analytical perspective on inference economics and deployment patterns rather than breaking news.

Inference Economics · Enterprise Deployment Patterns · Latent Space