Entity · other

cybersecurity risk uplift

otheractivecybersecurity-risk-uplift-2e7bb760·1 events·first seen May 20, 2026

Aliases: cybersecurity risk uplift

Co-occurring entities

biology risk uplift malicious fine-tuning GPT-OSS OpenAI

More like this (12)

biology risk uplift cyberattack scaling law Trusted Access for Cyber Cybersecurity Task Evaluation reward hacking AI Cybersecurity Threat Evaluation Framework OpenAI Cybersecurity Grant Program CyberSecEval 2 US Cyber and AI Safety Institute Cyber Verification Program UAR (Unforeseen Attack Robustness)U.S. Cyber Command

Recent events (1)

8Openai Blog·May 20, 2026·source ↗

Estimating Worst-Case Frontier Risks of Open-Weight LLMs

OpenAI introduces a methodology called malicious fine-tuning (MFT) to assess worst-case risks of releasing open-weight models, specifically applied to their internal model gpt-oss. The study attempts to elicit maximum dangerous capabilities in biology and cybersecurity domains through targeted fine-tuning. This represents a systematic effort to quantify uplift risks before open-weight releases, informing OpenAI's open-weight release policy.

Evaluation and Benchmarking Open Weights Progress cybersecurity risk uplift biology risk uplift malicious fine-tuning +3 more