Entity · model

gpt-oss-20b

modelactivegpt-oss-20b-03cfea6b·4 events·first seen May 20, 2026

Aliases: gpt-oss-20b, GPT-oss 20B

Co-occurring entities

Apache 2.0 GPT-OSS 120B OpenAI Qwen2.5-7B BERTurk Conditional Scale Entropy mechanistic interpretability GPT-2 LLaMA-7B VUA Metaphor Corpus gpt-oss usage policy

More like this (12)

GPT-OSS 120B gpt-oss-safeguard GPT-OSS GPT-J 6B GPT-5.2 GPT-4o GPT-Image-2 GPT-f GPT-4V GPT GPT-next gpt-oss usage policy

Recent events (4)

3arXiv · cs.CL·Jun 8, 2026·source ↗

Supervised vs. in-context learning for Turkish multiword expression classification

A new arXiv paper evaluates Turkish idiomatic light verb construction (LVC) detection as a binary classification task, comparing a supervised BERTurk baseline against three instruction-tuned LLMs under zero-shot, one-shot, and few-shot prompting. Results show LLMs have very low LVC recall in zero-shot but improve substantially with demonstrations, though one-shot prompting can introduce strong model-specific biases. The supervised baseline remains competitive, while carefully constructed few-shot prompts allow GPT-OSS-20B and Qwen 2.5-14B to match or exceed it. The study highlights significant prompt sensitivity in Turkish metalinguistic classification tasks.

Evaluation and Benchmarking Qwen2.5-7B BERTurk gpt-oss-20b

5arXiv · cs.CL·May 21, 2026·source ↗

Conditional Scale Entropy: A Wavelet-Derived Tool for Mechanistic Interpretability of Metaphor Processing in Transformers

This paper introduces Conditional Scale Entropy (CSE), a wavelet-derived measure of how transformer computation engages across frequency scales at each layer, and applies it to study metaphor processing in decoder-only language models. The authors prove CSE is invariant to update magnitude, isolating structural computation patterns from intensity. Across architectures ranging from GPT-2 (124M) to LLaMA-2 7B and GPT-oss 20B, metaphorical tokens consistently produce higher spectral breadth than literal tokens in early-to-mid layers, with the effect surviving permutation correction and specificity controls. The work establishes multi-scale coordination as a consistent mechanistic signature of metaphorical language processing and positions CSE as a general interpretability tool for cross-depth structure in transformers.

Evaluation and Benchmarking AI Safety Research Conditional Scale Entropy mechanistic interpretability GPT-2 +3 more

9Openai Blog·May 20, 2026·source ↗

OpenAI Releases gpt-oss-120b and gpt-oss-20b Open-Weight Reasoning Models

OpenAI has published model cards for gpt-oss-120b and gpt-oss-20b, two open-weight reasoning models released under the Apache 2.0 license alongside a dedicated gpt-oss usage policy. This marks a significant move by OpenAI into the open-weights space, offering both a large 120B parameter model and a smaller 20B variant. The release signals a strategic shift for OpenAI, which has historically kept its frontier models proprietary.

Frontier Model Releases Open Weights Progress gpt-oss usage policy Apache 2.0 GPT-OSS 120B +4 more

9Openai Blog·May 20, 2026·source ↗

OpenAI Releases gpt-oss-120b and gpt-oss-20b Open-Weight Models Under Apache 2.0

OpenAI is releasing two open-weight language models, gpt-oss-120b and gpt-oss-20b, under the Apache 2.0 license. The models are claimed to outperform similarly sized open models on reasoning tasks and feature strong tool use capabilities. They are optimized for efficient deployment on consumer hardware, positioning them as cost-effective alternatives in the open-weights ecosystem.

Frontier Model Releases Open Weights Progress Apache 2.0 GPT-OSS 120B OpenAI +3 more