Entity · model

Grok Voice Think Fast 1.0

modelactivegrok-voice-think-fast-1-0-edc4f786·1 events·first seen May 18, 2026

Aliases: Grok Voice Think Fast 1.0

Co-occurring entities

Scale AI Audio MultiChallenge GPT-Realtime-2 Google τ-Voice Artificial Analysis Conversational Dynamics GPT-Realtime-Translate xAI OpenAI Realtime API Gemini 3.1 Flash Live Preview Step-Audio R1.1 Realtime OpenAI GPT-Realtime-Whisper Artificial Analysis Big Bench Audio

More like this (12)

Grok Grok-4-Fast Grok Imagine Grok-3 Grok 4 Grok 4.3 Voice Engine WordVoice Gemini 3.1 Flash TTS Voxtral TTS VibeVoice OpenAI Voice AI

Recent events (1)

6The Batch·May 18, 2026·source ↗

OpenAI Updates Audio Models That Reason, Transcribe, and Translate

OpenAI introduced three new audio models in its Realtime API: GPT-Realtime-2 (speech-to-speech with five configurable reasoning effort levels), GPT-Realtime-Translate (70+ input languages), and GPT-Realtime-Whisper (transcription). GPT-Realtime-2 operates as an end-to-end audio model including reasoning, with latency ranging from 1.12 seconds at minimal effort to 2.33 seconds at high effort. Benchmark results are mixed: it leads Scale AI's Audio MultiChallenge and Artificial Analysis Conversational Dynamics but trails Step-Audio R1.1 Realtime and Grok Voice Think Fast 1.0 on speech reasoning and agentic tasks. The configurable reasoning-latency tradeoff is positioned as a key differentiator for voice agent applications.

Frontier Model Releases Evaluation and Benchmarking Scale AI Audio MultiChallenge GPT-Realtime-2 Google +14 more