AI Launch Profile

Advancing voice intelligence with new models in the API

OpenAI released GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper for realtime voice reasoning, translation, and streaming transcription in the API.

Company
OpenAI
Launch date
May 7, 2026
Launch type
Model Release
Category
AI Agents, AI Infrastructure, AI Voice/Audio Tools
Audience
Developers, Enterprises, Product Teams
Pricing
GPT-Realtime-2 is listed at $32 per 1M audio input tokens and $64 per 1M audio output tokens; GPT-Realtime-Translate is $0.034 per minute and GPT-Realtime-Whisper is $0.017 per minute.
Free plan
no
API
yes
Open source/open weight
no

Verification & Sources

Status
Verified
Source links
2
Freshness
Verified June 8, 2026
Last verified
June 8, 2026
Last updated
June 8, 2026
Suggest a correction

Kingy Scores

Launch Score
Unscored
Demo Quality
Unscored
YouTube Potential
Unscored

Kingy AI Take

A high-signal API launch because it moves voice AI toward realtime agents that can reason, translate, transcribe, and act during a conversation.

Who it is for

Developers and product teams building voice agents, multilingual support, live transcription, realtime translation, and speech-driven product workflows.

What feels promising

The release covers reasoning voice, live translation, and streaming transcription with explicit API pricing, making it easier for product teams to plan experiments.

What feels unproven

Production voice quality will depend on latency, barge-in handling, accuracy under noise, safety behavior, and cost at real call-center scale.

Traction notes

Voice agents are becoming a core enterprise AI category, and this launch gives OpenAI a new realtime audio stack for developers.