AI Launch Profile

Advancing voice intelligence with new models in the API

OpenAI released GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper for realtime voice reasoning, translation, and streaming transcription in the API.

At a glance

Launch Snapshot

Company: OpenAI
Launch date: May 7, 2026
Launch type: Model Release
Category: AI Agents, AI Infrastructure, AI Voice/Audio Tools
Audience: Developers, Enterprises, Product Teams
Pricing: GPT-Realtime-2 is listed at $32 per 1M audio input tokens and $64 per 1M audio output tokens; GPT-Realtime-Translate is $0.034 per minute and GPT-Realtime-Whisper is $0.017 per minute.
Free plan: No
API: Yes
Open weights/source: No
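
To make the listed pricing easier to plan against, here is a minimal cost-estimate sketch. It uses only the rates stated in this record. The function names and the token and minute figures are hypothetical, chosen for illustration; the record does not say how many audio tokens a minute of speech consumes, so token counts have to come from your own usage data.

```python
from decimal import Decimal

# Rates as listed in this record (USD). Recheck against the official
# pricing page before relying on them.
REALTIME_2_INPUT_PER_1M = Decimal("32")    # per 1M audio input tokens
REALTIME_2_OUTPUT_PER_1M = Decimal("64")   # per 1M audio output tokens
TRANSLATE_PER_MINUTE = Decimal("0.034")    # GPT-Realtime-Translate
WHISPER_PER_MINUTE = Decimal("0.017")      # GPT-Realtime-Whisper


def realtime_2_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate GPT-Realtime-2 audio cost from token counts (hypothetical helper)."""
    million = Decimal(1_000_000)
    return (
        REALTIME_2_INPUT_PER_1M * input_tokens
        + REALTIME_2_OUTPUT_PER_1M * output_tokens
    ) / million


def per_minute_cost(minutes: int, rate_per_minute: Decimal) -> Decimal:
    """Estimate cost for a per-minute priced model (hypothetical helper)."""
    return Decimal(minutes) * rate_per_minute


if __name__ == "__main__":
    # Illustrative volumes only, not measured usage.
    print(realtime_2_cost(1_000_000, 500_000))
    print(per_minute_cost(100, TRANSLATE_PER_MINUTE))
    print(per_minute_cost(60, WHISPER_PER_MINUTE))
```

With these illustrative volumes, 1M input tokens plus 500K output tokens on GPT-Realtime-2 comes to $64, 100 minutes of GPT-Realtime-Translate comes to $3.40, and 60 minutes of GPT-Realtime-Whisper comes to $1.02. Decimal is used instead of float so that small per-minute rates do not pick up rounding noise.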

Verification & Sources

Status: Verified
Source links: 2
Freshness: Needs recheck (last verified June 8, 2026)
Last verified: June 8, 2026
Last updated: June 11, 2026

Key source checks

Official site
OpenAI API pricing page

Kingy AI Take

This is a high-signal API launch: it moves voice AI toward realtime agents that can reason, translate, transcribe, and act during a conversation.

Who it is for

Developers and product teams building voice agents, multilingual support, live transcription, realtime translation, and speech-driven product workflows.

What feels promising

The release covers reasoning voice, live translation, and streaming transcription with explicit API pricing, making it easier for product teams to plan experiments.

What feels unproven

Production readiness will depend on latency, barge-in handling, accuracy under noise, safety behavior, and cost at real call-center scale.

Traction notes

Voice agents are becoming a core enterprise AI category, and this launch gives OpenAI a new realtime audio stack for developers.

Source-backed record

Verified Sources

Official launch source: openai.com
OpenAI API pricing page: openai.com