Building low-latency voice agents in 3 lines of code with GPT Realtime 2

LiveAgent on AG2 Beta

OpenAI recently announced advanced voice intelligence in the API, including GPT Realtime 2 for lower-latency, more natural spoken interactions. AG2 Beta wraps that class of model behind LiveAgent: one bidirectional session, continuous audio in and out, and provider-side voice activity detection, so users can speak and interrupt as they would on a phone call rather than taking turns as in a walkie-talkie app.

In this post we walk through why that matters, how LiveAgent compares to the classic STT → Agent → TTS stack, and how to add tools and subagent-style delegation without giving up the realtime voice surface.