Building Voice Agents with Gemini 3

31 Views

Thanks! Share it with your friends!

You disliked this video. Thanks for the feedback!

Published Mar 27, 2026

*Build real-time conversational agents with Gemini 3*
Thor from Google DeepMind walks through the Gemini Live API, showing how to build natural, human-like voice interactions powered by Gemini’s native audio model: speech-to-speech, no text in the middle, with emotional nuance, multilingual support, and real-time tool use.

*What’s covered:*
Testing in Google AI Studio, streaming audio and video frames, configuring voices and system instructions, WebSocket integration with the GenAI SDK, session management, interruption handling, and deploying with partner frameworks like LiveKit, Daily, and Stream.

Get started:
Try Gemini Live in Google AI Studio and grab your API key to start building.

Resources:
✅Live API documentation → https://goo.gle/4rObPZV
✅GitHub examples → https://goo.gle/4c4sZhi
✅Blog post → https://goo.gle/4m1KoLa

What are you building with Gemini Live? Drop it in the comments.

Subscribe to Google for Developers → https://goo.gle/developers

Speaker: Thor Schaeff
Products Mentioned:

Category: Project
Tags: Google, developers, pr_pr: AI DevRel (fka Core ML);

Be the first to comment

Sign in

Create your account

Add Video

Building Voice Agents with Gemini 3

Up Next