Using Gemini Pro Vision for multimodal use cases with text, images, and videos

What are the applications of multimodality with Gemini? This session covers a range of multimodal use cases spanning text, images, and video, and offers ideas for applying multimodality to practical business scenarios. You'll also get hands-on experience with Gemini Pro Vision.

To complete this workshop, you will need a laptop and a Google Cloud Project.

Walk through an interactive notebook with multimodal use cases with Gemini → https://goo.gle/4b98tbY
Learn about multimodal prompts in the Gemini documentation → https://goo.gle/4aNzaTV
Try out multimodal capabilities in Gemini Pro Vision to create a retail recommendation system → https://goo.gle/49PRc6I

NOTE: Cloud Credits discussed in this session or workshop were available to live audiences only.

Speakers: Lavi Nigam, Katie Nguyen

Watch more:
Check out all the AI videos at Google I/O 2024 → https://goo.gle/io24-ai-yt

Subscribe to Google Developers → https://goo.gle/developers

#GoogleIO

Products Mentioned: Gemini
Event: Google I/O 2024
Category: Project
Tags: Google, developers, Google I/O