The capabilities of multimodal AI | Gemini Demo
Published on Wed, Dec 6th 2023 Science & Technology Rectangular HD
Our natively multimodal AI model Gemini is capable of reasoning across text, images, audio, video and code. Here are favorite moments with Gemini Learn more and try the model: https://deepmind.google/gemini
Explore Gemini: https://goo.gle/how-its-made-gemini
For the purposes of this demo, latency has been reduced and Gemini outputs have been shortened for brevity.
Subscribe to our Channel: https://www.youtube.com/google
Tweet with us on X: https://twitter.com/google
Follow us on Instagram: https://www.instagram.com/google
Join us on Facebook: https://www.facebook.com/Google
0:00 Intro
0:19 Multimodal Dialogue
1:32 Multilinguality
2:04 Game Creation
2:31 Visual Puzzles
3:17 Making Connections
3:39 Image & Text Generation
4:06 Logic & Spatial Reasoning
4:55 Translating Visuals
5:27 Cultural Understanding
General statistics
- Total
-
2,995,412
- Total
-
xxx.xx
- Total
-
xxx.xx
- Total
-
00:06:23
Metadata
Topics
Videolists
No videolist for this video.
Content
No format for this video.
Public statistics
Private statistics
Watch time
This widget is only available to channel's owner.
Sign inSubscribers gained
This widget is only available to channel's owner.
Sign inShares
This widget is only available to channel's owner.
Sign inEstimated demography
This widget is only available to channel's owner.
Sign inEstimated audience
This widget is only available to channel's owner.
Sign in