Hands-on with Gemini: Interacting with multimodal AI
Gemini is our natively multimodal AI model capable of reasoning across text, images, audio, video and code. This video highlights some of our favorite intera...
This is an impressive demo. It seems that simply training multimodal models gets close to the appearance of symbolic understanding. It's surprising how effective this can be.