Hands-on with Gemini: Interacting with multimodal AI
cross-posted from: https://jemmy.jeena.net/post/413507
Gemini is Google's natively multimodal AI model capable of reasoning across text, images, audio, video and code. This video highlights some of their favorite interactions with Gemini. Learn more and try the model: https://deepmind.google/gemini
Pretty cool tech demo.
The commonality-of-ducks bit was a pretty poor example of information sharing, however.
Wonder how well this performs in a real-world setting and, this being Google, how long until they cancel it too.