Hands-on with Gemini: Interacting with multimodal AI

cross-posted from: https://jemmy.jeena.net/post/413507

Gemini is Google's natively multimodal AI model, capable of reasoning across text, images, audio, video, and code. This video highlights some of their favorite interactions with Gemini. Learn more and try the model: https://deepmind.google/gemini

1 comment
  • Pretty cool tech demo.

    The commonality-of-ducks part was a pretty poor example of information sharing, however.

    Wonder how well this performs in a real-world setting and, this being Google, how long until they cancel this too.