Technology @beehaw.org Jeena @jemmy.jeena.net 12 mo. ago

Hands-on with Gemini: Interacting with multimodal AI

cross-posted from: https://jemmy.jeena.net/post/413507

Gemini is Google's natively multimodal AI model capable of reasoning across text, images, audio, video and code. This video highlights some of their favorite interactions with Gemini. Learn more and try the model: https://deepmind.google/gemini

The AI Community On Kbin @kbin.social DarkGamer @kbin.social 12 mo. ago

Technology @lemmy.world Jeena @jemmy.jeena.net 12 mo. ago

Hands-on with Gemini: Interacting with multimodal AI

1 comments

Pretty cool tech demo.

The commonality of ducks was a pretty poor information sharing however.

Wonder how well this performs in a real world setting, and being Google how long until they cancel this too.