The AI Community On Kbin @kbin.social DarkGamer @kbin.social 12 mo. ago

Hands-on with Gemini: Interacting with multimodal AI

Gemini is our natively multimodal AI model capable of reasoning across text, images, audio, video and code. This video highlights some of our favorite intera...

Technology @beehaw.org Jeena @jemmy.jeena.net 12 mo. ago

Technology @lemmy.world Jeena @jemmy.jeena.net 12 mo. ago

Hands-on with Gemini: Interacting with multimodal AI

1 comments

This is an impressive demo, it seems like just making training multimodal models approaches the appearance of symbolic understanding. It's a surprise how effective this can be.

Edit: evidently it was faked https://techcrunch.com/2023/12/07/googles-best-gemini-demo-was-faked/