That's because this isn't something coming from the AI itself. All the people blaming the AI or calling this a "hallucination" are misunderstanding the cause of the glue pizza thing.
The search result included a web page that suggested using glue. The AI was then told "write a summary of this search result", which it then correctly did.
Gemini operating on its own doesn't have that search result to go on, so no mention of glue.
Not quite, it is an intelligent summary. More advanced models would realize that is bad advice and not give it. However for search results, google uses a lightweight, dumber model (flash) which does not realize this.
I tested with rock example, albiet on a different search engine (kagi). The base model gave the same answer as google (ironically based on articles about google's bad results, it seems it was too dumb to realize that the quotations in the articles were examples of bad results, not actual facts), but the more advanced model understood and explained how the bad advice had been spreading around and you should not follow it.
It isn't a hallucination though, you're right about that
You just haven’t gaslighted your ai into saying the glue thing. If you keep trying by saying things like “what about non-toxic glue” or “aren’t there glues designed for humans” the ai will finally give in and recommend the glue. Don’t give up. Glue is good for us.
Main issue is Gemini traditionally uses it's training data and the version answering your search is summarising search results, which can vary in quality and since it's just a predictive text tree it can't really fact check.
Yeah when you use Gemini, it seems like sometimes it’ll just answer based on its training, and sometimes it’ll cite some source after a search, but it seems like you can’t control that. It’s not like Bing that will always summarize and link where it got that information from.
I also think Gemini probably uses some sort of knowledge graph under the hoods, because it has some very up to date information sometimes.
You can't just "update" models to not say a certain thing with pinpoint accuracy like that. Which one of the reasons why it's so challenging to make AI not misbehave.
Not exactly. The answers would be exactly the same given the exact same inputs if they didn't intentionally and purposefully inject some random jitter into the algorithm each time specifically to avoid getting the same answer each time
It’s not just random jitter, it also likely adds context, including the device you’re using, other recent queries, and your relative location (like what state you’re in).
I don’t work for Google, but I am somewhat close to a major AI product, and it’s pretty much the industry standard to give some contextual info to the model in addition to your query. It’s also generally not “one model”, but a set of models run in sequence— with the LLM (think chatGPT) only employed at the end to generate a paragraph from a conclusion and evidence found by a previous model.
I'm almost sure that they use the same model for Gemini and for the A"I" answers, so patching the "put glue on pizza" answer for one also patches it for another.
Nope it’s because on Search it was summarizing the first results, the “pure Gemini” isn’t doing a search at that time, it’s just answering based on what it knows.
Y'all losing your mind intentionally misunderstanding what happened with the glue. Y'all are becoming anti ai lemons just looking for rage bait.
The AI doesn't need to be perfect. Just better than the average person. That why the shitty Tesla said driving has such good accident rates despite the fuck ups everyone loves to rage about in the news cycle.