What's the plan to tackle the horde of incoming AI bots?
I'm sure this is a common topic but the timeline is pretty fast these days.
With bots looking more human than ever i'm wondering what's going to happen once everyone start using them to spam the platform. Lemmy with it's simple username/text layout seem to offer the perfect ground for bots, to verify if someone is real is going to take scrolling through all his comments and read them accurately one by one.
The architecture of lemmy means that the API is completely open. Bots do not need to even scrape or do anything with the website, they can do everything through the API. it doesn't matter how simple the layout is. Lemmy is open source as well and the API is fully documented and available to the public.
Lemmy devs are going to need to do some additional work to differentiate bot/human accounts. In the meantime its going to be on the admins to identify and remove/ban these accounts.
Yes, but last I checked, it was documented very poorly. It's just bad enough that I'm not super motivated to build helpful tools, but not bad enough to discourage trolls.
Somewhat of a loaded question but, if we need to scroll through their comment history meticulously to separate real from bot, does it really matter at that point?
SPAM is SPAM and we’re all in agreement that we don’t want bots junking up the communities with low effort content. However if they reach the point that it takes real effort to ferret them out they must be successfully driving some sort of engagement.
I think we'll be in bad shape when you can't trust any opinions about products, media, politics, etc. Sure, shills currently exists, so everything you read already needs skepticism. But at some point bots will be able to flood very high quality posts. But these will of course be lies to push a product or ideology. The truth will be noise.
I do think this is inevitable, and the only real guard would be to move back to smaller social circles.
I’m of the mind that the truth already is noise and has been for a long, long time. AI isn’t introducing anything new, it’s just enabling faster creation of agenda-driven content. Most people already can’t identify the AI generated content that’s been spewing forth in years past. Most people aren’t looking for quality content, they looking for bias-affirming content. The overall quality is irrelevant.
Things like chatGPT are not designed to think using object relations like a human. Its designed to respond the way a human would, (a speach quartex with no brain), it is made to figure out what a human would respond with rather than give a well thoght out answer.
One of the cool things to me about Lemmy is it is like email where people have their own custom domain names. Personally I think people using their real identity should come back into fashion and post 9/11/2001 USA culture of terrorism fear-ism should not be the dominating media emotion in 2023.
"Real humans, not bots" for the ongoing Social Media reboot of Twitter since September 2022 and Reddit since May 2023 could really leverage it. The "throwaway" culture of Reddit.
ChatGPT GPT-4 is incredibly good at convincing human beings it gives factual information when it really is great at "sounding good, but being factually wrong". It's amazing to me how many people have embraced and even shown deep love towards the machines. It's pretty weird to me that a computer fed facts spits out anti-facts. Back in March I would doing a lot of research on ChatGPT's fabrication of facts, it made wild claims like Bill Gates traveled to New Mexico when BASIC was first created. It would even give pages from Bill Gate's book that did not have the quotes it provided. https://www.AuthoredByComputer.com/ has examples I documented.
That's an interesting theory of why it works that way. Personally, I think rights usage, as in copyright, is a huge problem for OpenAI and Microsoft (Bing)... and they are trying to avoid paying money for the training material they use. And if they accurately quoted source material, they would run into expensive costs they are trying to avoid.
The horde aspect might make it easier. The ones on twitter at least you can tell are just running the same script through a thesaurus basically. 20 people leaving the same comment is a little more obvious than just one
That's why they're talking about the next generation.
With AI you can easily generate 100 different ways to say the same thing. And it's hard to distinguish a bot that's parroting someone else from a person who's repeating something they heard.