Science @beehaw.org Gaywallet (they/it) @beehaw.org 4 mo. ago

A peer reviewed journal with nonsense AI images was just published

twitter.com /kareem_carr/status/1758148011245371627

You're viewing a single thread.

20 comments

This feels like clickbait to me, as the fundamental problem clearly isn't AI. At least to me it isn't. The title would have worked as well without AI in the title. The fact that the images are AI generated isn't even that relevant. What is worrying is that the peer review process, at least for this journal clearly is faulty as no actual review of the material took place.

If we do want to talk about AI. I am impressed how well the model managed to actually create text made up of actual letters resembling words. From what I have seen so far that is often just as difficult for these models as hands are.
- Simplifying this down to an issue of just the review process flattens out the problem that generative AI does not think in the same way that generative human content does. There's additional considerations that need to be made when considering using generative AI, namely that generative AI does not have a sum of knowledge to pull from in order to keep certain ideas in check, such as how large an object should appear and it doesn't have the ability to fact check relevancy with other objects within the image.
  
  We need to think about these issues in depth because we are introducing a non-human, specific kind of bias into literature. If we don't think about it systematically we can't create a process which intends to limit or reduce the amount of bias introduced by allowing this kind of content. Yes, the review process can and should already catch a lot of this, but I'm not convinced that waving our hands and saying that review is enough is adequate to fully address the biases we may be introducing.
  
  I think there's a much higher chance of introducing bias or false information in highly specialized fields where the knowledge necessary to determine if something was generated incorrectly, since generative AI does not draw upon facts or fact check, is in fact, correct. Reviewers are not perfect, and may miss things. If we then draw upon this knowledge in the future to direct additional studies we might create a house of cards which becomes very difficult to undo. We already have countless examples of this in science where a study with falsified data or poor methodology breeds a whole field of research which struggles to validate the original studies and eventually needs to be retracted. We could potentially have situations in which the study is validated but an image influences how we even think (or can acquire funding for) a process should work. Having strong protections such as requiring that AI images be clearly notated that they were created via AI, can help to mitigate these kinds of issues.
  
  I totally see why you are worried about all the aspects AI introduces, especially regarding bias and the authenticity of generated content. My main gripe, though, is with the oversight (or lack thereof) in the peer review process. If a journal can't even spot AI-generated images, it raises red flags about the entire paper's credibility, regardless of the content's origin. It's not about AI per se. It is about ensuring the integrity of scholarly work. Because realistically speaking, how much of the paper itself is actually good or valid? Even more interesting, and this would bring AI back in the picture. Is the entire paper even written by a human or is the entire thing fake? Or maybe that is also not interesting at all as there are already tons of papers published with other fake data in it. People that actually don't give a shit about the academic process and just care about their names published somewhere likely already have employed other methods as well. I wouldn't be surprised if there is a paper out there with equally bogus images created by an actual human for pennies on Fiverr.
  
  The crux of the matter is the robustness of the review process, which should safeguard against any form of dubious content, AI-generated or otherwise. Which is what I also said in my initial reply, I am most certainly not waving hands and saying that review is enough. I am saying that it is much more likely the review process has already failed miserably and most likely has been for a while.
  
  Which, again to me, seems like the bigger issue.
  
  My main gripe, though, is with the oversight (or lack thereof) in the peer review process. If a journal can’t even spot AI-generated images, it raises red flags about the entire paper’s credibility, regardless of the content’s origin.
  
  The crux of the matter is the robustness of the review process
  
  The pace at which AI can generate bullshit not only currently vastly outstrips the ability for individual humans to vet it, but is actually accelerating. We cannot manually solve this by saying "people just need to catch it." Look at YT with CSAM or other federal violations - they literally can't keep up with the content coming in despite having armies of people (with insane turnover I might add) trying to do it. So the bar has been changed from "you can't have any of this stuff" to "you must put in reasonable effort to minimize it," because we've simply accepted it can't be done with humans - and that's with the assistance of their current algorithms constantly scouring their content for red flags. Bear in mind this is an international, massive company with resources these journals can't even dream of and almost all this content has been generated and uploaded by individual people.
  
  These people I'm sure are perfectly capable of catching AI generated nonsense most of the time. But as the content gets more sophisticated and voluminous, the problem is only going to get worse. Stuff is going to get through. So we are at a crossroads where we throw up our hands and say "well there's not much we can do, good luck separating the wheat from the chaff," or we get creative. And this isn't just in academic journals either. This is crossing into more and more industries, in particular if it requires writing. Someone(s) is throwing money and resources at getting AI to do it faster and cheaper than people can.
  
  I feel like two different problems are conflated into one though.
  
  The academic review process is broken.
  
  AI generated bullshit is going to cause all sorts of issues.
  
  Point two can contribute to point 1 but for that a bunch of stuff needs to happen. Correct my if I am wrong but as far as my understanding of peer-review processes are supposed to go it is something along the lines of:
  
  A researcher submits their manuscript to a journal.
  
  An editor of that journal validates the paper fits within the scope and aims of the journal. It might get rejected here or it gets send out for review.
  
  When it does get send out for review to several experts in the field, the actual peer reviewers. These are supposed to be knowledgeable about the specific topic the paper is about. These then read the paper closely and evaluate things like methodology, results, (lack of) data, and conclusions.
  
  Feedback goes to the editor, who then makes a call about the paper. It either gets accepted, revisions are required or it gets rejected.
  
  If at point 3 people don't do the things I highlighted in bold then to me it seems like it is a bit silly to make this about AI. If at point 4 the editor ignores most feedback for the peer reviewers, then it again has very little to do with AI and everything the a base process being broken.
  
  To summarize, yes AI is going to fuck up a lot of information, it already has. But by just shouting, “AI is at it again with its antics!” at every turn instead of looking further and at other core issues we will only make things worse.
  
  Edit:
  
  To be clear, I am not even saying that peer reviewers or editors should "just do their job already". But fake papers have been increasingly an issue for well over a decade as far as I am aware. The way the current peer review process works simply doesn't seem to scale to where we are today. And yes, AI is not going to help with that, but it is still building upon something that already was broken before AI was used to abuse it.
  
  But by just shouting, “AI is at it again with its antics!” at every turn instead of looking further and at other core issues we will only make things worse.
  
  I think this is a very unfair characterization of what I and others have voiced. This has always been a fundamental issue when talking to AI-evangelists, which you may not be but your argument seems to fall in line with. There is this inherently defensive posture I find whenever a critique is levied of AI, yet if I were so protective of something like the internal combustion engine, people would (rightfully) raise eyebrows.
  
  I agree that AI is a tool and often it is just widening cracks that exist, but we need to deal with these issues on multiple fronts and acknowledge that reckless adoption exacerbates the issue. And the new front that AI has opened up is scale. The ability for even someone with a modest, home-rolled LLM to just flood the internet with a bunch of crappy blog spam is outrageous and wasn't even a possible 5 years ago. One person can do the damage of a thousand. Run a cursory google search and see what SEO + AI blog spam has wrought.
  
  Characterizing it as "this was already an issue it's not AI’s fault" is overly reductionist at its core. It's passing the buck and saying that AI in no way, shape, or form, bears any responsibility for the problem. That just means we aren't looking critically at what is a ultimately a tool and how it can be used for harm.
  
  But fake papers have been increasingly an issue for well over a decade as far as I am aware.
  
  Yes but these articles were not nearly as prolific. We are talking orders of magnitude more crap to sift through already occurring across many industries. It has never been this bad. Give the journals 1000 people and 100x the budget and eventually they will still be overcome. It's not just "fix the review process." It's a complicated issue that is exploited in multiple ways.
  
  I feel like this is the third time people are selective reading into what I have said.
  
  I specifically acknowledge that AI is already causing all sorts of issues. I am also saying that there is also another issue at play. One that might be exacerbated by the use of AI but at its root isn't caused by AI.
  
  In fact, in this very thread people have pointed out that *in this case" the journal in question is simply the issue. https://beehaw.org/comment/2416937
  
  In fact. The only people likely noticed is, ironically, the fact that AI was being used.
  
  And again I fully agree, AI is causing massive issues already and disturbing a lot of things in destructive ways. But, that doesn't mean all bullshit out there is caused by AI. Even if AI is tangible involved.
  
  If that still, in your view, somehow makes me sound like an defensive AI evangelist then I don't know what to tell you...
  
  Permanently Deleted
  
  The fact that you specifically respond to this one highly specific thing. While I clearly have written more is exactly what I mean.
  
  shrugs
  
  Permanently Deleted
  
  Would you like me to quote every single one of your lines, line by line, and respond to them?
  
  No, that's not really what I'm asking for. I'm also not looking for responses that isolate a single sentence from my longer messages and ignore the context. I'm not sure how to make my point any clearer than in my first reply to you, where I started with two bullet points. You seemed to focus on the second, but my main point was about the first. If we do want to talk about standard behavior in human conversation, generally speaking, people do acknowledge that they have heard/read something someone said even if they don't respond to it in detail.
  
  Again, I've been agreeing that AI is causing significant problems. But in the case of this specific tweet, the real issue is with a pay to publish journal where the peer review process is failing, not AI. This key point has mostly been ignored. Even if that was not the case, if you want to have any change of trying to combat the emergence of AI I think it is pretty reasonable to question if the basic processes in place are even functioning in the first place. Where my thesis (again, if this wasn't a pay to publish journal) would be that this is likely not the case as in that entire process clearly nobody looked closely at these images. And just to be extra clear, I am not saying that AI never will be an issue, etc. But if reviewing already isn't happening at a basic level how are you ever hoping to combat AI in the first place?
  
  When did anyone say
  
  But by just shouting, “AI is at it again with its antics!” at every turn instead of looking further and at other core issues we will only make things worse”
  
  The context of this tweet, saying "It’s finally happened. A peer-reviewed journal article with what appear to be nonsensical AI-generated images. This is dangerous.", does imply that. I've been responding with this in mind, which should be clear. It is this sort of thing I mean when I say selective reading when you seemingly take it as me saying that you personally said exactly that. Which is a take, but not one I'd say is reasonable if you take the whole context into account.
  
  And in that context, I've said:
  
  that doesn’t mean all bullshit out there is caused by AI
  
  Which I stand by. In this particular instances, in this particular context AI isn't the issue and somewhat clickbait. Which makes most of what you argued about valid concerns. Youtube struggling, SEO + AI blog spam, etc are all very valid and concerning of AI causing havoc. But in this context of me calling a particular tweet clickbait they are also very much less relevant. If you just wanted to discuss the impact of AI in general and step away from the context of this tweet, then you should have said so.
  
  Now, about misrepresenting arguments:
  
  If you are reaffirming somebody else’s comment, you are generally standing behind most if not all of what they said. But nobody here is saying or doing the things you are claiming. You are tilting at windmills.
  
  Have you looked back at your own previous comments when you wrote that? Because while have this, slightly bizarre, conversation I have gone back to mine a few times. Just to check if I actually did mess up somewhere or said things differently that I thought I did. The reason I am asking is that I have been thrown a few of these remarks from you where I could have responded with the above quote myelf. Things like "It’s passing the buck and saying that AI in no way, shape, or form, bears any responsibility for the problem."
  
  I am most certainly not waving hands and saying that review is enough
  
  Apologies, that's what it sounded like to me. You said it's clickbait. You said the title would work without AI in the title. You also said that AI generation isn't relevant. That felt like diminishing the conversation - focusing in on what you're most concerned about, and dismissing all other discussions. I don't think that helps discussion happen. It discourages it. It says that we shouldn't talk about the problems present here which exist outside the realm of just the review process.
  
  For example, both of the figures do have a description, but neither of them have any kind of attribution. The review process might ensure it is factual when it is followed and still let through material such as that you've laid out above which do not involve AI - like hiring someone off of fiverr. One way to solve this would be with image attribution. As I mentioned above, simply requiring that an image explain where it came from, such as requiring attribution to the artist who created the figure or requiring that the software used be attributed (perhaps even requiring the full prompt for generated images) are all methods through which we can ensure scientific rigor (and accurate attribution) which will both help ensure the review process catches problematic material and cues the readers in to key information about the figures present in research.
  
  I said clickbait about the AI specific thing. Which I do stand by. To be more direct, if peer reviewers don't review and editors don't edit you can have all the theoretical safeguards in place, but those will do jack shit. Procedures are meaningless if they are not being followed properly.
  
  Attributions can be faked, just like these images are now already being faked. If the peer review process is already under tremendous pressure to keep up for various reasons then adding more things to it might actually just make things worse.
  
  Rather the opposite: simplifying this down to an issue of just an AI introducing some BS, flattens out the problem that grifter journals don't follow a proper peer review process.
  
  introducing bias or false information in highly specialized fields
  
  Reviewers are not perfect, and may miss things
  
  It's called a "peer review" process for a reason. If there are not enough peers in a highly specialized field to conduct a proper review, then the article should stay on arxiv or some other preprint server until enough peers can be found.
  
  Journals that charge for "reviewing" BS, no matter if AI generated, or by a donkey with a brush tied to its tail, should be named and shamed.
  
  We already have countless examples of this in science where a study with falsified data or poor methodology breeds a whole field of research which struggles to validate the original studies and eventually needs to be retracted.
  
  ...and no AI was needed. Goes to show how AI is the red herring here.
- Modern AI image generators are pretty good with creating text (and hands). You're right that that's very recent though (like the last 6 months); they used to be bad at it.
  
  American classic car,1935 Ford Pickup, poster ,retro ,illustrator, vibrant colors,Idaho color view landscape, with words "Idaho"
  
  Oh huh, you are right. I threw that exact prompt in Dall-e and got indeed legible letters.

You've viewed 20 comments.