
Meta torrented over 81.7TB of pirated books to train AI, authors say

They also didn't seed.

Supposedly, Meta tried to conceal the seeding by not using Facebook servers while downloading the dataset to "avoid" the "risk" of anyone "tracing back the seeder/downloader" from Facebook servers, an internal message from Meta researcher Frank Zhang said, while describing the work as in "stealth mode." Meta also allegedly modified settings "so that the smallest amount of seeding possible could occur," a Meta executive in charge of project management, Michael Clark, said in a deposition.

28 comments
  • Corporation pirates millions of books to train AI: No charge

    Bourgeois individual commits billions of dollars in fraud: 40 months in country club prison

    Homeless man steals $100 and gives it back: 15 years in general population prison

    Any questions?

  • much less that Plaintiffs’ books were somehow distributed by Meta.

    I guess Meta may have used settings to be leech-only. But unless they can show that they did (which is of course poor practice when torrenting), the nature of torrenting by default means that even one piece of a file seeded to another user is "distribution."
