Skip Navigation

'Meta Torrented over 81 TB of Data Through Anna's Archive, Despite Few Seeders'

torrentfreak.com

'Meta Torrented over 81 TB of Data Through Anna's Archive, Despite Few Seeders' * TorrentFreak

33 comments
  • This doesn’t mean that Meta denies using shadow libraries, its argument is that using such data to train its LLM models constitutes fair use under U.S. copyright law.

    Oh wow, I'm very much looking forward to this argument... "We believe pirating the copyrighted commercial works of others en masse to develop our own commercial product constitutes fair use... China bad!"

  • I don't usually like Meta, but here they used that data to produce open weights models available to the public. That sort of thing is what piracy is for so I support it.

33 comments