Meta Platforms Inc. is facing a copyright infringement lawsuit from author Christopher Farnsworth, who alleges that the company’s large language model was trained on pirated books. The complaint, filed in the U.S. District Court for the Northern District of California, claims that Meta utilized an 800 GB open-source dataset known as The Pile, which includes a database of pirated books called Books3. This dataset has been central to other lawsuits against tech firms accused of using nearly 200,000 copied books. Farnsworth’s suit highlights ongoing concerns regarding the legality of artificial intelligence training practices, stating that “The trove of nearly 200,000 copied books” raises significant legal questions.
Leave a Reply