Wednesday, November 26, 2025

Apple Intelligence Training Lawsuit

Mariella Moon:

Two authors have filed a lawsuit against Apple, accusing the company of infringing on their copyright by using their books to train its artificial intelligence model without their consent. The plaintiffs, Grady Hendrix and Jennifer Roberson, claimed that Apple used a dataset of pirated copyrighted books that include their works for AI training. They said in their complaint that Applebot, the company’s scraper, can “reach ‘shadow libraries’” made up of unlicensed copyrighted books, including (on information) their own. The lawsuit is currently seeking class action status, due to the sheer number of books and authors found in shadow libraries.

Malcolm Owen:

The suit hinges on whether Apple used the dataset referred to as “Books3.” The suit alleges that Books3 is based on the contents of a “shadow library” website known as Bibliotik, which allegedly hosted the contents of thousands of books.

The dataset was available on HuggingFace before being removed in October 2023, and it was also included as part of the RedPajama dataset. RedPajama was used as part of the OpenELM open-source models, which Apple made available in 2024.

Since Apple used a dataset that was connected to pirated books for OpenELM, the suit believes that Apple probably used the same techniques to train its Foundation Language Models.

[…]

In July, Apple doubled down on its claims of being ethical, including items accessible from the Internet. In a research paper, it explained that, if a publisher didn’t agree to data being scraped for training, it will not scrape the content.

Previously:

1 Comment RSS · Twitter · Mastodon


Jammie Thomas was a private citizen who had to pay over $9,000 dollars per song for pirating 24 songs

Apple is a trillion dollar public company who seems to have pirated hundreds or even thousands of *books*.

A book is generally worth significantly more than a single song. A company with Apple’s money and power should have to pay much more than an everyday person. And because Apple is so visible, letting them off easy would set a very bad example for everyone else

I think Apple should have to pay each author $9 million for each book they pirated. As a Trillion dollar company with record sales, this isn’t even that much. Just an amount that would be more than a slap on the wrist

Leave a Comment