thewayne: (Default)
[personal profile] thewayne
This is laughable insanity.

Nvidia is defending itself in a lawsuit from a bunch of authors that their works were used - without license or any form of authorization - to train up Nvidia's LLM platform. Apparently Nvidia got the data from scraping pirate ebook web sites....

Quoth the article: "Nvidia seemed to defend the shadow libraries as a valid source of information online when responding to a lawsuit from book authors over the list of data repositories that were scraped to create the Books3 dataset used to train Nvidia's AI platform NeMo.

That list includes some of the most "notorious" shadow libraries—Bibliotik, Z-Library (Z-Lib), Libgen, Sci-Hub, and Anna's Archive, authors argued. However, Nvidia hopes to invalidate authors' copyright claims partly by denying that any of these controversial websites should even be considered shadow libraries."


Copyright infringement is a pretty simple standard, which these sites clearly violate. Now, some may contain books that are out of copyright, or completely unavailable, but a bulk of their content is illegal under U.S. law. That is pretty clear. Quibbling over the definition of the term 'shadow library' is a complete waste of the court's time and isn't going to win them any points with the judge.

This is not going to work.

https://arstechnica.com/tech-policy/2024/05/nvidia-denies-pirate-e-book-sites-are-shadow-libraries-to-shut-down-lawsuit/

Date: 2024-05-29 12:11 am (UTC)
disneydream06: (Disney Surprised)
From: [personal profile] disneydream06
I hope nothing stupid happens and those authors prevail. :)
Hugs, Jon

Date: 2024-05-29 02:18 am (UTC)
kathmandu: Close-up of pussywillow catkins. (Default)
From: [personal profile] kathmandu
Nvidia may also find their product isn't marketable, and is blatantly displaying their guilt. There was already one instance of an LLM trained on AO3, that started regurgitating porn of very specific kinds, very clearly traceable.

Date: 2024-05-29 06:28 pm (UTC)
warriorsavant: (Books (Trinity College Library))
From: [personal profile] warriorsavant

Frankly, I would consider the whole issue of LLM training on copyrighted material to be a clear case of infringement, but even more than most of law, copyright law depends on who has the better lawyers and deeper pockets.

BTW, just realized I've been writing "copywrite" for years, even though I know it is "the right to copy" not the "copy that is written."

Date: 2024-05-31 10:15 pm (UTC)
silveradept: A kodama with a trombone. The trombone is playing music, even though it is held in a rest position (Default)
From: [personal profile] silveradept
I would not be surprised to see nVidia decide that the best way to defend their copyright infringement is to pass off the blame onto the sites themselves that did the initial infringement. They, after all, were merely scraping content that was made freely available online. Surely the blame lies with the people who shouldn't have been posting copyrighted content online?

(This will not work, but I won't be surprised if something like that is their defense.)

May 2025

S M T W T F S
    1 23
45678910
1112 131415 1617
18 19 20 212223 24
25262728 2930 31

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated Jun. 7th, 2025 01:07 pm
Powered by Dreamwidth Studios