Posted: 2025-04-13 17:49:03 UTC

This article contains some claims that remain unverified. While much of the content may be accurate, exercise care when relying on this information.
Status
Last Updated: 2025-04-13 17:52:19 UTC
Verified By: Rollup News
The author clarifies their previous tweet regarding the NYT lawsuit against companies using copyrighted content to train LLMs. They emphasize that regurgitating copyrighted content at scale without permission is unacceptable. They also discuss the role of RAG (Retrieval-Augmented Generation) in the examples of copyright violations presented in the lawsuit and question the extent of harm caused to the NYT by LLMs regurgitating their text.
The importance of obtaining permission or having a fair-use rationale when using copyrighted content to train LLMs.
The potential role of RAG in copyright violations involving LLMs.
The limited harm caused to the NYT by LLMs regurgitating its text, given how rarely this occurs and that newer versions of ChatGPT have closed the loopholes.
Companies regurgitating copyrighted content at scale without permission.
LLMs potentially using RAG to bypass paywalls and regurgitate copyrighted content.
Determining the actual extent of harm caused to copyright holders by LLMs.