this post was submitted on 22 Aug 2023


OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series

A new research paper laid out ways in which AI developers should try to avoid showing that LLMs have been trained on copyrighted material.

[–] [email protected] 18 points 1 year ago (2 children)

It is not a derivative work; it is a transformative work. Just as human artists "synthesise" the art they see around them and make new art, so do LLMs.

[–] [email protected] 2 points 1 year ago (1 children)

Transformative works are not a thing.

If you copy the copyrightable elements of another work, you have created a derivative work. That work needs to be transformative in order to be eligible for its own copyright, but being transformative alone is not enough to make it non-infringing.

There are four fair use factors. Transformativeness is only considered by one of them. That is not enough to make a fair use.

[–] [email protected] 1 points 1 year ago

"Transformativeness is only considered by one of them. That is not enough to make a fair use."

Somebody better let YouTube content creators know that. /s

[–] [email protected] 2 points 1 year ago

LLMs don't create anything new. They are limited to the data they were trained on, and every assumption they make is based on that data. They do not learn new things or present new ideas, only ideas that have already been expressed and are present in their training data.