this post was submitted on 26 Jul 2023
291 points (94.2% liked)
Technology
59107 readers
3248 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Well, if OpenAI knowingly used pirated work, that's one thing. It seems pretty unlikely and certainly hasn't been proven anywhere.
Of course, they could have done so unknowingly. For example, if John C Pirate published the transcripts of every movie since 1980 on his website, and OpenAI merely crawled his website (in the same way Google does), it's hard to make the case that they're really at fault any more than Google would be.
well no, because the summary is its own copyrighted work
Right, but not one the author of the book could go after. The article publisher would have the closest rights to a claim. But if I read the crib notes and a few reviews of a movie... Then go to summarize the movie myself... That's derivative content and is protected under copyright.
The published summary is open to fair use by web crawlers. That was settled in Perfect 10 v Amazon.
Haven't people asked it to reproduce specific chapters or pages of specific books and it's gotten it right?
I haven't been able to reproduce that, and at least so far, I haven't seen any very compelling screenshots of it that actually match. Usually it just generates text, but that text doesn't actually match.