Movies and TV Shows

119 readers

2 users here now

General discussion about movies and TV shows.

Spoilers are strictly forbidden in post titles.

Posts soliciting spoilers (endings, plot elements, twists, etc.) should contain [spoilers] in their title. Comments in these posts do not need to be hidden in spoiler MarkDown if they pertain to the title's subject matter.

Otherwise, spoilers but must be contained in MarkDown as follows:

::: your spoiler warning
the crazy movie ending that no one saw coming!
:::

Your mods are here to help if you need any clarification!

Subcommunities: The Bear (FX) - [[email protected]](/c/thebear @lemmy.film)

Related communities: [email protected] [email protected]

founded 1 year ago

MODERATORS

[email protected]

158

Striking actor Stephen Fry says his voice was stolen from the Harry Potter audiobooks and replicated by AI (fortune.com)

submitted 1 year ago by [email protected] to c/[email protected]

12 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 9 points 1 year ago* (last edited 1 year ago) (4 children)

Modern models need about 5sec of audio to replicate a voice. The days when you needed a large amount of audio for replication are long gone. Same for faces by the way, the original Deepfake needed hundred of images and hours of training, now you can do it with as little as a single good image instantly. Software to automatically clone the voice, translate the audio into another language and adjust the lip motion exists as well, again without any lengthy training or material, just needs the clip you want to change.

Where the whole thing gets interesting is in remixing. If you stolen Stephen Fry, sure that's bad and there might be laws against it. What if you remix Stephen Fry and Patrick Steward into a brand new AI-persona? What if you remix a Stephen Fry sound-alike out of other peoples voices without ever touching his voice?

This whole issue gets very blurry very fast.

[–] [email protected] 2 points 1 year ago (1 children)

Do you have a reference for the 5secs of audio? And what would the quality of the ai be? I mean it would have to infer a lot of training just given 5 seconds of audio.

[–] [email protected] 5 points 1 year ago (1 children)

[–] [email protected] 2 points 1 year ago

Here is an alternative Piped link(s):

Translation with HeyGen

Piped is a privacy-respecting open-source alternative frontend to YouTube.

I'm open-source, check me out at GitHub.

load more comments (2 replies)