synthesizing voices from voice clips

Help for Audacity on Windows.
Forum rules
ImageThis forum is for Audacity on Windows.
Please state which version of Windows you are using,
and the exact three-section version number of Audacity from "Help menu > About Audacity".


Audacity 1.2.x and 1.3.x are obsolete and no longer supported. If you still have those versions, please upgrade at https://www.audacityteam.org/download/.
The old forums for those versions are now closed, but you can still read the archives of the 1.2.x and 1.3.x forums.
Post Reply
Flamesofshadow
Posts: 3
Joined: Sat Jun 12, 2021 10:58 pm
Operating System: Windows 10

synthesizing voices from voice clips

Post by Flamesofshadow » Sat Jun 12, 2021 11:56 pm

Windows 10 Audacity version 3.0.2

This should be the correct section for this question, but if it Isn't I can move it elsewhere.

I wish to know if it is possible with or without plugins to take various Voice samples of a character (or multiple characters) and use Audacity to synthesize full voices that I could use to make a character say what I want with correct inflection and Phonetics when writing dialogs.

as an example of what I wish to do, I have multiple voice clips of two video game characters saying various things, I want to take all the clips to be analyzed/learned and be able to have voices be created so that I can write dialog and have the characters interact realistically without the painstaking process of going through every individual vocal clip and trying to form every word/expression by hand such as this very generic conversation:

"Hello George, are things going well?"
"Hey Charles, yea everything is going well, though I'm a bit tired lately."

Side note: If such a thing is possible, though unlikely, I also wonder if it is possible to somehow write the TTS/dialog in audacity in order to test this and perhaps have a setting to alter emotion/inflection so that if I write that a character suddenly sounds angry from being neutral or happy its still possible by simply changing the emotion in the "voice instead of needing a whole new voice per emotion

Trebor
Posts: 9953
Joined: Sat Dec 27, 2008 5:22 pm
Operating System: Windows 8 or 8.1

Re: synthesizing voices from voice clips

Post by Trebor » Sun Jun 13, 2021 9:20 am

Flamesofshadow wrote:
Sat Jun 12, 2021 11:56 pm
I wish to know if it is possible ... to take various Voice samples of a character (or multiple characters) ... to synthesize full voices that I could use to make a character say what I want...
LyreBird ... https://youtu.be/VnFC-s2nOtI

Flamesofshadow
Posts: 3
Joined: Sat Jun 12, 2021 10:58 pm
Operating System: Windows 10

Re: synthesizing voices from voice clips

Post by Flamesofshadow » Sun Jun 13, 2021 5:10 pm

I have tried lyrebird before, the issue is as far as i could find is that it doesnt work with audio clips, it has to hear the voice for up to a minute which many of the sound clips would not reach that length of time

steve
Site Admin
Posts: 81609
Joined: Sat Dec 01, 2007 11:43 am
Operating System: Linux *buntu

Re: synthesizing voices from voice clips

Post by steve » Sun Jun 13, 2021 6:18 pm

Flamesofshadow wrote:
Sun Jun 13, 2021 5:10 pm
I have tried lyrebird before, the issue is as far as i could find is that it doesnt work with audio clips, it has to hear the voice for up to a minute which many of the sound clips would not reach that length of time
Come back and ask again in a couple of years time. Technology may have moved on by then.
9/10 questions are answered in the FREQUENTLY ASKED QUESTIONS (FAQ)

Flamesofshadow
Posts: 3
Joined: Sat Jun 12, 2021 10:58 pm
Operating System: Windows 10

Re: synthesizing voices from voice clips

Post by Flamesofshadow » Sun Jun 13, 2021 8:58 pm

the main issue with lyrebird is that it requires at least a minute of constant speech, plus you have to say what's in their training prompt for the voice to work, but if you have 100+ 3-5 second voice clips of a character saying "Hello" or "alright" you would have to use audacity to individually splice together a bunch of the audio just to make a voice that could work with lyrebird and then project it back into a microphone (as far as I can tell it cant take already done audio).

if it didnt take so long to splice the audio together i wouldnt need lyrebird in the first place, thats why i figured someone may have come up with some sort of workflow or plugins for audacity that could help with splicing the audio into reasonably cohesive speech, or at least take the various phonetics from whatever audio i run through it and save that as a file so its easier to splice to then be used in lyrebird

Post Reply