Let's say I have a waveform, called wavAZ, and multiple others which are smaller, i.e wavBD, wavHL, wavPT, wavWY and each of these smaller waves are somewhat similar to certain parts of wavAZ, what can I do to find out the timings where they occur along wavAZ? What are the existing tools/techniques that can help in this? Find timings tBDs tBDe, tHLs tHLe, tPTs tPTe, and tWYs tWYe.
Just trying to automatically generate timings for a bouncing ball karaoke lyrics thing.
See illustration below:
https://drive.google.com/file/d/0B158KX ... RNUG8/view
Automatically generate timings for a bouncing ball karaoke
Forum rules
If you require help using Audacity, please post on the forum board relevant to your operating system:
Windows
Mac OS X
GNU/Linux and Unix-like
If you require help using Audacity, please post on the forum board relevant to your operating system:
Windows
Mac OS X
GNU/Linux and Unix-like
-
goespro.tocall
- Posts: 6
- Joined: Wed Jul 27, 2016 9:36 am
- Operating System: Windows 10
-
kozikowski
- Forum Staff
- Posts: 68902
- Joined: Thu Aug 02, 2007 5:57 pm
- Operating System: macOS 10.13 High Sierra
Re: Automatically generate timings for a bouncing ball karao
I don't know any way to make Audacity do pattern or content recognition. It's harder than you think. Digital systems don't do "close." Unless the match is exact, a comparison system will fail. That means no noise, no room sounds, no digital errors and no competing music.
The editing people want us to do matching.
"Help me automatically find the trumpet sound so I don't have to go looking for it."
Not so far.
Koz
The editing people want us to do matching.
"Help me automatically find the trumpet sound so I don't have to go looking for it."
Not so far.
Koz
-
goespro.tocall
- Posts: 6
- Joined: Wed Jul 27, 2016 9:36 am
- Operating System: Windows 10
Re: Automatically generate timings for a bouncing ball karao
Awww shuck! I thought this would be possible given the present state of voice recognition technologies.
Can we tackle this from the wave analysis approach? Look for clusters of higher amplititudes and see if its a spoken syllubus?
Can we tackle this from the wave analysis approach? Look for clusters of higher amplititudes and see if its a spoken syllubus?
Re: Automatically generate timings for a bouncing ball karao
That's the problem. There's about one company that has "good" speech recognition, and a handful of other companies (including Microsoft) that are trying to catch up with them. There's not yet really good open source speech recognition, though this is improving. Speech recognition is very complicated for computers. Perhaps in 10 years time there will be a good open source speech recognition library.goespro.tocall wrote: Awww shuck! I thought this would be possible given the present state of voice recognition technologies.
Yes we can look for clusters of higher amplitude frequencies. You can see them as bright bits in the spectrogram view of a track (http://manual.audacityteam.org/man/spec ... _view.html). The difficult part is being able to tell if a particular cluster of frequencies is someone saying "oh" or "ah" or the sound of a car horn, or a crow.goespro.tocall wrote: Can we tackle this from the wave analysis approach? Look for clusters of higher amplititudes and see if its a spoken syllubus?
Someone on the forum made an experimental plug-in that would try to distinguish between vowel sounds and consonants. It mostly worked when used on a good quality voice recording. I'll post a link if I can find it.
9/10 questions are answered in the FREQUENTLY ASKED QUESTIONS (FAQ)
Re: Automatically generate timings for a bouncing ball karao
Here: http://forum.audacityteam.org/viewtopic ... 72#p210572steve wrote:Someone on the forum made an experimental plug-in that would try to distinguish between vowel sounds and consonants. It mostly worked when used on a good quality voice recording. I'll post a link if I can find it.
9/10 questions are answered in the FREQUENTLY ASKED QUESTIONS (FAQ)