I’m wondering about the following:
Suppose I have a recording of a single note (e.g. sung by a human).
Over the course of the single note there’s a (very irregular) change in
perceived volume (obvious from the signal when looked at from a suitable zoom factor).
I would like to even out the perceived volume (without “changing the spectrum”, i.e. keeping the same overall sound).
I expect something as follows should be possible, but I have no idea how to go about it:
- for a given selection of audio: detect
1.a. an envelope describing the change in volume over time (using some low-pass filter?)
1.b. an average perceived volume of the signal (average of absolute value?) - use the information in 1.a. and 1.b. to amplify the audio so that the perceived amplitude variations are removed
(at every moment in time the perceived volume should be the average perceived volume of the original signal).
Manually using the envelope editor is not really an option in this case. I think an automatic solution should be possible.
Thanks for any insights you may have!