I am confused, is this or ain't this part of the Audacity distribution? I still can't find it in the latest svn repository.
I meant the CMU libraries documentation that includes naturally the libs of Standalone Nyquist.
Does this mean, you are using a gain function with two segments and a hard knee, just like the built in compressor, but the threshold and slope vary adaptively to regions of input?
The Rms vector is iteratively searched for the longest segments that ar above the threshold (e.g. -20 dB) and this section will be attenuated by 1 dB. It then begins all anew.
In this fashion, whole phrases are corrected, followed by words and syllables.
The slope is at the moment just one frame/block length (linear transition).
Have you shared the code for these experiments?
No, it is still under development.
The code is relatively slow because the input is alternatively analysed and multiplied by the correction envelope.
Of course, one could work with the initial Rms vector alone.
For the time being, I want to explore different block lengths, for example vary them by the golden ratio, in order to have the frame boundaries optimally distributed.
Also, since start and ending have a short taper, the Rms value of the analysed audio can be different from the calculated one (0.5 dB cut at those points).
The nice thing of this random access method is that you can make an inverted copy and run the effect on it. All will be silent, except the corrected expressions.
It also works with peaks although they are generally very short.
Robert J. H. wrote:In my opinion, the greatest improvement would be to let the plug-in find the optimal settings by itself e.g. by the means of Hidden Markoff Models.
Of course, I'm always refering to single-voice compression and not music. Because only the former offers the chance to employ a general energy distribution, i.e. a modelled histogram.
The user would thus only set the target values, e.g. -3 dB peak, -20 dB Rms, -60 dB noise floor and a general compression ratio or a style preset.
This would make a lot of narrators happy. It would take the guesswork out of hitting ACX guidelines.
Yes, indeed.
My histograms show a relatively constant shape for different audio books. The trick will be to combine peaks and Rms in a meaningful manner.
It is also thinkable to use some filter curves as well, e.g. threshold of hearing for the noise floor calculation.
Lots of interesting thought experiments.