Updated De-Clicker and new De-esser for speech

Here are the latest improvements to the plug-in I am developing for filtering out of wet mouth noises and clicks from my narrations. I also present a de-esser using similar methods that fell out as a by-product.

De-Clicker is designed for filtering of natural but undesirable noises that take a few milliseconds to decay, not for repairing spikey damage to a digitized signal. I did not intend this for music or vinyl repair, but I would like to hear whether it has unexpected other uses.

The default settings in both tools are the ones I favor now for treatment of whole tracks. Do you have different opinions about good default settings? Let me know.

I successfully treat most of the more resistant clicks by hand with a single pass and double the number of bands: NOT by lowering the dB threshold control. But such settings may be too slow and too aggressive for batch treatment. Yet the nice thing is that you do not have to select the interval around the click very narrowly. The surrounding signal is usually unaffected. So select it and fix it and move on, don’t fuss with zooming in and out.

I treat typically no more than a half hour at once. Compute time does unfortunately go nonlinearly with length of the track to some extent. Not sure why. (I’ll blame garbage collection.) But I have improved performance much over the previous version.

As mentioned below, do low-frequency rolloff (highpass filtering) before De-Clicker for slight improvement of some results.

Detailed advice about settings:

  • In either tool, choose Apply Changes to hear just the results. Or, choose Isolate Changes to hear only what is subtracted from your signal. You don’t want to hear a lot of “murmur” so that you almost understand the words: your settings are taking overtones out of your vowels and muffling the voice. You may notice that the attack of many consonants is somewhat affected, but you might judge that this is acceptable and sometimes even helpful.
  • If you duplicate a track, then isolate changes in one, you can listen to the tracks together to hear the repaired sound (the isolated clicks interfere destructively), or mute one to hear just the original or just the subtractions. You might silence corrections you don’t want before mixing. I don’t do any of this once I find settings I trust.
  • Both tools specify a range of frequencies and a number of frequency bands (of equal width in log-frequency) for detection and repair. Impatient for results? Use fewer bands, or raise the bottom of the range. I find it valuable to repair the low frequencies too in speech (certain lip-closing thumps and rattles in the sibilants get removed) but the lower frequency bands do take more computation time. De-esser performance is not too bad with many high frequency bands: but very few bands may give poor results, affecting vowels.
  • Both tools specify a step size for precision in identifying the intervals to repair. You might trade precision for speed.
  • Both tools specify another time period for cross-fading of repairs around each clicky interval. Make it too short and you introduce undesirable artifacts. This setting should not affect computation time significantly.
  • The De-Esser has a dB threshold which is the amplitude within each frequency band, to which, the band is trimmed back. -20dB is my default setting, and -20dB and above is also what appears white in spectrogram views with default settings.
  • De-Clicker has a threshold in dB which is not absolute, but relative: a band must rise so much over a short interval for a click to be detected. I repeat, subtle clicks may be treated better not with a lower threshold, but with more numerous and narrow bands.
  • De-Clicker specifies a minimum separation between clicks. Make it too small, and more clicks will be identified, but there may be too much damage to voice when it drops in pitch. There is also a dB threshold (for the entire signal) below which closer clicks may be detected. This may help clean up crackles in de-voiced intervals and breaths. Use low-frequency rolloff effects first for better results!
  • De-Clicker can specify a number of times to repeat treatment, and a maximum click length in steps, which need no explanation.
  • De-Clicker can also just make labels around clicks without repairing them. This information may be useful, for instance, if you want to find dB threshold setting just low enough to detect certain of the clicks.
    DeEsser.ny (55.6 KB)
    DeClicker.ny (70.5 KB)
2 Likes

What does Ian (MichloIW) think about it? He’s the one with the Wet Mouth Noises.

I love that phrase. Keep it in.

Koz

Haha, well thank you for that. :stuck_out_tongue:

I actually ran it on the test piece I uploaded for you all yesterday and unless I’m hearing things, much to my astonishment, appreciation and gratitude, it got rid of them!

I really needed and wanted to get some recording on the e-book done tonight so I ran through a new foreword and five chapters before my throat felt like sandpaper. I shall be processing them tomorrow (today) so that will be the real test and I shall let you know. :slight_smile:

Cheers.

Paul,

I think you have cracked it and should be vaulted to the exalted ranks of the other geniuses kindly giving us this program! :slight_smile:

I’ve only worked on my Foreword from last night’s recordings so far but listen to the difference!

https://onedrive.live.com/redir?resid=FF7251924FF7642D!40433&authkey=!AHZarwZY4wGeiZU&ithint=folder%2c

I’ll put more in there later.

Thank you so much. I think voice artists may start a religion in your name now. :wink:

Cheers.

Thank you. I gather you use the defaults. If you think you find better settings (for bands and frequency range, perhaps, reducing compute time without reducing quality of results) let me know.

Are you a MAD genius?

Why would I tinker with what works? Hehe. That’s for branier audiophiles than I (or for when this less brainy one has more time). :wink:

Cheers.

Paul

I came across this forum page and the information about your two plug-ins over the weekend.

I ran them through a couple of recent recordings that I have produced and they seem to me to have made a noticeable improvement - particularly in dealing with the number of ‘clicks’ that inevitably scatter themselves through my recordings!

I posted a link to and details of these plug-ins on the Librivox.org forum to see if anyone else would find them useful.

(Librivox is a site where volunteers record books and other publications that are in the Public Domain, which are then made freely available for downloading).

One question though is - what is the best way to give you feedback on these plug-ins?

If you’re up for receiving feedback, I could post the details on the Librivox forum if you want.

Anyway, thanks for putting in the time and effort to produce these plug-ins. I think that they could be really useful tools to have at hand for cleaning up recordings.

Cheers

Steve

Hello, Steve.

Well, you reached me here.

I have also very recently started the Audacity Voiceover Users’ Group on Facebook, specifically to discuss the technicalities of how to get the best results out of Audacity for spoken word. I don’t pretend to have all the answers… which is why I made the group so I can learn! Most of the members have come from the Audiobook Creation Exchange, but I would like to attract LibriVox people to it too.

Steve, I am trying to join the librivox forum. They seem vigilant about spammers so I’m not in yet. Haven’t heard back from your gatekeeper… update, just got in!

Sorry, newbie ;What program to use to open these plugins ? Thanks

jlatt, follow these instructions: http://wiki.audacityteam.org/wiki/Nyquist_Plug-ins

They go in the plugins folder inside the Audacity program folder.

I think this is the installer instructions.

http://wiki.audacityteam.org/wiki/Nyquist_Effect_Plug-ins

Koz

Hi Paul

There seem to be suggestions that the De-Esser is very good. Is the De-Esser also good for music vocals? Does it need more development? Would you want it to be published on the Audacity Wiki if so when?


Gale

From whom have you heard that?

I am not sure I’m doing a smartest things mathematically with this de-esser, but it seemed to drop out of the more elaborate de-clicker as a sort of simplified special case. As for the de-clicker, I am also not sure I am doing the smartest possible things there. It all runs at a snail’s pace. This is a free tool I offered to the patient.

Have you tried it yourself and formed an opinion?

Trebor speaks highly of your De-esser - he said it was better than Spitfish.

Not yet - I’ve no use for a de-esser most of the time. But tools don’t have to be mathematically perfect to be on the Wiki. And if they are on Wiki there is more chance of getting feedback from multiple use cases to improve them.


Gale

No joke!

Well I don’t really know what “real” de-essers do, but mine could be described as a multi-band brickwall limiter, with many narrow bands in the default settings, which account for the slowness but perhaps also for precision of results if you can wait for it.

I hear it said that “sibilance” is eq’d down at 7 or 8 kHz, but I find that s sounds in my speech recordings sometimes have unpleasant piercing whistles in them that are much lower, between 3 and 4 kHz. You can easily see such things in spectrogram. I wrote something that does the program equivalent of scanning the spectrogram for such bright bits, and then fixing each one precisely over the right interval with the right gain using the Nyquist eq-band function.

Hi Paul-L ,

No Joke : your de-esser is the best I’ve encountered, it allows far more precise control of sibilance than any other I’ve seen …

SpitFish can’t do that.

If it’s use is extended to processing stereo music then it will need the option of linking the stereo channels , rather than processing each channel independently , ( if they are not linked then there is an unwanted stereo flutter effect ).

Well thanks Trebor!

Your suggestion for stereo is a good one one. Have you observed that defect? I haven’t, I work only with narration, all mono, no mixing. Will you test improvements if I write them? Or send me a test case?

Attached is my own pet example of a bit I captured from a professionally produced audiobook that I enjoyed very much for story, but the sibilance was a knife in my ears sometimes. This especially painful s whistled at about 3400 as you see, not the 6kHz and above range. This was my acid test case as I developed. The “after” with default de-esser settings is right.

Your picture shows “multi-band brickwall limiting with many narrow bands” in action. Find just the white stuff and hammer it down. The danger is changing things too much and making the voices lisp. I have used this on enough hours of voice to believe it does not not.

Do use use default settings on the spectrogram and do you understand the colors? Every frequency band at -20 dB or higher appears white in default display settings. And -20 is exactly what I limit to in the tool’s default settings.
tracks015.jpg

Trebor, I see you use even more bands than I do and a higher top frequency. You must be very patient! My work has everything above 11.5 kHz stripped out by other parties so I don’t worry about that.

You probably know this: if you choose Spectrogram view, then the bands you specify do not correspond to equal divisions in screen height. But choose spectrogram log F instead – then they do correspond to equal heights. And that’s how a graphic equalizer’s sliders work too.

+1 to linked stereo. :slight_smile:

If there is a rule in the code that colours relate to a specific dB level at default display settings, that might be nice to put in the Manual. All we know there is that blue is least energy and red and white are most energy.


Gale