Referring to: https://manual.audacityteam.org/man/label_sounds.html
Threshold level (dB): (default -30 dB)
When audio is below this level, it is considered to be ‘silence’. The lower (more negative) this setting, the quieter the background level must be to be recognized as “silent”. If set below the track’s noise floor level, the entire track will be seen as one continuous sound.
So that “silence” is about -59 dB
and that silence is about -81 dB.
So we need to set the “Threshold level (dB)” a little bit higher than the silences. Let’s try -50 dB.
Threshold measurement: (default “Peak level”)
Peak level: The threshold measurement is based on the peak amplitude in each 10ms period.
We’ve been using peak amplitude measurements, so let’s leave this set to “Peak level”.
Minimum silence duration: (default 1 second)
When ‘silence’ of this duration (or longer) is found, preceding sound and following sound are considered to be separate sounds
So to detect gaps that are 0.303 seconds we need to set the “Minimum silence duration” to 0.303 seconds or less. Let’s try 0…25 seconds.
(When set to the default “1 second”, gaps less than 1 second are ignored)
Minimum label interval: (default 1 second)
Allows short sounds to be grouped within a label region. This ensures that labels will be at intervals of no less than this length. In effect this combines short sounds to create a group of sounds that is at least the specified length. Valid values are between 0.01 seconds to 2 hours.
Looking at your second screenshot (https://media.discordapp.net/attachments/842259121189158962/855884612240146442/unknown.png?width=1582&height=937), each of the sounds are longer than 1 second, so let’s leave this control set to 1 second so that every sound longer than 1 second will be labelled individually.
Label type: > (default: Region around sounds) Sounds / silences are labelled either with point labels or region labels.
Point before sound This option places a point label before each detected sound or group of sounds.
You can choose whichever you want to suit your needs, but for now let’s leave this as “Region around sounds”.
Maximum leading silence: > (default: 0 seconds)
When labeling sounds, a point label, or the start of a region label will be placed before the beginning of the sound by up to this amount.
So this option provides an offset for the position of each label. Let’s leave this at zero for now.
If you want to shift the start of each the labels a little bit earlier so that there is 0.1 seconds of space included in the labelled region before the start of the sound, then you could set this to 0.1 seconds.
Maximum trailing silence: > (default 0 seconds)
This setting is used by region labels only.
When labeling sounds, the end of a region label will be placed this distance after the end of a sound, provided that there is room to do so before the next sound.
This can provide an offset for the end of each region label. As the sounds that we are labelling most end with a gradually fading tail, let’s allow 0.1 seconds for the end of the tail. Set this to 0.1 seconds.
Label text: (default “Sound ##1”)
This is the text that will be entered in each label.
This can be customised to suit your needs, but let’s leave this at defaults for now.
If you do the above, the settings should look like this: