Would this be too complicated?
If it is, then which features would you wish to retain/discard?
Controls:
- Sound/Silence threshold (dB): The detection threshold below which audio is considered "silent" and above which "sound".
- Min detected silence (seconds): Silences shorter than this are ignored.
- Min detected sound (seconds): Sounds shorter than this are ignored.
- What to label: Choice - Sounds / Silences.
- Can label either sounds or silences.
- Where to label: Choice - Before Start / Middle / After End / Region.
- Can produce point labels:
- before the start of the detected region,
- in the middle of the detected region,
- after the end of the detected region.
- or can label the region (region label).
- Can produce point labels:
- Before Start max offset (s): If a label is placed before the start of a sound/silence, this sets the maximum time (seconds) before the start. Detected regions are not allowed to overlap and the first label cannot be before the start of the selection.
- After End max offset (s): If a label is placed after the end of a sound/silence, this sets the maximum time (seconds) after the end. Detected regions are not allowed to overlap.
- Label text (optional): If not empty, each label will have this label text.
- Number from (optional): If not empty, each label will start with a number, counting up from this number. Leading zeros are honoured.