This has been a known issue in Audacity for years:
When you record in Mono, your single input (like a mic) is recorded at half volume, if your input device has two channels.
This is a minor annoyance when you have a 2-channel input device. It’s even described in the Audacity manual here:
But on a many-channel input device, it makes mono recording totally unusable. For example on the Motu 624, which has 24 channels, your single input is recorded at 1/24 volume. See here:
But in testing today… it’s even worse than just low volume.
If you have MULTIPLE mics plugged into your interface, and pick “mono” to only record the first one… well, ALL mics are recorded and mixed together into one track.
This is probably not what most people would ever want to do.
Most people don’t notice it, because they only have one mic plugged in.
But it’s definitely not good or sensible behavior.
Even if Audacity wants to support a “mix every channel together as Mono” mode, that should be a separate entry on the channel-select list, along with, well… just recording Channel 1 at full volume.
It’s also not clear to me if this is really an Audacity problem, or something deeper with Pulse or ALSA. For example, I have used the command-line utility arecord, and at least at one point, the same problem was present there. Also, when using Zoom for calls, I found that the 24-channel hardware was unusable, because the one mic channel would be mixed down to 1/24 of its actual loudness.
However, I just tried arecord on a new version of Ubuntu, and it doesn’t seem like it’s mixing multiple mics together in the same way as Audacity.
I have the Audacity source code here, and I’m building it now… but before I dig in, is there any institutional memory about why Audacity works this way, or what the source of the problem is?