I can’t think of a good automated way of doing this, other than spending a lot of time developing a plug-in, but even then it’s not going to be perfect because there are too many “unknowns” to be able to reliably calculate spacing.
To do it semi-manually:
- Use the “StretchPause” experimental plug-in from this post (https://forum.audacityteam.org/t/modified-silence-marker-to-insert-more-silence/29449/12) to expand the gaps between words of the first language track
- Use “Silence Finder” (http://manual.audacityteam.org/man/silence_finder_and_sound_finder.html#silence) to label the gaps of the second language track,
- Then use “Edit Menu > Labeled Audio > Split” (http://manual.audacityteam.org/man/edit_menu_labeled_audio.html#split) to split the track up
- Then repeat for each other language track,
- Then use the “Multi-tool” (http://manual.audacityteam.org/man/multi_tool.html) combined with Shift or Ctrl to move the audio clips.
- Repeat step 5 for each language track
Step 5 is quite tricky until you get the hang of it:
- Click on an empty part of the info panel on the left end of the second track to select the second track
- F6 to switch to the Multi-Tool
- Move the cursor over the second audio clip (assuming no need to move the first audio clip)
- Hold down the Shift key. The cursor becomes a left pointing arrow.
- Left click. The left end of the selection jumps to the cursor position.
- Release the shift key and the mouse button
- Move the mouse pointer over the shaded (selected) area that contains all clips from the 2nd onward.
- Hold down the Ctrl button. The cursor becomes a double headed horizontal arrow (for “time shift”)
- Left click and drag to the right to line up the word in the second audio clip with the second word in the first track
- Repeat steps iii to ix for the rest of the track