<<<If I put one of the tracks out of phase (either voice or music), maybe I could get more separation.>>>
Yes, but no. Don’t do that. Straight delay on one channel will cause the voice to cancel out in mono playback systems. Leave the actor where he is.
Jury’s out on the music, but I do know one technique. FM stereo radio does not broadcast Left and Right. They broadcast Left plus Right as a mono channel and then a helper sound channel with Left minus Right. L-R is magic. It’s also called the separation channel. It literally tells the main channel (the mono music), where the brass is left to right (usually right). Where the violins are left to right (usually left). As you decrease the influence of the separation channel, the stereo image of the show slowly collapses to straight mono. One of the things that happens when you drive under a very long tunnel is the separation channel gets noisy and the radio switches to much quieter and robust mono. Reverse when you drive out of the tunnel.
The effect in the car is that the orchestra collapses to a small point just to the left of the GPS receiver on the dashboard and then spreads back out again.
Nowhere is it written that you can’t increase the influence of the separation channel and accentuate the “spread” of the music. This is serious channel management in Audacity and the arithmetic is making my head hurt, but that’s a way to manage separation without serious – or really any – damage.
<<<I think there’s frequency combat between music and voice. I think the music track is too fat in the vocal frequencies. Maybe the way to fix this (aside from using different music) is to eq the music down in the vocal range.>>>
That would be Fletcher-Munson, yes. The “loudness” control on your music system.
http://www.webervst.com/fm.htm
Note that most of the effect is around 3000 where your ear reacts most strongly to sounds. Here, let me run my fingernails down this blackboard…
<<<I tried the echo, but it seemed like either it was too low to be perceptible, or it was too much. >>>
Even the more advances packages – Gverb, et al. – pretty much suck. I’m going to attribute that to apparent simplicity. You would think you could merely repeat the initial sounds with delay and decay and have it done. If you’ve ever seen the echo analysis of your bathroom, you would have to lie down with a cold towel on your head. It’s enormously complex with asymptotic decay parameters, etc. etc. One of the reasons there’s no such thing as software that can remove echoes. Same problem in reverse.
Once again we should leave the voice actor pretty much alone.
[listening to track]
You’re creeping significantly past the point where I can help. Any one of these is a pleasing, presentable product and it’s up to you to decide which you like. Enough engineering, now you need to wear your producer’s hat.
“A great work is never finished, merely abandoned.”
Koz