method of extracting TV show music

My newest voice removal uses STFT (or FFT) to preserve the stereo character of the Audio.
https://forum.audacityteam.org/t/karaoke-rotation-panning-more/30112/1
You could - in principle - apply the effect on different versions of the song and then mix and render them (with proper set gain of course).
Or you go the other way and mix all left channels and all right channels together, re-combine them to a stereo track and do the center removal then.
There’s also the possibility to crossfade between the tracks to gather the best parts of each version.
You can first try with the normal voice removal to see how much unwanted sound effects are removed.