How to remove voice from a music file?
I've seen some methods (e.g. inverting one stereo channel and cancelling it out with the others) but I was wondering if someone has achieved how to do this relatively well.
I was looking online for some AI solutions, but I couldn't find anything, would the collective knowledge of HN have an answer to this question?
There's not currently a plug and play solution that will work well on all types of music, but there's a lot of research happening in this area. If you're interested in digging into some of the cutting edge source separation algorithms there's a great python library called nussl that provides implementations of many of them. https://interactiveaudiolab.github.io/nussl/
Interestingly, many years ago I had this start happening with my car stereo. The vocals were mostly missing but the other instruments were there. When it got to the instumental solo, the main instrument was missing. This happened on 90% of songs.
Turned out I had accidentally caused wires to come loose in the trunk, leaving the speakers wired in series, which caused the stereo channel cancelation effect.
I've always wondered where karaoke bars get vocal-free versions of seemingly every track that would ever get requested. Do record companies make them?
I believe this is only effectively possible, not fully possible. Inherently, music and the voice will share some of the same frequency samples, since they are discrete. I’m sure you might be able to get a solution that works to the human ear, but I don’t know that it’s possible to perfectly strip out one or the other.
Voice is often at the centre of a stereo recording so invert the phase of one channel and combine?
This is not a direct answer to the technical aspect of your question, but if you know who produced the music, they might give/sell you a version that is missing the vocal tracks.
Those interested in removing voice from a music file may wish to check out the many resources available at my website, monotoSTEREO.info (https://www.monotostereo.info). There is also a companion Facebook page (https://www.facebook.com/monotostereo.info) where I post updates and related content. "Like" us on Facebook to follow the page for the updates! Be sure to check out the many examples on the MEDIA pages of the website!
Welcome to the rabbit hole that is the inverse problem.
I always wonder where DJs get the instrumental versions to sample/mix.
My date was sending me mixed signals, so I did a fourier analysis.
What about the inverse, isolating only the voice?
if its like talking, fourier helps !
This is a longstanding problem in audio engineering, along the lines of "how does one un-bake a cake to get the eggs?" There's always going to be artefacts and distortion ranging from unpleasant to extreme, hence audio engineers when mixing from stems/channels will do a stereo instrumental mix and a mix with vocals.