How to remove voice from a music file?

Question

I've seen some methods (e.g. inverting one stereo channel and cancelling it out with the others) but I was wondering if someone has achieved how to do this relatively well.I was looking online for some AI solutions, but I couldn't find anything, would the collective knowledge of HN have an answer to this question?

unlinked_dll · Accepted Answer

The technical name for this problem is "blind source separation" [1] and is related to the "cocktail party effect" [2].
[1] http://www.mit.edu/~gari/teaching/6.555/LECTURE_NOTES/ch15_b...
[2] https://en.wikipedia.org/wiki/Cocktail_party_effect
Commercial solutions are out there, but imo/e they're not always that great and the problem is of prime interest for researchers both in industry and academia.

hnaccount141 · Answer

There's not currently a plug and play solution that will work well on all types of music, but there's a lot of research happening in this area. If you're interested in digging into some of the cutting edge source separation algorithms there's a great python library called nussl that provides implementations of many of them. https://interactiveaudiolab.github.io/nussl/

paulrpotts · Answer

Izotope RX can do a pretty credible job of this, although it depends a lot on the source file. This is a commercial product, though, so I'm not sure if it is what you are looking for. https://ask.audio/articles/isolating-separating-remixing-usi...

robbrown451 · Answer

Interestingly, many years ago I had this start happening with my car stereo. The vocals were mostly missing but the other instruments were there. When it got to the instumental solo, the main instrument was missing. This happened on 90% of songs.Turned out I had accidentally caused wires to come loose in the trunk, leaving the speakers wired in series, which caused the stereo channel cancelation effect.

hantusk · Answer

A few relevant links:https://www.celemony.com/ Celemony melodynehttps://www.youtube.com/watch?v=FMEk8cHF-OAhttps://www.youtube.com/watch?v=zL6ltnSKf9khttps://github.com/f90/Wave-U-Net

superfamicom · Answer

I have tried many options, and https://phonicmind.com/ has had the best results.

psychometry · Answer

I've always wondered where karaoke bars get vocal-free versions of seemingly every track that would ever get requested. Do record companies make them?

elamje · Answer

I believe this is only effectively possible, not fully possible. Inherently, music and the voice will share some of the same frequency samples, since they are discrete. I&rsquo;m sure you might be able to get a solution that works to the human ear, but I don&rsquo;t know that it&rsquo;s possible to perfectly strip out one or the other.

sachinsmc · Answer

https://github.com/deezer/spleeter worth checking once

ojm · Answer

I spent some time researching this (albeit in 2014) for my wife. Heres the best solution I could come up with at the time: https://ojm.co/blog/using-audacity-remove-vocals-audio-free/

abdulhaq · Answer

Voice is often at the centre of a stereo recording so invert the phase of one channel and combine?

sellingwebsite · Answer

There is a thread on the homepage that might be useful to you: https://news.ycombinator.com/item?id=21431071

timrichard · Answer

There are several products from Audionamix that might be worth checking out :https://audionamix.com/shop-adx/

LinuxBender · Answer

This is not a direct answer to the technical aspect of your question, but if you know who produced the music, they might give/sell you a version that is missing the vocal tracks.

monotoSTEREO · Answer

Those interested in removing voice from a music file may wish to check out the many resources available at my website, monotoSTEREO.info (https://www.monotostereo.info). There is also a companion Facebook page (https://www.facebook.com/monotostereo.info) where I post updates and related content. "Like" us on Facebook to follow the page for the updates! Be sure to check out the many examples on the MEDIA pages of the website!

person_of_color · Answer

Welcome to the rabbit hole that is the inverse problem.

conductr · Answer

I always wonder where DJs get the instrumental versions to sample/mix.

villmann · Answer

My date was sending me mixed signals, so I did a fourier analysis.

techload · Answer

What about the inverse, isolating only the voice?

pvtmert · Answer

if its like talking, fourier helps !

dx7tnt · Answer

This is a longstanding problem in audio engineering, along the lines of "how does one un-bake a cake to get the eggs?" There's always going to be artefacts and distortion ranging from unpleasant to extreme, hence audio engineers when mixing from stems/channels will do a stereo instrumental mix and a mix with vocals.