University of Hertfordshire

By the same authors

H-Semantics: A Hybrid Approach to Singing Voice Separation

Research output: Contribution to journalArticle

View graph of relations
Original languageEnglish
Pages (from-to)831-841
Number of pages11
JournalJournal of the Audio Engineering Society
Publication statusPublished - Oct 2012


The singing voice is the most prominent content of music tracks that can be described as songs. Separation from its music accompaniment is considered highly desirable in the field of music information retrieval, as it facilitates such applications as melody extraction, lyrics recognition, and singer identification. This paper presents investigations into effective methods for unsupervised separation of singing voice from stereophonic studio recordings. The work
involves the introduction of two novel time-domain procedures for music pruning and the integration of each of them with frequency-domain voice isolation, which is based on the enhancement of a previously established procedure. The performance of the complete system based on each of the above music-pruning methods is analyzed and measured using a set of experimental investigations. The outcomes clearly illustrate that the effectiveness in singing
voice separation can be considerably improved through the proposed approaches.

ID: 1303581