A Performance Analysis on Matrix Factorisations in Solving Musical Instrument Source Separation Problem
Main Article Content
Abstract
Audio signal decomposition breaks a mixture of musical instrument audio signals into its fundamental musical instrument components. Machine learning is one of the methods widely used in audio signal decomposition. However, the limitation of computer hardware and the complexity of the algorithm may cause the computational speed of machine learning to deteriorate. This paper aims to use the contemporary matrix factorisation to extract the fundamental musical instrument audio signal component from the mixture of musical instrument audio signals. We choose nine contemporary matrix factorisation techniques and compare their performance in separating the mixture of musical instrument audio signals. We create five scenarios with different melody complexity to test the matrix factorisation techniques. Based on the Signal-to-Noise Ratio, Nonnegative Matrix Factorisation with Kullback-Leibler Divergence (NMF-KL) is the best separation performance when the monotonic noise is not added to the mixture of musical instrument audio signals. Initially, NMF-KL has good separation when monotonic noise is added to the simple recurring mixture of musical instrument audio signals, but as the melody complexity increases the NMF-KL separation performance starts to deteriorate. Lastly, the matrix factorisation techniques do not work well when the white noise is added to the mixture of musical instrument audio signals.
Article Details
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
AJSE contents are under the terms of the Creative Commons Attribution License. This permits anyone to copy, distribute, transmit and adapt the worknon-commercially provided the original work and source is appropriately cited.