This study proposes a hybrid framework for separating hazardous sounds from single-channel mixed audio signals by combining dimensionality reduction based on Non-negative Matrix Factorization (NMF) with K-medoids clustering. The method decomposes the ...
This study proposes a hybrid framework for separating hazardous sounds from single-channel mixed audio signals by combining dimensionality reduction based on Non-negative Matrix Factorization (NMF) with K-medoids clustering. The method decomposes the spectrogram of a mixed signal into low-rank basis components and subsequently groups them using a distance-based clustering scheme designed to enhance the separation of hazard-related spectral patterns. After reconstructing cluster-specific signals, the separated audio is evaluated using the Signal-to-Distortion Ratio (SDR) to quantify separation quality. To further verify the practical effectiveness of the proposed pipeline, Mel-Frequency Cepstral Coefficients (MFCCs) extracted from the separated signals are passed to a Convolutional Neural Network (CNN) classifier to identify the underlying hazard sound classes. Experimental results demonstrate that the distance-driven clustering approach improves separation performance and that the post-separation CNN classification achieves meaningful recognition accuracy, confirming the viability of the overall method for real-world hazard sound analysis