
Multilingual Abstract

In this paper, we propose a novel cross-correlation-based double-talk detection (DTD) method that employs a Gaussian mixture model (GMM) in the frequency domain. The proposed algorithm transforms the cross-correlation coefficient used in the time domain into 16 channels in the frequency domain using the discrete Fourier transform (DFT). Seven of these channels are then selected as the feature vector for the GMM, and three regions (far-end speech only, double-talk, and near-end speech only) are identified by comparing likelihoods computed from those features. The presented DTD algorithm detects double-talk regions efficiently without the voice activity detector (VAD) used in conventional cross-correlation-based double-talk detection. The performance of the proposed algorithm is evaluated under various conditions and yields better results than the conventional schemes. In particular, it is robust against detection errors caused by background noise or echo path changes, which are among the key issues in practical DTD.
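
The likelihood comparison described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the diagonal-covariance form, the two-component mixtures, the label names, and every numeric parameter in `MODELS` are assumptions made for the example. In practice the GMM parameters would be trained on labeled frames rather than written by hand.

```python
import math

DIM = 7  # seven selected frequency-channel features, as stated in the abstract


def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(x, gmm):
    """Log-likelihood of x under a GMM given as a list of (weight, mean, var)."""
    terms = [math.log(w) + log_gaussian_diag(x, m, v) for w, m, v in gmm]
    peak = max(terms)  # log-sum-exp trick for numerical stability
    return peak + math.log(sum(math.exp(t - peak) for t in terms))


def classify_frame(x, models):
    """Return the label whose GMM assigns x the highest log-likelihood."""
    return max(models, key=lambda label: gmm_log_likelihood(x, models[label]))


# Hypothetical toy models (assumed values, not taken from the paper).
MODELS = {
    "far_end": [(0.5, [0.9] * DIM, [0.01] * DIM), (0.5, [0.8] * DIM, [0.02] * DIM)],
    "double_talk": [(0.5, [0.5] * DIM, [0.02] * DIM), (0.5, [0.4] * DIM, [0.03] * DIM)],
    "near_end": [(0.5, [0.1] * DIM, [0.01] * DIM), (0.5, [0.2] * DIM, [0.02] * DIM)],
}
```

Calling `classify_frame(features, MODELS)` on a seven-element feature vector returns one of the three labels. Because the decision is a comparison among class likelihoods, no fixed threshold on the correlation coefficient is involved.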


Korean Abstract

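This paper proposes a new double-talk detection (DTD) algorithm based on a Gaussian mixture model (GMM) in the frequency domain. Specifically, to build a DTD algorithm for frequency-domain acoustic echo suppression (AES), the cross-correlation coefficient used in conventional time-domain double-talk detection is transformed into 16 frequency-domain channels via the discrete Fourier transform. Of these frequency-domain cross-correlation coefficients, the seven best are selected on the basis of their statistical classification characteristics so that the GMM can be constructed more effectively. A GMM, which performs well in pattern recognition, is built from these feature vectors, and the far-end-only, double-talk, and near-end-only regions are classified by likelihood comparison. The resulting algorithm is robust to noisy environments and echo path changes and needs no separate voice activity detector (VAD) for the far-end signal. In various experiments, the proposed method outperformed, in terms of detection error probability, the conventional hard-decision method, which detects double-talk regions by comparing the cross-correlation coefficient against a fixed threshold.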
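
One way the 16-channel frequency-domain cross-correlation coefficients might be computed is sketched below. The abstract does not give the exact definition, so the normalized per-channel form, the equal-width grouping of DFT bins, and the indices in `SELECTED_CHANNELS` are all assumptions made for illustration. The function and variable names are hypothetical as well.

```python
import cmath
import math

NUM_CHANNELS = 16  # channel count stated in the abstract
# Hypothetical choice of seven channels; the paper selects them from
# statistical classification characteristics that are not given here.
SELECTED_CHANNELS = [1, 2, 3, 5, 7, 9, 12]


def dft(frame):
    """Naive O(N^2) discrete Fourier transform, adequate for short frames."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def channel_correlations(far_end, mic, num_channels=NUM_CHANNELS):
    """Normalized cross-correlation magnitude per frequency channel.

    Assumed form: the lower half of the DFT bins is split into equal-width
    channels, and each channel yields |sum X Y*| / sqrt(sum |X|^2 * sum |Y|^2).
    The frame length must be at least 2 * num_channels.
    """
    x_spec, y_spec = dft(far_end), dft(mic)
    width = (len(far_end) // 2) // num_channels
    coeffs = []
    for c in range(num_channels):
        bins = range(c * width, (c + 1) * width)
        cross = sum(x_spec[k] * y_spec[k].conjugate() for k in bins)
        power_x = sum(abs(x_spec[k]) ** 2 for k in bins)
        power_y = sum(abs(y_spec[k]) ** 2 for k in bins)
        denom = math.sqrt(power_x * power_y)
        coeffs.append(abs(cross) / denom if denom > 0.0 else 0.0)
    return coeffs


def feature_vector(far_end, mic):
    """Keep only the (assumed) selected channels as the GMM input."""
    coeffs = channel_correlations(far_end, mic)
    return [coeffs[i] for i in SELECTED_CHANNELS]
```

Under this assumed definition, a microphone frame that is only a scaled copy of the far-end frame gives coefficients close to 1 in every channel, while added near-end speech pulls them down. That separation is what a per-class likelihood model can exploit.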

