서지주요정보
선택적 사영과 심리 음향 모델에 기반한 부공간 잡음 제거 방법 = Subspace speech enhancement based on selective projection method and psychoacoustic model
서명 / 저자 선택적 사영과 심리 음향 모델에 기반한 부공간 잡음 제거 방법 = Subspace speech enhancement based on selective projection method and psychoacoustic model / 김종욱.
발행사항 [대전 : 한국과학기술원, 2002].
Online Access 원문보기 원문인쇄

소장정보

등록번호

8013564

소장위치/청구기호

학술문화관(문화관) 보존서고

MEE 02106

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

A novel subspace speech enhancement algorithm based on both selective projection and perceptual psychoacoustic model is proposed. A subspace speech enhancement method suggested by Ephraim et al. functions in Khrunen-Loeve transform (KLT) domain where the eigenvector of the covariance matrix of a given signal forms the bases. These bases in the KLT domain are considered to be optimal in terms of energy compaction, and for this reason , subspace enhancement method has been known to perform better than those obtained in the Fourier domain, such as spectral subtraction and Wiener filter method. Although subspace method has been successful in reducing noise while minimizing the signal distortion, the main disadvantage is that it works only for additive white noise. So, for colored noise, one possible choice is to use whitening filter before the subspace method. However as pointed out in the literature, the use of whitening filter does not guarantee the noise shaping by which residual noise spectrum is masked by clean speech. To overcome this noise shaping problem, noise projection method is used. In the noise projection method, the noise is projected onto each eigenvectors obtained from the neighborhood of the current analysis frame. In some cases, however, it performs better to project the samples of the current analysis frame onto the eigenvectors obtained from noise covariance. The ultimate method which we will call the selective projection method is to select between the two methods using appropriately defined condition number. The subspace approach based on selective projection method gives optimum KLT matrix according to the type of noise. However, there still remains unwanted residual noise after processing. To suppress this residual noise, the perceptual psychoacoustic model is used in this paper. According to the theory of psychoacoustics, the human ear cannot perceive noise whose energy lies under the masking threshold. So by adaptively adjust the parameter used in the subspace method using the masking threshold, the residual noise can be suppressed more while minimizing the signal distortion. By incorporating the selective projection and psychoacoustic model in the subspace approach, the proposed algorithm shows an improvement in the performance. A preference test as a subjective measure shows that the proposed method using psychoacoustic model performs better than the ones which does not incorporate psychoacoustic model. Segmental signal-to-noise ratio (SNRseg) as objective measures also shows better performances than the ones mentioned above.

서지기타정보

서지기타정보
청구기호 {MEE 02106
형태사항 vi, 41 p. : 삽화 ; 26 cm
언어 한국어
일반주기 부록 : 상태 지수의 추정
저자명의 영문표기 : Jong-Uk Kim
지도교수의 한글표기 : 유창동
지도교수의 영문표기 : Chang-D. Yoo
학위논문 학위논문(석사) - 한국과학기술원 : 전기및전자공학전공,
서지주기 참고문헌 : p. 40-41
QR CODE

책소개

전체보기

목차

전체보기

이 주제의 인기대출도서