한국과학기술원 도서관

서지주요정보
Adaptive filtering methods for acoustic noise reduction and noisy speech recognition = 음향 잡음 제거 및 잡음 음성 인식에 적합한 적응 필터 방법
서명 / 저자	Adaptive filtering methods for acoustic noise reduction and noisy speech recognition = 음향 잡음 제거 및 잡음 음성 인식에 적합한 적응 필터 방법 / Hyung-Min Park.
발행사항	[대전 : 한국과학기술원, 2003].
Online Access	원문보기 원문인쇄

소장정보

등록번호

8014763

소장위치/청구기호

학술문화관(문화관) 보존서고

DEE 03061

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

Although adaptive filtering has been successfully used in various fields, it still needs to satisfy more strict requirements to extend its applications. In this dissertation, adaptive filtering was studied to develop several approaches which overcome problems with very long adaptive filter lengths such as unsatisfactory performance, slow convergence, and too much computational demands to achieve real-time processing. The approaches also include methods which surpass conventional approaches to independent component analysis (ICA) and an efficient method which is designed for noisy speech recognition. In order to prove their effectiveness, adaptive noise canceling (ANC) and blind source separation (BSS) are served as benchmarking problems. First, an ANC algorithm based on ICA is derived. Although the most widely used least-mean-square (LMS) algorithm removes noise components based on second-order correlation, there may exist many components which depend on a reference signal through higher order statistics. The ICA-based approach can take these components into account and provided much higher signal-to-noise ratios (SNRs) in the system output than the conventional LMS algorithm. Second, transform-domain adaptive filtering (TDAF) is considered as a method to enhance convergence rates. Stochastic gradient algorithms including the LMS algorithm and the ICA-based algorithm show slow convergence speed especially for colored input signals. TDAF can speed up convergence by pre-whitening input data using unitary transform and improved convergence speed of the ICA-based approach. Third, a uniform filter bank approach is presented. Decimation in a filter bank provides faster convergence rates by making input signals more whitened, and it reduces computational complexity by performing adaptive filtering at a subsampled sampling rate and with much shorter adaptive filters in each subband. As an approach to ICA, the filter bank approach achieved much better performance than the frequency domain approach, and it gained faster convergence speed than the time domain approach with less computational complexity. Fourth, a Bark-scale filter bank is applied to the filter bank approach. Roughly speaking, most of natural signals have exponentially or more steeply decreasing energy as the frequency increases. Therefore, the uniform filter bank approach shows relatively slower convergence in low frequency subbands than high frequency subbands, and it may not have sufficient data to estimate adaptive filters exactly in high frequency subbands. Experimental results showed that the Bark-scale filter bank approach provided faster convergence speed and higher SNRs or perceptual evaluation of speech quality (PESQ) scores than those of the uniform filter bank approach. Finally, a mel-frequency cepstral coefficient (MFCC) extraction method is proposed, which efficiently obtains a clean speech feature using adaptive filtering. Although a conventional MFCC extraction method utilizes the Fourier transform, this method directly uses subband outputs of the filter bank approach using mel-scale filter banks without reconstruction. Recognition experiments showed that this method could be used as a successful multi-microphone approach to noisy speech recognition.

주위 환경에 따라서 적절한 반응을 나타낼 수 있는 적응 필터 방법은 과거에 다양한 분야에서 성공적으로 사용되어 왔지만, 그 응용분야를 더욱 넓히기 위해서는 더욱 엄격한 요구 사항들을 충족시켜야 할 것이다. 본 논문에서는 이러한 적응 필터에 관한 연구를 통해서 매우 긴 적응 필터를 사용할 때 발생하는 불완전한 성능이나 느린 수렴 속도 그리고 실시간 처리가 불가능한 많은 계산량과 같은 문제점들을 극복하기 위한 몇 가지 방안을 제안한다. 또한, 이 방안에는 기존의 독립 성분 분석 (ICA) 기법보다 뛰어난 특성을 나타내는 방법과 잡음 환경에서의 음성 인식에 적합한 효율적인 특징 추출 방법도 포함되어 있다. 본 논문에서는 적응 잡음 제거 (ANC)나 암묵 신호 분리 (BSS) 문제에 이러한 방안들을 적용하여 그 효과를 입증하였다. 첫째, 독립 성분 분석에 기반한 적응 잡음 제거 알고리즘을 유도하였다. 적응 잡음 제거에 가장 일반적으로 사용하는 최소 평균 제곱 (LMS) 알고리즘은 이차 통계적 특성을 이용하여 잡음 성분을 제거함에도 불구하고 고차 통계적 특성에 의해서 기준 신호에 관계되어 있는 많은 성분들이 존재할 수 있다. 독립 성분 분석에 기반한 적응 잡음 제거 알고리즘은 이러한 관계를 고려할 수 있으며, 출력 신호에서 기존의 최소 평균 제곱 알고리즘보다 더 높은 신호 대 잡음 비 (SNR)를 나타내었다. 둘째, 수렴 속도를 개선하기 위한 방안으로 변환 영역 적응 필터 (TDAF) 방법을 사용하였다. 최소 평균 제곱 알고리즘이나 독립 성분 분석에 기반한 알고리즘 같은 확률 경사법 (stochastic gradient algorithm)은 특히 유색(colored) 입력 신호에 대해 느린 수렴 속도를 보인다. 변환 영역 적응 필터 방법은 유니터리 변환 (unitary transform)을 사용하여 입력 데이터를 미리 백색화함으로써 수렴 속도를 가속화 할 수 있으며, 독립 성분 분석에 기반한 적응 잡음 제거 알고리즘의 수렴 속도를 향상시켰다. 셋째, 등간격 필터뱅크 접근 방법을 사용하여 입력 신호를 백색화함으로써 빠른 수렴 속도를 얻을 수 있으며, 각 대역에서 느린 속도로 동작하는 짧은 필터를 이용하여 원하는 적응 필터를 수행함으로써 계산량을 크게 줄일 수 있다. 이 방법을 독립 성분 분석에 적용하여 기존의 주파수 영역 접근 방법보다 뛰어난 성능을 얻을 수 있었으며, 시간 영역 접근 방법보다 적은 계산량으로도 더 빠른 수렴 속도를 얻을 수 있었다. 넷째, Bark 청각 척도 필터 뱅크를 필터 뱅크 접근 방법에 적용하였다. 대부분의 음향 신호는 주파수가 올라감에 따라서 에너지가 지수 함수 모양으로 감소한다. 따라서, 등간격 필터 뱅크 접근 방법은 높은 주파수 대역보다 낮은 주파수 대역에서 상대적으로 더 느린 수렴 속도를 보이고, 높은 주파수 대역에서는 필터를 정확히 학습할 충분한 데이터를 보유할 수 없다. 실험 결과를 통해서 Bark 청각 척도 필터 뱅크 접근 방법이 등간격 필터 뱅크 접근 방법보다 더 빠른 수렴 속도와 더 높은 신호 대 잡음 비나 PESQ 값을 얻을 수 있음을 확인하였다. 마지막으로, 적응 필터 방법을 사용하여 잡음의 영향이 배제된 음성의 특징을 효율적으로 얻기 위한 MFCC 추출 방법을 제안하였다. 기존의 MFCC 추출 방법은 푸리에 변환 (Fourier transform)을 이용하지만, 이 방법은 mel 청각 척도 필터 뱅크를 사용한 필터 뱅크 접근 방법을 통해서 대역 출력으로부터 신호를 복원하지 않고 직접 특징을 추출할 수 있다. 인식 실험을 통해서 잡음 환경에서의 음성 인식을 위한 다채널 방법으로서 제안한 방법을 효과적으로 사용할 수 있음을 확인하였다.

서지기타정보

서지기타정보
청구기호	{DEE 03061
형태사항	ix, 104 p. : 삽화 ; 26 cm
언어	영어
일반주기	저자명의 한글표기 : 박형민 지도교수의 영문표기 : Soo-Young Lee 지도교수의 한글표기 : 이수영 수록잡지명 : "Adaptive noise cancelling based on independent component analysis". Electronics letters, v.38 no.15, pp.832-833(2002)
학위논문	학위논문(박사) - 한국과학기술원 : 전기및전자공학전공,
서지주기	Reference : p. 96-104

QR CODE

책소개

전체보기

나의 도서관정보

메뉴

소장정보

리뷰정보

초록정보

서지기타정보

책소개

목차

이 주제의 인기대출도서