한국과학기술원 도서관

서지주요정보
독립성분분석을 이용한 강인한 화자식별 = Robust speaker identification using independent component analysis
서명 / 저자	독립성분분석을 이용한 강인한 화자식별 = Robust speaker identification using independent component analysis / 장길진.
발행사항	[대전 : 한국과학기술원, 1999].
Online Access	원문보기 원문인쇄

소장정보

등록번호

8009836

소장위치/청구기호

학술문화관(문화관) 보존서고

MCS 99032

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

등록번호

9006021

소장위치/청구기호

서울 학위논문 서가

MCS 99032 c. 2

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

The aim of the speaker recognition system is identifying who is speaking, by the personal identity information extracted from speech signal. Practical application mainly used for speaker recognition is authentication of a person on the telephone line. But the telephone speech contains nonlinear distortions caused by transmission line, which can lead to serious performance degradation due to the mismatches between training and testing environments. Some compensation methods such as CMS (cepstral mean subtraction) and SBR (signal bias removal) were proposed. But these have their own limits on the estimation of time-varying channel distortions and the preservation of static speaker information while compensating the distortions, so new method suited to speaker recognition is needed. This thesis proposes feature parameter transformation using ICA (independent component analysis) as a new compensation method. ICA is a signal processing technique, whose goal is to express a set of random variables as linear combinations of components that are statistically as independent from each other as possible. The proposed method assumes that the cepstrum vectors from various channel-conditioned speech are linear combinations of some characteristic functions with random channel noise added, and transforms them into new vectors using ICA. The resultant vector space can give emphasis to the repetitive speech information and suppress the random channel distortions. The proposed method was compared to other channel compensation methods. Experiments on SPIDRE, real telephone speech database, were performed in equal and different channel conditions. In the equal channel condition, proposed method marked 2%~7% higher recognition rate than others. In the different channel condition, 9%~16% higher. These results showed that the proposed method is more robust and discriminating than others.

서지기타정보

서지기타정보
청구기호	{MCS 99032
형태사항	vii, 59 p. ; 삽화 ; 26 cm
언어	한국어
일반주기	저자명의 영문표기 : Gil-Jin Jang 지도교수의 한글표기 : 오영환 지도교수의 영문표기 : Yung-Hwan Oh
학위논문	학위논문(석사) - 한국과학기술원 : 전산학과,
서지주기	참고문헌 : p. 53-55

QR CODE

책소개

전체보기

나의 도서관정보

메뉴

소장정보

리뷰정보

초록정보

서지기타정보

책소개

목차

이 주제의 인기대출도서