한국과학기술원 도서관

서지주요정보
모음인식과 벡터 양자화를 이용한 화자인식 시스템에 대한 연구 = A study on a speaker identification system based on vowel classification and vector quantization
서명 / 저자	모음인식과 벡터 양자화를 이용한 화자인식 시스템에 대한 연구 = A study on a speaker identification system based on vowel classification and vector quantization / 임창헌.
발행사항	[서울 : 한국과학기술원, 1988].
Online Access	제한공개(로그인 후 원문보기 가능)원문

소장정보

등록번호

4105368

소장위치/청구기호

학술문화관(문화관) 보존서고

MEE 8888

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

In this thesis work, we propose a text-independent speaker identification algorithm based on VQ (vector quantization) and vowel classification, and its performance is studied and compared with that of a conventional speaker identification algorithm using VQ. The proposed speaker identification algorithm is composed of three processes: vowel segmentation, vowel recognition and average distortion calculation. For automatic vowel segmentation, vowel sound intervals are segmented first from test speech using RMS energy, BTR, SFBR. Then, the segmented vowel intervals are manually labeled and VQ codebooks are made separately for different vowel sounds of each speaker. As a result, a speaker is identified as the one whose VQ codebook gives the minimum average distortion. Computer simulation has been done to obtain the performance of the proposed speaker identification algorithm using 33 word vocabularies drawn from seven male speakers under ordinary ambient condition. Among the spoken vocabularies, 17 words are used for constructing the VQ codebooks and the rest are used for speaker identification test. With both vowel segmentation and recognition done manually, the proposed speaker identification algorithm gives identification rate of about 83% when the test speech signal consists of one isolated word vocabularies. On the other hand, when we use the automatic vowel segmentation algorithm, the identification rate of the proposed algorithm drops from 83.0% to 75.5%. The identification rate of the proposed speaker identification algorithm is better than that of the reference speaker identification algorithm using VQ when the number of VQ codewords is small (4~8). However, as the number of codewords in more than 16, the performance of the proposed algorithm is worse than that of the reference speaker identification algorithm using VQ. It the input signal is noisy, particularily when the SNR is 20dB, the identification rate of the proposed speaker identification algorithm is better than that of the reference speaker identification algorithm with manual segmentation and recognition of vowel sounds. And if the speech signal which is telephone speech signal and noisy (SNR=20dB) is used for speaker identification, the identification rate of the proposed speaker identification algorithm is better than that of the reference speaker identification algorithm.

본 논문에서는 모음 인식과 VQ를 이용한 text-independent 화자 인식 system을 제안하고, 기존의 VQ를 이용한 화자 인식 system과 성능을 비교하여 보았다. 모음 인식과 VQ를 이용한 화자 인식 system은 모음 분리 과정과 모음 인식과정 그리고 평균 distribution을 계산하여 화자를 인식하는 과정으로 구성되어 있다. 모음 분리는 자동으로 하는 방법과 수동으로 하는 방법 2 가지를 사용하였고, 모음 인식은 수동으로만 하였다. 자동 모음 분리는 음성 신호로부터 추출한 3가지 feature(RMS energy, BTR, SFBR)을 사용하여 모음을 분리하는 방법을 사용하였다. 그리고 training 과정에서 각 화자마다 네 가지 모음(아(와),어,오,에(애))에 대한 VQ codebook을 만들고, test 과정에서 모음 인식의 결과에 따라 평균 distortion을 계산할 때 사용되는 VQ codebook을 선택하였다. 실험 결과, noise의 영향이 없는 상태에서, 모음 인식과 VQ를 이용한 화자 인식 system은 codeword 수가 작을 때는 VQ를 이용한 화자 인식 system보다 성능이 좋았으나 codeword수가 클 때는 성능이 나빠졌다. 그리고 noise의 영향만을 고려한 경우, 모음 인식과 VQ를 이용한 화자 인식 system은 VQ를 이용한 화자 인식 system보다 전반적으로 인식률이 높았다. 그리고 noise의 영향(SNR=20dB)과 전화선의 영향을 고려한 경우에도, 모음 인식과 VQ를 이용한 화자 인식 system의 인식율이 VQ를 이용한 화자 인식 system의 인식율에 비해 전반적으로 높았다. 한편 모음 인식과 VQ를 이용한 화자 인식 system은 VQ를 이용한 화자 인식 system과는 달리 음성 처리 영역이 모음 부분으로 한정되기 때문에, test 음성 data 양이 많아지고 codeword 수가 커지면 모음 분리와 모음 인식으로 인한 overhead를 고려하더라도 계산량이 작아지는 장점이 있다.

서지기타정보

서지기타정보
청구기호	{MEE 8888
형태사항	[v], 68 p. : 삽화 ; 26 cm
언어	한국어
일반주기	저자명의 영문표기 : Chang-Heon Lim 지도교수의 한글표기 : 이황수 지도교수의 영문표기 : Hwang-Soo Lee
학위논문	학위논문(석사) - 한국과학기술원 : 전기및전자공학과,
서지주기	참고문헌 : p. 65-68
주제	Vowels. Vector processing (Computer science) Vocoder. 음성 인식. --과학기술용어시소러스 모음. --과학기술용어시소러스 벡터 양자화. --과학기술용어시소러스 화자 인식. --과학기술용어시소러스 보코더. --과학기술용어시소러스 Speech perception.

QR CODE

책소개

전체보기

나의 도서관정보

메뉴

소장정보

리뷰정보

초록정보

서지기타정보

책소개

목차

이 주제의 인기대출도서