서지주요정보
주파수 영역에서의 특징 벡터 변환과 판단 네트웍을 이용한 화자 적응 = Speaker adaptation using spectral transformation and judge network
서명 / 저자 주파수 영역에서의 특징 벡터 변환과 판단 네트웍을 이용한 화자 적응 = Speaker adaptation using spectral transformation and judge network / 정재훈.
발행사항 [대전 : 한국과학기술원, 2000].
Online Access 원문보기 원문인쇄

소장정보

등록번호

8011511

소장위치/청구기호

학술문화관(문화관) 보존서고

DEE 00059

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

Recent advances in speech recognition technology have resulted in high performance speaker-independent(SI) speech recognizer. But the speakers who are not covered by the training data. So, the SI system needs to be adapted for a new speaker. In this dissertation, a new speaker adaptation method which uses spectral transformation and judge netword is developed when a small subset of word classes is available for the adaptation. Because radial-basis function (RBF) neutal network, SI recognizer, needs much data for training, it is hard to adapt the parameters of SI recognizer using small adaptation data. So, a spectral transformation approach is used to adapt to a new speaker. The target vector could be placed at the output of adaptation network. In this case, adaptation network approximate the transformation function between incoming speech features and the standard features. The centers of RBF are used as standard features because the centers of hidden nodes of RBF represent average vectors of training vectors of RBF, which could save the memory for hardware implementation. In another case, target could be placed at the output of SI recognizer. Using the steepest decent algorithe, the adaptation network is adapted to increase the discrimination ability of SI recognizer, which results in better recognition rate than that of the previous case. The adapted network gives much improved results for the adapted word classes, but gives degradation results for the non-adapted word classes. Judge network uses the outputs of recognizers both with and without adaptation network. By using "judge" network the degradation of the recognition rates for non-adapted word classes is minimized, which leads to the improvement of overall word recognition rates even when a small subset of word classes is available for the adaptation.

서지기타정보

서지기타정보
청구기호 {DEE 00059
형태사항 ix, 101 p. : 삽화 ; 26 cm
언어 한국어
일반주기 저자명의 영문표기 : Jae-Hoon Jeong
지도교수의 한글표기 : 이수영
지도교수의 영문표기 : Soo-Young Lee
수록잡지명 : "Speaker adaptation based on judge neural networks for real world implementation of voice-command system". Information sciences, v.123 no.1, pp. 13-24(2000)
학위논문 학위논문(박사) - 한국과학기술원 : 전기및전자공학전공,
서지주기 참고문헌 : p. 97-101
QR CODE

책소개

전체보기

목차

전체보기

이 주제의 인기대출도서