한국과학기술원 도서관

서지주요정보
소용량 음성데이터베이스를 이용한 어휘독립 단어인식기의 음소 모델링 = Subword modeling of vocabulary independent word recognizer with small speech database
서명 / 저자	소용량 음성데이터베이스를 이용한 어휘독립 단어인식기의 음소 모델링 = Subword modeling of vocabulary independent word recognizer with small speech database / 구동욱.
발행사항	[대전 : 한국과학기술원, 2001].
Online Access	원문보기 원문인쇄

소장정보

등록번호

8011979

소장위치/청구기호

학술문화관(문화관) 보존서고

MCS 01009

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

등록번호

9007594

소장위치/청구기호

서울 학위논문 서가

MCS 01009 c. 2

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

The goal of vocabulary independent speech recognition (VISR) is to transform the input speech waveform into words by utilizing subword models with continuous or isolated utterance, even whether the vocabulary set is fixed or not. VISR system has generally been developed with large speech database to model various phonetic and acoustic phenomena. However, in this thesis, we propose the VISR system using back-off phoneme decision tree with small speech database. The back-off phoneme decision tree, which shows the better performance in the condition of small speech database, is binary decision tree for automatic phoneme clustering. In the back-off phoneme decision tree, splitting from parent node is maintained, without stopping criterion, until no more data elements are assigned to children nodes. After the tree construction, the type of node is determined by the number of elements belong to given node. The type of node means that acoustic model belong to given node needs to be trained or not. According to the node type, the acoustic subword model is trained by data not only of terminal nodes, but also of internal nodes. In contrast to previous phoneme decision tree, back-off phoneme decision tree has advantage of using more phonetic information for acoustic subword modeling. In addition, parent node is referred without additional computation if unseen model is requested. With the small speech database, the effectiveness of back-off phoneme decision tree was shown through several experiments. By model combination with deleted interpolation method, we could estimate reliable HMM model parameters. For the future work, the Back-Off phoneme decision tree will be applied to continuous speech recognition system with large scaled speech corpus.

서지기타정보

서지기타정보
청구기호	{MCS 01009
형태사항	v, 48 p. : 삽화 ; 26 cm
언어	한국어
일반주기	저자명의 영문표기 : Dong-Ook Koo 지도교수의 한글표기 : 오영환 지도교수의 영문표기 : Yung-Hwan Oh
학위논문	학위논문(석사) - 한국과학기술원 : 전산학전공,
서지주기	참고문헌 : p. 46-48

QR CODE

책소개

전체보기

나의 도서관정보

메뉴

소장정보

리뷰정보

초록정보

서지기타정보

책소개

목차

이 주제의 인기대출도서