서지주요정보
음소단위의 고립단어 인식을 위한 신경회로망의 구조에 관한 연구 = Neural network architecture for phoneme-based isolated word recognition
서명 / 저자 음소단위의 고립단어 인식을 위한 신경회로망의 구조에 관한 연구 = Neural network architecture for phoneme-based isolated word recognition / 김상훈.
발행사항 [대전 : 한국과학기술원, 1992].
Online Access 원문보기 원문인쇄

소장정보

등록번호

8003051

소장위치/청구기호

학술문화관(문화관) 보존서고

MEE 92015

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

In this thesis work, we propose a Weighted Time State Neural Network (WTSNN) for phoneme recognition and a neural network architecture for phoneme-based isolated word recognition and obtain their performance. States of TSNN is to reflect temporal structure of phonemic features. However, the contribution of each state to phoneme recognition varies state to state, we propose a new algorithm called weighted TSNN in which each state is weighted according to the effect on recognition performance. Weights of network can also be obtained by learning multi-layer perceptron at the top of TSNN. In recognizing initial stop consonants in syllables, the proposed algorithm yields better performance results over the conventional Time Delay Neural Network (TDNN) and Time State Neural Network (TSNN) algorithms by 8% and 6%, respectively, in a speaker independent mode. For the phoneme-based isolated word recognition, we propose an architecture based on TDNN and TSNN. The weighted TSNN or TSNN has higher phoneme recognition rates than TDNN. However, with the TSNN architecture alone, we can not spot phonemes in speech signal. The proposed phoneme recognition architecture combines the phoneme spotting ability of TDNN and the higher recognition performance of TSNN. Using this architecture, we can reduce the lerning time and obtain recognition rates comparable to those of large TDNN architecture with increased hidden nodes. From computer simulation results on speaker dependent phoneme-based Korean digit recognition, we obtain the recognition accuracies of 80.95% and 96.0% for phoneme recognition and word recognition, respectively using the conventional TDNN architecture. For the proposed architecture, we obtain the recognition rates of 86.67% and 96.0% for phoneme and word recognition respectively. The proposed architecture requires less computation time than the large TDNN in the learning phase.

서지기타정보

서지기타정보
청구기호 {MEE 92015
형태사항 iv, 84 p. : 삽화 ; 26 cm
언어 한국어
일반주기 부록 수록
저자명의 영문표기 : Sang-Hun Kim
지도교수의 한글표기 : 이황수
지도교수의 영문표기 : Hwang-Soo Lee
학위논문 학위논문(석사) - 한국과학기술원 : 전기및전자공학과,
서지주기 참고문헌 : p. 82-84
주제 Word recognition.
Speech perception.
Phonemics.
Korean language.
신경 회로망. --과학기술용어시소러스
음성 인식. --과학기술용어시소러스
단어. --과학기술용어시소러스
음소. --과학기술용어시소러스
Neural networks (Computer science)
QR CODE

책소개

전체보기

목차

전체보기

이 주제의 인기대출도서