음절인식을 위한 순환구조 신경회로망 = Recurrent neural network for syllable recognition
Title / Author 음절인식을 위한 순환구조 신경회로망 = Recurrent neural network for syllable recognition / Yong-Duk Cho (조용덕).
Publication [Daejeon : Korea Advanced Institute of Science and Technology, 1990].
Recently, various neural networks have been widely used in speech recognition. Among them, networks with recurrent connections, which give the network memory, have been studied for the recognition of time-varying sequences. A successful neural network for speech recognition should not only capture temporally distributed features but also tolerate the temporal distortion that causes length variation. Although recurrent connections provide some capability for sequence recognition, it is too burdensome for them alone to memorize all the dynamics of the speech signal. We therefore extended Elman's network [Elman88], which has fully recurrent connections in the hidden layer, to enhance the dynamic memory capacity of the recurrent network. The input layer of the extended Elman network is arranged as n (n > 1) context buffers instead of the single buffer in Elman's network, which is useful for extracting context-sensitive features from the input. The target function in the output layer is an analog function instead of a binary one; it reflects the confidence level of the output for the current input in the context buffers. Speaker-dependent CV syllable recognition was performed using 14th-order LPC cepstral coefficients. The experimental results show that the extended Elman network outperforms both Elman's network and a multi-layer perceptron (MLP) without recurrent connections and with the maximum input buffer; that is, there exists an optimal number of input context buffers that gives the best performance. This may be because the recurrent connections and the context buffers work cooperatively to give the network more discriminative capability than either the recurrent connections or the context buffers alone. With this cooperation, the segmentation-free nature of the recurrent network makes it possible to extend the proposed network to connected speech recognition.
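The architecture described in the abstract can be illustrated with a minimal forward-pass sketch. It is not the thesis implementation: the class name `ExtendedElman`, the layer sizes, the random weight initialisation, the absence of bias terms, the choice of filling the n context buffers with the current frame plus the n-1 preceding frames (zero-padded at the start), and the reading of the input as 14 cepstral coefficients per frame are all assumptions made for illustration. Training and the analog target function are not shown, since the abstract does not specify them.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def init_weights(rows, cols, rng):
    # Small random weights; the initialisation scheme is an assumption.
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(w, v):
    return [sum(wi * vi for wi, vi in zip(row, v)) for row in w]


class ExtendedElman:
    """Hypothetical sketch: Elman-style hidden recurrence plus n input context buffers."""

    def __init__(self, n_coeffs, n_buffers, n_hidden, n_classes, seed=0):
        rng = random.Random(seed)
        self.n_coeffs = n_coeffs
        self.n_buffers = n_buffers
        self.n_hidden = n_hidden
        self.w_in = init_weights(n_hidden, n_coeffs * n_buffers, rng)
        self.w_rec = init_weights(n_hidden, n_hidden, rng)  # fully recurrent hidden layer
        self.w_out = init_weights(n_classes, n_hidden, rng)

    def forward(self, frames):
        hidden = [0.0] * self.n_hidden  # recurrent state starts at zero
        outputs = []
        for t in range(len(frames)):
            # Fill the context buffers with frames t-n+1 .. t (zeros before the start).
            window = []
            for k in range(t - self.n_buffers + 1, t + 1):
                window.extend(frames[k] if k >= 0 else [0.0] * self.n_coeffs)
            net_in = matvec(self.w_in, window)
            net_rec = matvec(self.w_rec, hidden)  # previous hidden state fed back
            hidden = [sigmoid(a + b) for a, b in zip(net_in, net_rec)]
            # One analog score in (0, 1) per class for every frame.
            outputs.append([sigmoid(o) for o in matvec(self.w_out, hidden)])
        return outputs
```

With `n_buffers=1` the sketch reduces to a single-buffer Elman-style network. Because `forward` emits one score vector per frame for an input of any length, no prior segmentation of the utterance is required, which is the property the abstract relies on for connected speech.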


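Call number MCS 9049
Physical description 1 v. (various pagings) : ill. ; 26 cm
Language Korean
General note Appendix: survey of the number of frames per syllable
Author name in English: Yong-Duk Cho
Advisor name in Korean: 맹승렬
Advisor name in English: Seung-Ryoul Maeng
Thesis Thesis (Master's) - Korea Advanced Institute of Science and Technology : Department of Computer Science,
Bibliography note Includes bibliographical references
Subject Automatic speech recognition.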
Speech perception.
Grammar, Comparative and general --Syllable.
Syllable. --Science and Technology Terminology Thesaurus
Speech recognition. --Science and Technology Terminology Thesaurus
Neural networks (Computer science)





