서지주요정보
실시간 대용량 음성인식을 위한 Viterbi scoring 보드의 구현 = Implementation of a viterbi scoring board for real time large vocabulary speech recognition
서명 / 저자 실시간 대용량 음성인식을 위한 Viterbi scoring 보드의 구현 = Implementation of a viterbi scoring board for real time large vocabulary speech recognition / 오광석.
발행사항 [대전 : 한국과학기술원, 1994].
Online Access 제한공개(로그인 후 원문보기 가능)원문

소장정보

등록번호

8004895

소장위치/청구기호

학술문화관(문화관) 보존서고

MIC 94009

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

등록번호

9000897

소장위치/청구기호

서울 학위논문 서가

MIC 94009 c. 2

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

For real time implementation of large vocabulary speech recognition systems using hidden Markov models (HMMs), a huge amount of Viterbi scoring operations should be performed to select a vocabulary model which matches best with each incoming speech signal. While the general purpose digital signal processing (DSP) chips are widely used for feature extraction of speech signal, their architectures are not suitable for Viterbi scoring operations. For that reason, we need a specialized architecture to perform the Viterbi scoring operations efficiently for real time speech recognition especially when the vocabulary size becomes large. In this thesis work, we implement a dedicated Viterbi scoring board for real time large vocabulary speech recognition using HMM. The Viterbi scoring board is implemented using the field programmable gate array (FPGA) chips. It performs Viterbi scoring operations in a logarithmic manner for efficient computation. In order to obtain the performance of the Viterbi scoring board, we construct a prototype speech recognition system. The system consists of a host computer (PC/AT), a DSP board, and a dedicated Viterbi scoring board we implemented. In the DSP board, we obtain a 16th order mel-scaled cepstrum and its vector quantized index for each frame of speech signal. Then the Viterbi scoring board receives the vector quantized index from the DSP board and updates all the state metrics at every frame intervals. As a result, the clock rate of 10 MHz, the speech recognition system can update 100,000 state metrics within a single frame of 10 ms, or equivalently, 3,300 words when the average number of states per word is 30.

서지기타정보

서지기타정보
청구기호 {MIC 94009
형태사항 iv, 60 p. : 삽화 ; 26 cm
언어 한국어
일반주기 저자명의 영문표기 : Kwang-Sok Oh
지도교수의 한글표기 : 이황수
지도교수의 영문표기 : Hwang-Soo Lee
학위논문 학위논문(석사) - 한국과학기술원 : 정보및통신공학과,
서지주기 참고문헌 : p. 58-60
주제 Speech perception.
Large scale systems.
Real-time data processing.
Markov processes.
Signal processing --Digital techniques.
Markov 과정. --과학기술용어시소러스
음성 인식. --과학기술용어시소러스
실시간 처리. --과학기술용어시소러스
디지털 방식. --과학기술용어시소러스
대규모 시스템. --과학기술용어시소러스
QR CODE

책소개

전체보기

목차

전체보기

이 주제의 인기대출도서