Main bibliographic information
문법의 범위를 초과하는 영어 문장들을 위한 견고한 구문 분석기 연구 = A study on a robust parser for extragrammatical English sentences
Title / Author 문법의 범위를 초과하는 영어 문장들을 위한 견고한 구문 분석기 연구 = A study on a robust parser for extragrammatical English sentences / 이공주.
Publication [Daejeon : Korea Advanced Institute of Science and Technology, 1994].
Online Access Restricted (full text viewable after login)

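Holdings information

Registration number 8004967
Location / Call number Academic Cultural Complex (Cultural Complex) preservation stacks, MCS 94042
Item status Available (not for loan)

Registration number 9000969
Location / Call number Seoul thesis shelves, MCS 94042 c. 2
Item status Available (not for loan)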

Abstract

A practical natural language parser must behave robustly when given extragrammatical user input. This thesis proposes a robust parser with a recovery mechanism that handles extragrammatical sentences by adopting a least-error recognition algorithm. For any input sentence, this algorithm generates at least one parse tree with the fewest errors, and it can be adapted to any parser driven by a context-free grammar. However, because the algorithm hypothesizes all possible errors, including insertion, deletion, and mutation of constituents, efficiency degrades and too many parse trees are generated. Two kinds of heuristic rules are therefore proposed to improve efficiency and reduce the number of parse trees. The first kind assigns a weight to each hypothetical error edge so that the most promising edge is selected during parsing. With these rules, syntactic analysis can finish efficiently and successfully without processing all the edges produced by the generic least-error recognition algorithm. The second kind, which uses clue symbols, analyzes inserted phrases and parallel phrases heuristically rather than grammatically. This resolves some ambiguity problems and raises the accuracy of recovery. Empirical results show that the recovery accuracy of this parser ranges from 79% to 92%, and that the number of resulting syntactic trees is 40% to 90% lower than with the generic least-error recognizer.
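
The idea of least-error recognition mentioned in the abstract can be illustrated with a minimal sketch. This is not the thesis's algorithm: it assumes a toy grammar in Chomsky normal form, counts errors at the word level (the abstract speaks of constituents), uses a CYK-style table rather than chart edges, and omits both kinds of heuristic rules. The names `least_error_count`, `LEXICAL`, and `BINARY` are hypothetical and chosen only for illustration.

```python
INF = float("inf")

# Hypothetical toy grammar in Chomsky normal form (an assumption for this sketch).
LEXICAL = {"Det": {"the"}, "N": {"dog", "cat"}, "V": {"sees"}}
BINARY = {"S": [("NP", "VP")], "NP": [("Det", "N")], "VP": [("V", "NP")]}


def least_error_count(words, lexical, binary, start="S"):
    """Fewest word insertions, deletions, and substitutions that turn
    `words` into a sentence derivable from `start`."""
    nonterminals = set(lexical) | set(binary)
    for rules in binary.values():
        for left, right in rules:
            nonterminals.update((left, right))
    n = len(words)

    # empty[A]: cost when A covers no input at all, i.e. the length of the
    # shortest string A derives (every one of its words is missing).
    empty = {A: (1 if lexical.get(A) else INF) for A in nonterminals}
    changed = True
    while changed:
        changed = False
        for A, rules in binary.items():
            for B, C in rules:
                c = empty[B] + empty[C]
                if c < empty[A]:
                    empty[A] = c
                    changed = True

    # cost[A, i, j]: fewest errors for A to cover words[i:j].
    cost = {(A, i, i): empty[A] for A in nonterminals for i in range(n + 1)}
    for length in range(1, n + 1):
        for i in range(n - length + 1):
            j = i + length
            span = words[i:j]
            for A in nonterminals:
                best = INF
                for a in lexical.get(A, ()):
                    # Keep (or substitute) one word, treat the rest as extra.
                    best = min(best, length - 1 if a in span else length)
                cost[A, i, j] = best
            # A child may cover the whole span while its sibling is empty,
            # so relax the binary rules until nothing improves.
            changed = True
            while changed:
                changed = False
                for A, rules in binary.items():
                    for B, C in rules:
                        for k in range(i, j + 1):
                            c = cost[B, i, k] + cost[C, k, j]
                            if c < cost[A, i, j]:
                                cost[A, i, j] = c
                                changed = True
    return cost[start, 0, n]


print(least_error_count("the dog sees the cat".split(), LEXICAL, BINARY))  # 0
print(least_error_count("the dog sees cat".split(), LEXICAL, BINARY))      # 1
```

Because every span is scored for every nonterminal under every possible error, the table always yields an answer, which mirrors the abstract's remark that a parse with the fewest errors exists for any input. It also hints at why the generic method is costly and why the thesis introduces heuristic weights to avoid exploring all hypotheses.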

Other bibliographic information

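Call number MCS 94042
Physical description iv, 43 p. : illustrations ; 26 cm
Language Korean
General notes Author name in English : Kong-Joo Lee
Advisor name in Korean : 김길창
Advisor name in English : Gil-Chang Kim
Thesis Thesis (Master's) - Korea Advanced Institute of Science and Technology : Department of Computer Science,
Bibliography note References : p. 41-43
Subject Natural language processing.
Parsing (Computer grammar)
Robust statistics.
English language. --Science and Technology Term Thesaurus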