서지주요정보
한국어 형태소 분석과 태깅 워크벤치의 설계 및 구현 = Design and implementation of Korean morphological analysis and tagging workbench
서명 / 저자 한국어 형태소 분석과 태깅 워크벤치의 설계 및 구현 = Design and implementation of Korean morphological analysis and tagging workbench / 허욱.
저자명 허욱 ; Huh, Wook
발행사항 [대전 : 한국과학기술원, 1997].
Online Access 원문보기 원문인쇄

소장정보

등록번호

8007867

소장위치/청구기호

학술문화관(문화관) 보존서고

MCS 97049

SMS전송

도서상태

이용가능

대출가능

반납예정일

등록번호

9003363

소장위치/청구기호

서울 학위논문 서가

MCS 97049 c. 2

SMS전송

도서상태

이용가능

대출가능

반납예정일

초록정보

In natural language processing, statistical methods based on a large annotated corpus are being used these days. To make the corpus be effective to such methods, the quality of the corpus as well as the quantity of it is important. But the task of building a large, faultless, annotated corpus is very difficult and labor-intensive. This thesis presents a method that overcomes problems in building parts-of- speech tagged corpus. For the unknown word problem, we modify the morphological analyzer to recognize and notify unknown words in the input text to the user, who then can semi-automatically register them on the dictionary and re-analyze the text correctly. For the errors of an automatic tagging, we propose a rule-based error correction method which finds and corrects errors semi-automatically based on user-defined rules. We also make use of the user's error correction log to reflect the user‘s feedback during manual correction process. Experiments were carried out on 10,000 Korean words to show the efficiency of error correction process of this workbench. The result shows that about 63.2% of tagging errors can be corrected semi-automatically and user- friendly.

서지기타정보

서지기타정보
청구기호 {MCS 97049
형태사항 iv, 39 p. : 삽도 ; 26 cm
언어 한국어
일반주기 저자명의 영문표기 : Wook Huh
지도교수의 한글표기 : 최기선
지도교수의 영문표기 : Key-Sun Choi
학위논문 학위논문(석사) - 한국과학기술원 : 전산학과,
서지주기 참고문헌 : p. 36-38
주제 형태소 분석
태깅
코퍼스 구축
워크벤치
Morphological analysis
Tagging
Corpus construction
Workbench
QR CODE qr code