Main bibliographic information
한국어 형태소 분석과 태깅 워크벤치의 설계 및 구현 = Design and implementation of Korean morphological analysis and tagging workbench
Title / Author 한국어 형태소 분석과 태깅 워크벤치의 설계 및 구현 = Design and implementation of Korean morphological analysis and tagging workbench / Wook Huh (허욱).
Publication [Daejeon : Korea Advanced Institute of Science and Technology, 1997].

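Holdings

Accession number 8007867
Location Academic Cultural Complex (Cultural Complex), preservation stacks
Call number MCS 97049
Item status Available (not for loan)

Accession number 9003363
Location Seoul thesis shelves
Call number MCS 97049 c. 2
Item status Available (not for loan)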

Abstract

Statistical methods based on large annotated corpora are now widely used in natural language processing. For a corpus to be effective for such methods, its quality matters as well as its size. However, building a large, error-free annotated corpus is difficult and labor-intensive. This thesis presents a method that addresses problems in building a part-of-speech tagged corpus. For the unknown word problem, we modify the morphological analyzer to recognize unknown words in the input text and report them to the user, who can then semi-automatically register them in the dictionary and re-analyze the text correctly. For automatic tagging errors, we propose a rule-based error correction method that finds and corrects errors semi-automatically using user-defined rules. We also use the user's error correction log to reflect the user's feedback during manual correction. Experiments were carried out on 10,000 Korean words to evaluate the efficiency of the workbench's error correction process. The results show that about 63.2% of tagging errors can be corrected semi-automatically in a user-friendly manner.
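The two semi-automatic steps described in the abstract (flagging unknown words, then proposing rule-based tag corrections for the user to confirm) might be sketched as follows. This is only an illustration under stated assumptions, not the thesis's implementation: the `Rule` fields, the function names, the set-based lexicon, and the tag labels (`NP`, `JX`, `ETM`, `NNG`) are all hypothetical and are not taken from the thesis.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Set, Tuple

Token = Tuple[str, str]  # (morpheme, tag); tag names here are hypothetical


@dataclass(frozen=True)
class Rule:
    """Hypothetical user-defined rule: retag `word` from `wrong` to `right`
    when the previous token's tag is `prev_tag` (None matches any context)."""
    word: str
    wrong: str
    right: str
    prev_tag: Optional[str] = None


def find_unknown(tokens: List[Token], lexicon: Set[str]) -> List[str]:
    """Return morphemes missing from the lexicon so the user can register them."""
    return [word for word, _ in tokens if word not in lexicon]


def propose_corrections(tokens: List[Token], rules: List[Rule]) -> List[Tuple[int, str]]:
    """Return (index, new_tag) pairs for tokens matched by a rule.
    The first matching rule wins for each token."""
    proposals = []
    for i, (word, tag) in enumerate(tokens):
        prev = tokens[i - 1][1] if i > 0 else None
        for rule in rules:
            context_ok = rule.prev_tag is None or rule.prev_tag == prev
            if rule.word == word and rule.wrong == tag and context_ok:
                proposals.append((i, rule.right))
                break
    return proposals


def apply_corrections(
    tokens: List[Token],
    proposals: List[Tuple[int, str]],
    accept: Callable[[int, Token, str], bool],
) -> List[Token]:
    """Apply only the proposals the user accepts (the semi-automatic step)."""
    fixed = list(tokens)
    for i, new_tag in proposals:
        if accept(i, fixed[i], new_tag):
            fixed[i] = (fixed[i][0], new_tag)
    return fixed


if __name__ == "__main__":
    tagged = [("나", "NP"), ("는", "ETM"), ("학교", "NNG")]
    rules = [Rule(word="는", wrong="ETM", right="JX", prev_tag="NP")]
    print(find_unknown(tagged, lexicon={"나", "는"}))
    proposals = propose_corrections(tagged, rules)
    print(apply_corrections(tagged, proposals, accept=lambda i, tok, new: True))
```

Keeping proposal and application separate mirrors the semi-automatic workflow: rules only suggest changes, and the `accept` callback stands in for the user's confirmation.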

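Other bibliographic information

Call number MCS 97049
Physical description iv, 39 p. : illustrations ; 26 cm
Language Korean
General note Author's name in English : Wook Huh
Advisor's name in Korean : 최기선
Advisor's name in English : Key-Sun Choi
Thesis note Thesis (Master's) - Korea Advanced Institute of Science and Technology : Department of Computer Science,
Bibliography note References : p. 36-38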