서지주요정보
한글 문서의 효과적인 검색을 위한 N-Gram 기반의 색인 방법 = An N-gram-based indexing method for effective retrieval of hangul texts
서명 / 저자 한글 문서의 효과적인 검색을 위한 N-Gram 기반의 색인 방법 = An N-gram-based indexing method for effective retrieval of hangul texts / 안정수.
발행사항 [대전 : 한국과학기술원, 1995].
Online Access 제한공개(로그인 후 원문보기 가능)원문

소장정보

등록번호

9002005

소장위치/청구기호

서울 학위논문 서가

MCS 95046 c. 2

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

A variety of indexing methods for Hangul texts have been proposed in the past. They can be classified into two groups as follows: One is to extract index terms by removing particles, endings, suffixes et at. from word phrases, and the other is to generate index terms from morphemes of word phrases. The former suffers from the problem of word boundaries when documents contain many compound nouns even though it can be easily implemented with the longest match principle. The latter can overcome the word boundary problem by extracting simple nouns. It, however, has many overheads to develop a lot of linguistic knowledge needed in the indexing procedure. In this paper we propose a new indexing method based on n-grams. The proposed method consists of the following four steps. First, word phrases are recognized from Hangul texts. Second, we eliminate stopwords which are not appropriate to represent the texts. Then, the meaningless parts consisting of particles, endings, suffixes et al. are removed from the remaining word phrases. Finally, we get n-grams from the meaningful parts. The proposed indexing method alleviates the problems of previous indexing methods related with word boundaries and linguistic knowledge. We also show through performance comparison that the n-gram based indexing method provides similar retrieval effectiveness to the case that texts are indexed with manually-extracted simple nouns.

서지기타정보

서지기타정보
청구기호 {MCS 95046
형태사항 i, 40 p. : 삽화 ; 26 cm
언어 한국어
일반주기 부록 수록
저자명의 영문표기 : Jeong-Soo Ahn
지도교수의 한글표기 : 김명호
지도교수의 영문표기 : Myoung-Ho Kim
학위논문 학위논문(석사) - 한국과학기술원 : 전산학과,
서지주기 참고문헌 : p. 34-36
QR CODE

책소개

전체보기

목차

전체보기

이 주제의 인기대출도서