한국과학기술원 도서관

서지주요정보
클러스터링 기반 유사 정보 검색 시스템 원형 개발 = A prototype implementation for approximate information retrieval system based on clustering technique
서명 / 저자	클러스터링 기반 유사 정보 검색 시스템 원형 개발 = A prototype implementation for approximate information retrieval system based on clustering technique / 안지현.
발행사항	[대전 : 한국과학기술원, 2002].
Online Access	원문보기 원문인쇄

소장정보

등록번호

8013481

소장위치/청구기호

학술문화관(문화관) 보존서고

MGSM 02124

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

등록번호

9008696

소장위치/청구기호

서울 학위논문 서가

MGSM 02124 c. 2

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

This paper presents a prototype for an approximate information retrieval system based on clustering technique. The approximate information retrieval system is assisting a user to browse searching results which are based on his/her queries as showing similar information that he/she selected a suitable document for his/her purpose from the searching results. It has two advantages; the one is it reflects user's relevance feedback about search results and the other is it searches information based on a document that has much more useful keywords than user's short queries. As it uses a document's whole keywords to seek information, it usually fmds too large documents so that are not related with a user's searching purposes. To solve those problems we discuss the prototype which is based on word's co-occurrence information and clustering technique. Word's co-occurrence information is helpful for settling problems due to homonyms and synonyms in documents and clustering technique can give user much compact and fine searching results. An empirical study was conducted using newspaper articles with 194 entries which are classified by experts. The result is that this prototype achieves considerable improvement of precision while preserving the recall and reduction of retrieval size over a traditional approximate information system based on single keyword indexing. In further research, I will apply standard corpus sets to generalize the prototype. In addition, I will compare the expenses of managing co-occurrence word pairs and making clustering and the benefits of improvement of precision and reduction of retrieval size.

서지기타정보

서지기타정보
청구기호	{MGSM 02124
형태사항	v, 56 p. : 삽화 ; 26 cm
언어	한국어
일반주기	저자명의 영문표기 : Ji-Hyun Ahn 지도교수의 한글표기 : 허순영 지도교수의 영문표기 : Soon-Young Huh
학위논문	학위논문(석사) - 한국과학기술원 : 경영공학전공,
서지주기	참고문헌 : p. 53-56

QR CODE

책소개

전체보기

나의 도서관정보

메뉴

소장정보

리뷰정보

초록정보

서지기타정보

책소개

목차

이 주제의 인기대출도서