한국과학기술원 도서관

서지주요정보
Creating knowledge maps for learning using text mining = 텍스트 마이닝 기법을 이용한 학습 지식맵의 생성
서명 / 저자	Creating knowledge maps for learning using text mining = 텍스트 마이닝 기법을 이용한 학습 지식맵의 생성 / Jae-Hwa Lee.
발행사항	[대전 : 한국과학기술원, 2011].
Online Access	원문보기 원문인쇄

소장정보

등록번호

8022801

소장위치/청구기호

학술문화관(문화관) 보존서고

MKSE 11004

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

Maps such as concept maps and knowledge maps are often used as learning materials. These node-link tools (or maps) have nodes and links, nodes as key concepts, links as relationship between key concepts. From a map, user can recognize what are important concepts and what relationships exist between them. Node-link tools have some advantages over texts. First, minimum texts are used in maps, so that users can easily search for words, phrases, and ideas.[34] Another advantage is that learners learning from maps can have several learning paths, unlike learning from texts which only can have one path of learning, either top-to-bottom or left-to-right.[22] However, there are also concerns of using maps. One is if a map is big, users of the map can be overwhelmed by the size of it, map shock.[8] Another is that maps can have poor visual configuration when they fail to use node proximity to signal semantic similarity.[42] In order to build concept or knowledge maps, domain experts are needed who understand the topic well. It costs very much, and it is hardly possible for people to have one when they need. In this study, a trial of automatically building a domain knowledge map was made with text mining techniques. From a set of documents which are about a specific topic, keywords were extracted using tf/idf algorithm.[38] For every pair of keywords, the number of appearance in a sentence and the number of words in a sentence were considered to calculate relation rankings. Based on the two information, a domain knowledge map (K-map) was built. K-map is a domain knowledge map without labels on links. If it had labels for links, it would look much like a concept map. However, extracting words for those labels are not covered in this study, and left as future work. Instead of putting labels for relations, K-map shows all the sentences that contain the two keywords which are placed at both ends of the relation chosen. Two experiments were conducted with K-map. One is extraction rate of sentences. Each sentence has different role from others. Some sentences considered to be more important than others for the understanding the topic. So three graders who completely under-stand the contents gave each sentence a score, and a score of a sentence was decided to the score that majority of the graders gave, ranging from 0 to 2 : 0 means not helpful for the understanding of the topic, 1 means ideas that support main ideas, and 2 means main ideas. The result shows that extraction rate of score 1, 2 sentences were higher than score 0 sentences, which means K-map filters sentences that are not important to the topic. The second experiment was learning performance test. 34 participants were divided into two groups, one group was called document group, the other was called map group. Learning materials were 3 documents about American scientist ‘Carl Sagan’. Document group read the original documents, map group learnt with K-map. After 8 minutes, they were asked to write down what they learnt as much as possible on the open text form. To calculate scores, the modification of Dansereau’s scoring technique was used.[17] Result shows that there was no difference between the amount of extracted information of map group and document group. However, after conducting same experiment as experiment 1, we found that map group had more ‘important information’ than document group did. Other than that, K-map has several benefits. User can view holistic picture of the domain by seeing the map, he also can recognize the important concepts, relations. Since user can select a specific relation on the map, he can selectively learn a specific part of the domain, also he can get a direct access to the original document by choosing a sentence, which is believe to increase search efficiency. Even though K-map has some issues such as repetition of the same sentence, accuracy of keyword extraction algorithm, and complexity of links, K-map shows the possibility of automatic generation of domain knowledge maps and its performance in learning.

컨셉 맵이 Novak에 의해 개발 된 이후로 컨셉 맵, 지식 맵 등의 맵 형태의 지식 표현은 학습에 있어 일반적인 텍스트 기반의 문서보다 여러 측면에서 장점을 가진다는 연구 결과가 있어 왔다. 몇 가지 장점을 들면, 문서 기반의 학습은 글을 차례대로 따라 읽어가야 한다는 점에서 학습 경로를 하나만 가지는 반면에 맵은 여러 개의 학습 경로를 가질 수 있는 점, 최소화 된 글자 수를 사용함으로써 사용자가 특정 용어를 찾는다거나 훑어볼 때 용의한 점, 학습자가 학습 후 핵심 내용에 대해서 더 기억해 낼 수 있도록 도와 준다는 점 등이 있다. 이러한 맵이 교육 목적으로 사용되기 위해서는 일반적으로 해당 주제 또는 도메인의 전문가가 직접 작성을 해야 하는데, 최근 컨셉맵을 자동으로 구축하는 데에 관련된 연구들이 시작되었다. 수업 과정에 있어서의 필수 개념들 간에 어떤 관련이 있는 지를 과거 학생들로부터 나온 테스트 결과로 찾아내는 연구, 학술 논문을 대상으로 컨셉 맵을 구축하는 연구, 단어 간의 Co-occurrence 정보를 바탕으로 학습시스템에서 학습 개념들 간의 관계를 연결 지어주는 연구 등이 있다. 맵의 자동 생성에 대한 기존의 연구들은 적용 범위를 특정 분야로 제한 시키거나, 맵 구축의 정확도를 검증하는 실험에서 그쳤으며, 자동 생성된 맵이 학습에서 어떤 효과를 가져오는지에 대한 연구가 아직까지 없었다. 본 연구에서는 기존의 방법과는 다른 방법으로 도메인 지식 맵(K-map)을 구축하는 모델, K-map이 활용되는 환경인 K-map Tools에 대해 소개한다. K-map이 구축되는 단계는 크게 세 단계로 나누어지며, 키워드의 추출, 키워드 간의 관계 추출, 관계의 레이블링이 그것이다. 키워드 추출 알고리즘으로 단어의 웨이팅을 계산하는 방법으로 널리 알려진 Tf/Idf 알고리즘이 사용되었으며, 키워드가 추출된 후 그 사이의 관계의 웨이팅을 매기기 위해 단어 쌍의 문장 내 출현 정도, 출현 시 해당 문장의 단어 개수를 고려하였다. 관계를 대표하는 단어의 추출 방법으로 현재는 단어의 빈도만을 고려하여 최빈 단어를 사용하는데 이 부분에 있어서는 기초단계이므로 추가적인 연구가 필요하다. 이렇게 형성된 K-map을 두 가지 주제에 대해 실험을 한 결과, K-map은 문서 내 문장 중에서 더 중요한 문장들을 상대적으로 덜 중요한 문장보다 더 많이 추출한다는 사실이 밝혀졌다. 이어서 이루어진 학습 성취 실험에서는 34명을 대상으로 두 그룹(문서 기반의 학습 그룹과 K-map기반의 학습 그룹)으로 나누어 주제에 대해8분간 학습을 한 후, 학습 내용을 적도록 하였다. 그 결과, 두 그룹 사이에 학습된 정보의 양은 차이가 없으나 맵 그룹이 문서 그룹보다 중요한 정보를 더 많이 가지고 있다는 사실을 발견했다. 학습 성취 측면 이외에, K-map의 사용자들은 맵의 키워드와 문장들을 바탕으로 학습 중에 원하는 문서를 찾아낼 수 있는 검색 상의 이점이 있으며, 이는 기존의 검색 엔진을 바탕으로 검색 후 학습의 반복적인 과정에서 오는 시간 측면의 비효율을 줄일 수 있을 것으로 예상된다. 또한 기존의 텍스트 기반의 학습으로는 불가능한 이점으로, K-map 기반의 학습에서는 사용자가 스스로 원하는 관계(relation)를 선택하여 학습할 수 있게 함으로써 문서 내에서도 선별적 학습이 가능하게 해준다. 이와 같이 K-map은 검색/학습 작업에서 효율적인 측면을 보여주고 있으나 실험 참가자들과의 인터뷰를 통해 드러난 인터페이스 상의 문제, 같은 문장의 반복 출현, 키워드 추출 상의 정확도 문제 등을 여전히 가지고 있다. 특히 키워드를 두 개 이상 가지고 있는 문장이 추출 대상이 되는 K-map의 특성 상 키워드를 하나만 가지고 있어 놓치게 되는 문장들을 어떻게 추출할 것인가에 대한 문제가 추후 연구에서 다뤄질 것이다.

서지기타정보

서지기타정보
청구기호	{MKSE 11004
형태사항	vi, 41 p. : 삽화 ; 26 cm
언어	영어
일반주기	저자명의 한글표기 : 이재화 지도교수의 영문표기 : Aviv Segev 지도교수의 한글표기 : 아빕 세게브
학위논문	학위논문(석사) - 한국과학기술원 : 지식서비스공학과,
서지주기	References : p. 33-37

QR CODE

책소개

전체보기

나의 도서관정보

메뉴

소장정보

리뷰정보

초록정보

서지기타정보

책소개

목차

이 주제의 인기대출도서