한국과학기술원 도서관

서지주요정보
Cross-domain recommendation through implicit extraction of psychological factors = 심리적 요소의 내재적 추출을 통한 크로스 추천시스템
서명 / 저자	Cross-domain recommendation through implicit extraction of psychological factors = 심리적 요소의 내재적 추출을 통한 크로스 추천시스템 / Francisco Javier Salgado.
발행사항	[대전 : 한국과학기술원, 2016].
Online Access	원문보기 원문인쇄

소장정보

등록번호

8029267

소장위치/청구기호

학술문화관(문화관) 보존서고

MKSE 16009

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

Internet has become one the main sources of information for a majority of people. The problem of information overload affects the capacity of processing and discerning useful knowledge from the received information. Recommender systems were created with the objective of facilitating and supporting the process of decision making in online environments. These systems help in a technical way by bringing up the right knowledge in an effective and efficient manner. However, psychological aspects must be considered as well in order to design human-computer interfaces that enhance user’s trust in the system. Traditional recommender systems are built up exploiting the existing similarities in behavior be-tween users, or by clustering and suggesting items that share certain common attributes. In most applications, hybrid solutions are a common implementation, where a mix of techniques is employed. To center the focus on the user, these systems are further improved taking contextual, situational, demographic data as well as personality facts. This information can be extracted in an explicit way, via questionnaires or asking customers to establish their preferences and personal data. On the other hand, obtaining implicit, previously unknown and useful information through data mining and machine learning techniques is a method widely used nowadays. Modern e-commerce sites do not focus only in a certain market, but encourages customers to buy products from different domains. Customers also express their opinions on diverse social media and use diverse providers in order to satisfy their needs. Cross recommender systems try to address both issues taking advantage of the relationship between domains and their current users. At the same time, help to tackle the so called “cold start” problem, when a new user or a new item appears but there is not enough information to apply traditional techniques. In this study, a hybrid system solution is offered for users of a previous domain getting recommendations in a sparse new domain. The idea is to leverage preferences by aggregating implicit information from unstructured data using text mining techniques over reviews. Polarity sentiment over items was discerned using NLTK (Natural Language Tool Kit), a tool that uses semantic analysis to extract subjective sentiments about customer’s opinions. Psychological traits were extracted from the text by using LIWC (Language In-quiry and Word Count), and transformed into domains described by the Five Factor Model. The values from the Five Factor Model were loaded into feature vectors in order to be used by the system. For the final solution data from books as a source domain and movies as a target domain were used. Firstly, in order to extract psychological factors only those customers with a certain quantity of overall total words could be used. This restriction was alleviated by the fact that certain users reviewed items in both domains. Those users were selected as the neighbor, their psychological factors extracted and preferences created. Secondly, for neighbor user’s reviews, item genres were crawled, and item ratings were predicted using user similarity within the same genre. The solution was tested using 10 cross fold validation and compared with a baseline K-NN collaborative filtering algorithm.

인터넷은 다수의 사람들에게 정보의 중요한 원천이 되고 있다. 정보 과부화의 문제는 정보로부터 유용한 지식을 포착하고 만드는 능력에 영향을 끼친다. 추천 시스템은 온라인 환경에서 인간의 의사결정이 용이하도록 지원한다. 이러한 시스템은 기술적인 방법을 통해 적절한 지식을 효과적이면서도 효율적으로 떠오르게 돕는다. 그러나 추천 시스템에서 사용자의 신뢰성을 강화시키는 인간-컴퓨터 인터페이스를 설계하기 위해서는 심리학적 측면 또한 반드시 고려되어야 한다. 전통적인 추천 시스템은 사용자들 사이의 행동 유사성을 이용하거나, 군집화 그리고 특정한 속성들을 공유하는 상품을 제안하는 방식으로 만들어졌다. 대부분의 적용으로는 해당 기술들을 섞은 하이브리드 솔루션이 일반적인 구현법이다. 사용자 측면을 강조하기 위해, 추천 시스템은 성격요인뿐만 아니라 맥락, 상황, 그리고 인구통계학적 데이터를 활용해 발전하였다. 이러한 정보는 설문지 또는 고객에게 직접 그들의 선호와 개인적 정보를 묻는 것과 같은 명시적 방법을 활용해 추출할 수 있다. 이와 반면에, 암묵적으로도 데이터마이닝, 기계학습과 같이 오늘날 광범위하게 활용되고 있는 기술을 활용하여, 이전에 알려지지 않은 유용한 정보를 획득할 수 있다. 오늘날 전자상거래 사이트는 특정한 소수의 상품만을 다루지 않고, 고객들에게 다양한 분야의 상품 구입을 권장한다. 고객들 역시 자신의 의견을 다양한 소셜 미디어에 표현하고, 자신의 니즈를 충족시키기 위해 다양한 공급자를 찾는다. 크로스 추천 시스템은 각 상품의 분야와 현재 그것들을 구매한 고객 사이의 관계를 이용하는 것을 시도한다. 이와 동시에, 새로운 사용자 또는 상품이 등장했지만 기존의 기술에 적용하기에는 충분한 정보가 없는 경우를 뜻하는 콜드 스타트라고 불리는 문제를 다루는 것을 돕는다. 본 연구에서는 새로 등장하는 스파스 도메인에서의 추천 상황에서 하이브리드 시스템 솔루션을 제안한다. 본 연구의 아이디어는 리뷰와 같은 반구조적인 데이터를 텍스트 마이닝을 활용하여 종합한 암묵적인 정보인 영향력 선호이다. 상품에 대한 감정분석을 하기 위해, 고객의 리뷰 정보에서 주관적인 감정을 추출하여 분석하는 도구인 NLTK(Natural Language Tool Kit)를 활용을 시도했다. 심리학적 특성들은 LIWC(Language In-quiry and Word Count)를 활용함으로써 텍스트로부터 추출하고 Five Factor Model로 변환하였다. Five Factor Model의 값은 시스템에서 사용하기 위해 Feature vector로 활용했다.

서지기타정보

서지기타정보
청구기호	{MKSE 16009
형태사항	vi, 59 p. : 삽화 ; 30 cm
언어	영어
일반주기	지도교수의 영문표기 : Mun Yong Yi 지도교수의 한글표기 : 이문용 Appendix : A, correlations between big five traits and LIWC categories. - B, prediction by domain. - C, positive prediction by genre. - D, average values of big five traits by genre. - E, studies relating psychological raits and online reviews
학위논문	학위논문(석사) - 한국과학기술원 : 지식서비스공학과,
서지주기	References : p. 42-47

QR CODE

책소개

전체보기

나의 도서관정보

메뉴

소장정보

리뷰정보

초록정보

서지기타정보

책소개

목차

이 주제의 인기대출도서