KAIST (Korea Advanced Institute of Science and Technology) Library

Main bibliographic information
Understanding the use of emojis in hate speech : a case study of its detection and restoration on Twitch.tv = 혐오 표현에서 이모지 사용에 대한 이해 : Twitch.tv에서 탐지 및 복원 사례를 중심으로
Title / Author	Understanding the use of emojis in hate speech : a case study of its detection and restoration on Twitch.tv = 혐오 표현에서 이모지 사용에 대한 이해 : Twitch.tv에서 탐지 및 복원 사례를 중심으로 / Jaeheon Kim.
Publication	[Daejeon : Korea Advanced Institute of Science and Technology, 2020].

Holdings information
Registration number	8035987
Location / Call number	Academic Cultural Complex (Cultural Hall), preservation stacks / MCS 20009
Status	Available (not for loan)

Abstract

The latest advances in NLP (natural language processing) have led to the launch of much-needed machine-driven hate speech detection. Nevertheless, people continuously find new forms of hateful expression that are easily identified by humans but not by machines. One such expression is the mix of text and emojis, a type of visual hate speech that is increasingly used to evade algorithmic moderation. This research analyzes chat conversations from the popular streaming platform Twitch to understand the varied types of visual hate speech. Emotes were used to replace a letter, to seek attention, or to express emotion. We created a labeled dataset that contains 29,721 cases of emotes replacing letters. Based on this dataset, we built a neural network classifier that identifies visual hate speech that would otherwise go undetected by traditional methods, catching an additional 1.3% of hate speech examples out of 15 million chat utterances.

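Advances in natural language processing have made it possible to detect hate speech through machine learning. However, users continually find new forms of hateful expression that are easily detected by humans but not by machines. One such form is the mix of text and emojis, a type of visual hate speech that is increasingly used to evade algorithmic monitoring. This study analyzes chat conversations on Twitch, a popular online streaming platform, to understand the varied types of visual hate speech. The analysis confirms that emojis are used to replace letters, to attract attention, or to express emotion. We also collected a dataset of 29,721 cases in which emojis replace letters and propose a language model based on it. The model identifies visual hate speech that existing methods did not find, additionally detecting about 1.3% of hate speech among 15 million chat messages.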

Other bibliographic information
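Call number	MCS 20009
Physical description	iv, 26 p. : illustrations ; 30 cm
Language	English
General note	Author name in Korean: 김재헌. Advisor name in English: Meeyoung Cha. Advisor name in Korean: 차미영.
Thesis	Thesis (Master's) - Korea Advanced Institute of Science and Technology : School of Computing,
Bibliography note	References : p. 21-24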
