서지주요정보
Corpus를 기반으로 하는 한국어 술어의 양상 생성 = A corpus-based modality generation for Korean predicates
서명 / 저자 Corpus를 기반으로 하는 한국어 술어의 양상 생성 = A corpus-based modality generation for Korean predicates / 안동언.
발행사항 [대전 : 한국과학기술원, 1995].
Online Access 원문보기 원문인쇄

소장정보

등록번호

8005665

소장위치/청구기호

학술문화관(문화관) 보존서고

DCS 95004

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

등록번호

9001542

소장위치/청구기호

서울 학위논문 서가

DCS 95004 c. 2

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

This paper describes a corpus-based modality generation of a Korean synthesizer. Modalities may be expressed by modality morphemes such as auxiliary verbs and verb endings. To form a complete predicate, they are concatenated together with a main-verb stem, being arranged in the Korean-specific modality order, which is neither a linear order nor a partial order mathematically. To lexicalize a modality, the synthesizer must choose the best one among several different morpheme candidates whose meanings are very similar to one another, since each of them shows a subtle difference from the others as far as stylistic naturalness is concerned. To cope with these difficulties, a corpus-based modality generation is suggested, where a large corpus is analyzed to acquire reliable linguistic knowledge on modalities. Through the corpus analysis, firstly, auxiliary verbs are classified into modality groups according to their modal meanings and grammatical functions. Secondly, the representative for each modality group is selected mainly on the basis of frequency in the corpus. Thirdly, the corpusbased ordering relation among a set of modality groups is transformed into a partial ordering by removing some pairwise orderings, and then through the topological sorting we derive a linear modality order covering as much actual ordering information as possible. Finally, by performance evaluation, we show that the corpus-based approach may be a great help to the improvement of the conventional rule-based Korean synthesizer.

서지기타정보

서지기타정보
청구기호 {DCS 95004
형태사항 v, 74 p. : 삽화 ; 26 cm
언어 한국어
일반주기 부록 : A, 보조용언의 분포. - B, 보조 용언의 분류별 분포. - C, 두겹, 세겹, 네겹 보조용언의 분포. - D, 개선된 생성 결과
저자명의 영문표기 : Dong-Un An
지도교수의 한글표기 : 김길창
지도교수의 영문표기 : Gil-Chang Kim
학위논문 학위논문(박사) - 한국과학기술원 : 전산학과,
서지주기 참고문헌 : p. 55-62
주제 Modality (Linguistics)
Natural language processing.
Computational linguistics.
한국어. --과학기술용어시소러스
생성. --과학기술용어시소러스
양상 논리. --과학기술용어시소러스
Reproduction.
QR CODE

책소개

전체보기

목차

전체보기

이 주제의 인기대출도서