Classifying images into object or scene categories according to their content is an important topic in computer vision with many applications. In the real world, an image or an object is usually associated with rich contexts, which are important to categorization in human vision. In this thesis, we explore modeling such contexts for effective image categorization, and address the issues of defining, representing, and learning contexts in three categorization scenarios: single-label categorization, multi-label categorization, and pixel-level categorization, \textit{i.e.}, scene parsing.
We define two typical contextual relations between local features, \textit{i.e.}, a semantic conceptual relation and a spatial neighboring relation, and propose a local-feature-based Contextual Bag-of-Words (CBoW) model for single-label image categorization in the popular Bag-of-Words (BoW) representation style. The conceptual relation is learned from the similarity of the class distributions induced by the visual words corresponding to local features, and the spatial neighboring relation is learned as the confidence that neighboring visual words are relevant. Classification is performed with a support vector machine (SVM) using a kernel designed to incorporate the relational information.
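As an illustrative sketch rather than the exact formulation of the thesis, such a relational kernel can be written over BoW histograms $\mathbf{h}(x)$ with a relation matrix $\mathbf{R}$, where the entry $R_{ij}$ (a hypothetical symbol introduced here) encodes the learned conceptual or spatial affinity between visual words $i$ and $j$:
\[
K(x, y) \;=\; \mathbf{h}(x)^{\top} \mathbf{R}\, \mathbf{h}(y),
\qquad R_{ij} \ge 0,\quad \mathbf{R} \succeq 0 .
\]
Requiring $\mathbf{R}$ to be positive semidefinite guarantees that $K$ is a valid Mercer kernel, so a standard SVM solver can be used unchanged; the identity $\mathbf{R} = \mathbf{I}$ recovers the plain BoW linear kernel.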
Multi-label image categorization is more challenging than the single-label case, yet closer to real-world applications, since real-world images are usually associated with multiple labels. Conventional algorithms for multi-label image data predominantly rely on holistic image similarities, ignoring that each label essentially characterizes only a local region. Guided by the multi-label contexts provided by a collection of multi-label images, we propose Contextual Image Decomposition (CID) to obtain an optimal representation for each label of a set of multi-labeled images without explicit segmentation. The multi-label context is defined such that local label representations of the same category are similar across different images, while those of different categories are dissimilar. We formulate the decomposition as an optimization problem that minimizes the intra-label difference and simultaneously maximizes the inter-label difference of the target label representations, and propose two mathematical solutions.
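A minimal sketch of such an objective, under the assumption that each image $n$ with label set $\mathcal{Y}_n$ is decomposed into one representation $\mathbf{z}_n^c$ per label $c \in \mathcal{Y}_n$ (the notation here is illustrative, not the thesis's own), is
\[
\min_{\{\mathbf{z}_n^c\}}
\;\sum_{c}\;\sum_{\substack{m,n \,:\, c \in \mathcal{Y}_m \cap \mathcal{Y}_n}}
\big\| \mathbf{z}_m^c - \mathbf{z}_n^c \big\|^2
\;-\;
\lambda \sum_{c \neq c'}\;\sum_{m,n}
\big\| \mathbf{z}_m^c - \mathbf{z}_n^{c'} \big\|^2 ,
\]
where the first term penalizes the intra-label difference, the second rewards the inter-label difference, and $\lambda > 0$ trades off the two.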
Scene parsing, which assigns category labels to an image at the pixel level, is a core problem in computer vision. Guided by the multi-label context across images, namely that closely related segments usually have similar labels, we propose a weakly supervised scene parsing algorithm that semantically parses a collection of multi-label images. Images are segmented into patches at multiple levels, and the contextual relations among patches are discovered via sparse representation by $\ell^1$ minimization. The contextual patch labeling process is formulated as an optimization problem over a graph representation and solved by a convergent iterative method. For better performance, category models are also learned from the image set using CID and applied to the segments. The final labeling is obtained by combining all the information at the pixel level.
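As a sketch of the sparse-representation step, assuming each patch is described by a feature vector $\mathbf{x}_i$ and $\mathbf{D}_{-i}$ stacks the features of all other patches as a dictionary (again illustrative notation, not the thesis's own), the $\ell^1$ minimization takes the standard form
\[
\min_{\boldsymbol{\alpha}_i} \; \| \boldsymbol{\alpha}_i \|_1
\quad \text{s.t.} \quad
\big\| \mathbf{x}_i - \mathbf{D}_{-i}\, \boldsymbol{\alpha}_i \big\|_2 \le \varepsilon ,
\]
where the nonzero entries of $\boldsymbol{\alpha}_i$ identify the patches contextually related to patch $i$, and their magnitudes can serve as edge weights in the subsequent graph-based labeling.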
The proposed contextual modeling algorithms are extensively evaluated on different image categorization tasks with benchmark datasets. Experimental results demonstrate the importance of contexts for image categorization, and the proposed algorithms achieve state-of-the-art performance in comparison with previous methods. Furthermore, a typical application to labeling and label ranking of KAIST campus images is demonstrated.