In this thesis, we present a new algorithm based on the connected component for segmentation and recognition of document information. The segmentation module classifies document information into texts and graphics using the size of connected components and the horizontal-vertical join algorithm. The recognition module separates each word into characters using the connected component recognition algorithm and recognizes the characters using the direction contributivity.
We improve the recognition rates and speed of Korean character using the Korean radical recognition.