    Please use this identifier to cite or link to this item: https://nccur.lib.nccu.edu.tw/handle/140.119/55134


    Title: Segmentation and recognition of mixed characters for cloud-based notes
    (original title: 雲端筆記之混合式文字切割與辨識)
    Authors: 王冠智
    Wang, Guan Jhih
    Contributors: 廖文宏
    Liao, Wen Hung
    王冠智
    Wang, Guan Jhih
    Keywords: stroke filter
    text font discrimination
    text segmentation
    noise removal
    Date: 2011
    Issue Date: 2012-11-01 13:58:31 (UTC+8)
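    Abstract: Character recognition is one of the most common applications of computer vision, and as its accuracy has gradually improved, many new services have appeared. This thesis addresses the main problem of note management software, text segmentation, and proposes two new methods for classifying printed and handwritten Chinese characters. We first filter out the emphasis marks commonly found in note documents, then apply a stroke filter with a new kernel to obtain the text blocks in the document; the new kernel greatly reduces the computation time compared with the original kernel. The thesis also uses the stroke filter response as a feature for distinguishing printed from handwritten text: because the stroke filter returns an energy response that depends on stroke structure, the more regular printed characters can be told apart from handwritten ones. In addition, Sobel responses over different angle ranges are used for font classification. Experimental results confirm that the proposed text segmentation and font classification methods are effective for processing note documents.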
    Character recognition is an important and practical application of computer vision. With the advance of this technology, more and more services embedding text recognition functionality have become available. However, segmentation is still the central issue in many situations. In this thesis, we tackle the character segmentation problem in note-taking and management applications. We propose novel methods for the discrimination of handwritten and machine-printed Chinese characters. First, we perform noise removal using heuristics and apply a stroke filter with modified kernels to efficiently compute the bounding box for the text area. The responses of the stroke filter also serve as clues for differentiating machine-printed and handwritten texts. These clues are further enhanced using an SVM-based classifier that employs aggregated directional responses of edge detectors as input. Experimental results have validated the efficacy of the proposed approaches in terms of text localization and style recognition.
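As a rough illustration of what "aggregated directional responses of edge detectors" could look like, the sketch below accumulates Sobel gradient magnitude into a small number of orientation ranges. It is not the thesis's implementation: the function name `directional_histogram`, the choice of four bins, the bin centring, and the normalization are all assumptions made for this example. A feature vector of this kind could then be passed to an SVM, which is not shown here.

```python
import math

# 3x3 Sobel kernels for horizontal (x) and vertical (y) intensity change.
SOBEL_X = ((-1, 0, 1), (-2, 0, 2), (-1, 0, 1))
SOBEL_Y = ((-1, -2, -1), (0, 0, 0), (1, 2, 1))


def directional_histogram(image, bins=4, min_magnitude=1e-6):
    """Aggregate Sobel gradient magnitude into `bins` orientation ranges.

    `image` is a list of equal-length rows of grayscale values.
    Gradient orientations are folded into [0, 180) degrees, so opposite
    gradient directions fall into the same range. The result is
    normalized to sum to 1, or is all zeros for a flat image.
    (Hypothetical helper: bin count and layout are assumptions.)
    """
    height, width = len(image), len(image[0])
    hist = [0.0] * bins
    bin_width = 180.0 / bins
    # Skip the one-pixel border, where the 3x3 window does not fit.
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = gy = 0.0
            for dy in range(3):
                for dx in range(3):
                    pixel = image[y + dy - 1][x + dx - 1]
                    gx += SOBEL_X[dy][dx] * pixel
                    gy += SOBEL_Y[dy][dx] * pixel
            magnitude = math.hypot(gx, gy)
            if magnitude < min_magnitude:
                continue  # no edge at this pixel
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            # Centre the ranges on multiples of bin_width
            # (0, 45, 90, 135 degrees when bins=4).
            index = int(((angle + bin_width / 2) % 180.0) // bin_width)
            hist[index] += magnitude
    total = sum(hist)
    return [h / total for h in hist] if total else hist
```

For example, an image whose left half is dark and right half is bright has a purely horizontal gradient, so all of the weight lands in the first range. Under this sketch, one would expect regular printed strokes to concentrate weight in a few ranges and handwriting to spread it more evenly, but that is an assumption rather than a result reported in the record.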
    Description: Master's thesis
    National Chengchi University
    Department of Computer Science
    99753003
    100
    Source URI: http://thesis.lib.nccu.edu.tw/record/#G0099753003
    Data Type: thesis
    Appears in Collections: [Department of Computer Science] Theses

    Files in This Item:

    File        Size     Format
    300301.pdf  7694Kb   Adobe PDF

