政大機構典藏-National Chengchi University Institutional Repository(NCCUR):Item 140.119/63712
English  |  正體中文  |  简体中文  |  Post-Print筆數 : 27 |  全文笔数/总笔数 : 114205/145239 (79%)
造访人次 : 52525602      在线人数 : 691
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜寻范围 查询小技巧:
  • 您可在西文检索词汇前后加上"双引号",以获取较精准的检索结果
  • 若欲以作者姓名搜寻,建议至进阶搜寻限定作者字段,可获得较完整数据
  • 进阶搜寻
    政大機構典藏 > 資訊學院 > 資訊科學系 > 學位論文 >  Item 140.119/63712


    请使用永久网址来引用或连结此文件: https://nccur.lib.nccu.edu.tw/handle/140.119/63712


    题名: 基於三元化特徵描述子之行動影像識別機制
    Image recognition on mobile devices using ternary feature descriptors
    作者: 辜致翔
    贡献者: 廖文宏
    辜致翔
    关键词: 三元化特徵描述子
    二元化特徵描述子
    行動影像辨識
    ternary feature descriptors
    binary feature descriptor
    mobile image recognition
    feature recognition
    日期: 2013
    上传时间: 2014-02-10 14:57:19 (UTC+8)
    摘要: 現在因應科技的發展,行動裝置計算能力和應用也成長快速,現行非常多應用需要結合各種不同的感應器或相機等來配合使用,然而這些裝置在不同的狀況下會有一些限制,在無法改變外在的環境下,勢必需要靠軟體去做補強或修正。
    本論文針對在有限的計算和存儲資源的移動裝置平台上進行物體的偵測與追踪。為了達到此一目標,我們提出了一個型態的圖像特徵,稱為區域三元描述子(Local Ternary Descriptors),期望在時間複雜度、抗噪性和準確率各個面向取得一個較佳的平衡。LTD是基於區域二元描述子(Local Binary Descriptors)所衍生出來的方法,如BRIEF,BRISK,FREAK。而使用三進位制編碼方法的動機在於,三元化處理可以減輕因在LBD的簡單threshold處理過後所產生的一些問題。而類似於LBD地方在於,LTDS之間的距離可以很容易地使用Hamming distance計算。實驗數據及比較分析後證明,本論文提出的區域三元描述子可以在雜訊環境的條件下表現出優異的效果。
    The rapid advances of information and communication technology have brought about the prevalence of mobile devices. Diverse applications on smartphones have emerged accordingly. Interactive media and augmented reality are two well-known examples that utilize these devices as an interface to present digital content to the users. Effective interface design is therefore a critical factor to guarantee satisfactory user experience.
    In this thesis, we address the detection and tracking of objects on mobile platforms with limited computation and storage resources. To strike a good balance among feature complexity, noise immunity and detection rate, we propose a novel class of image feature known as local ternary descriptors (LTD). LTDs are extensions of local binary descriptors (LBD) such as BRIEF, BRISK, and FREAK. The motivation for using ternary representation lies in the observation that the ternarization process can alleviate some problems caused by simple thresholding in LBD. Similar to LBD, the distance between two LTDs can be easily computed using Hamming distance. Experimental results and comparative analysis indicate that the proposed descriptor can achieve superior performance under noisy conditions.
    參考文獻: [1] Canny, J. "A Computational Approach To Edge Detection". IEEE Trans. Pattern Analysis and Machine Intelligence 8 (6): 679–714, 1986.
    [2] Scharr, Hanno. Dissertation (in German), Optimal Operators in Digital Image Processing, 2000.
    [3] C. Harris and M. Stephens. "A combined corner and edge detector". Proceedings of the 4th Alvey Vision Conference. pp. pages 147—151, 1988.hi
    [4] S. M. Smith and J. M. Brady . "SUSAN - a new approach to low level image processing". International Journal of Computer Vision 23 (1): 45–78, 1997.
    [5] M. Trajkovic and M. Hedley. "Fast corner detection". Image and Vision Computing 16 (2): 75–87, 1998.
    [6] D. Lowe. "Distinctive Image Features from Scale-Invariant Keypoints", International Journal of Computer Vision 60 (2), 2004.
    [7] DG Lowe, Object recognition from local scale-invariant features. Computer Vision, 1999. The Proceedings of the Seventh IEEE International Conference, vol 2, 1150-1157, 1999.
    [8] H Bay, T Tuytelaars, L Van Gool. Surf: Speeded up robust features. Computer Vision–ECCV 2006.
    [9] Calonder, Michael, et al. "BRIEF: binary robust independent elementary features." Computer Vision–ECCV 2010. Springer Berlin Heidelberg, 2010. 778-792.
    [10] Rublee, E., Rabaud, V., Konolige, K., & Bradski, G. (2011, November). ORB: an efficient alternative to SIFT or SURF. In Computer Vision (ICCV), 2011 IEEE International Conference on (pp. 2564-2571). IEEE.
    [11] Leutenegger, Stefan, Margarita Chli, and Roland Y. Siegwart. "BRISK: Binary robust invariant scalable keypoints." Computer Vision (ICCV), 2011 IEEE International Conference on. IEEE, 2011.
    [12] A Alahi, R Ortiz, P Vandergheynst. FREAK: Fast Retina Keypoint. Computer Vision and Pattern Recognition (CVPR), 510 – 517, 2012
    [13] J Heinly, E Dunn, JM Frahm Comparative Evaluation of Binary Features Computer Vision–ECCV 2012
    [14] Cha, S. H., Yoon, S., & Tappert, C. C. (2005). Enhancing binary feature vector similarity measures.
    [15] Miksik, O., & Mikolajczyk, K. (2012, November). Evaluation of local detectors and descriptors for fast feature matching. In Pattern Recognition (ICPR), 2012 21st International Conference on (pp. 2681-2684). IEEE.
    [16] Mair, E., Hager, G. D., Burschka, D., Suppa, M., & Hirzinger, G. (2010). Adaptive and generic corner detection based on the accelerated segment test. In Computer Vision–ECCV 2010 (pp. 183-196). Springer Berlin Heidelberg.
    [17] K. Mikolajczyk and C. Schmid. A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2:1115–1125, 2005. 2, 5
    [18] 楊挺榮(2010) 基於延展式區域三元化徒刑之特徵描述子
    描述: 碩士
    國立政治大學
    資訊科學學系
    100753032
    102
    資料來源: http://thesis.lib.nccu.edu.tw/record/#G1007530321
    数据类型: thesis
    显示于类别:[資訊科學系] 學位論文

    文件中的档案:

    档案 描述 大小格式浏览次数
    032101.pdf4718KbAdobe PDF2734检视/开启


    在政大典藏中所有的数据项都受到原著作权保护.


    社群 sharing

    著作權政策宣告 Copyright Announcement
    1.本網站之數位內容為國立政治大學所收錄之機構典藏,無償提供學術研究與公眾教育等公益性使用,惟仍請適度,合理使用本網站之內容,以尊重著作權人之權益。商業上之利用,則請先取得著作權人之授權。
    The digital content of this website is part of National Chengchi University Institutional Repository. It provides free access to academic research and public education for non-commercial use. Please utilize it in a proper and reasonable manner and respect the rights of copyright owners. For commercial use, please obtain authorization from the copyright owner in advance.

    2.本網站之製作,已盡力防止侵害著作權人之權益,如仍發現本網站之數位內容有侵害著作權人權益情事者,請權利人通知本網站維護人員(nccur@nccu.edu.tw),維護人員將立即採取移除該數位著作等補救措施。
    NCCU Institutional Repository is made to protect the interests of copyright owners. If you believe that any material on the website infringes copyright, please contact our staff(nccur@nccu.edu.tw). We will remove the work from the repository and investigate your claim.
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 回馈