政大機構典藏-National Chengchi University Institutional Repository(NCCUR):Item 140.119/37101
English  |  正體中文  |  简体中文  |  Post-Print筆數 : 27 |  全文笔数/总笔数 : 113648/144635 (79%)
造访人次 : 51664153      在线人数 : 580
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜寻范围 查询小技巧:
  • 您可在西文检索词汇前后加上"双引号",以获取较精准的检索结果
  • 若欲以作者姓名搜寻,建议至进阶搜寻限定作者字段,可获得较完整数据
  • 进阶搜寻
    政大機構典藏 > 資訊學院 > 資訊科學系 > 學位論文 >  Item 140.119/37101


    请使用永久网址来引用或连结此文件: https://nccur.lib.nccu.edu.tw/handle/140.119/37101


    题名: 基於注意力與多模式分析之 數位相片管理系統設計與實作
    Design and implementation of a multi-modal attention-based photo manager
    作者: 孫新民
    贡献者: 廖文宏
    孫新民
    关键词: 電腦視覺
    多模式
    影像處理
    人工智慧
    日期: 2004
    上传时间: 2009-09-19 12:09:23 (UTC+8)
    摘要: 本論文敘述對於智慧型個人數位相片管理瀏覽平台之研究、設計與實作過程。系統設計上基於整合多重證據架構,採用影像內容與使用者瀏覽行為之分析作為自動分類,判斷影像重要性與推薦程度的依據。影像自動分類方面,包括外部給予的標準資訊-EXIF資訊與分析影像內容,以其中人物存在數量與面積比例為依據的影像分類。而在影像的推薦方面,則採用影像品質之分析-包括對焦品質分析、曝光品質分析-與分析使用者瀏覽相片時的行為-包括停留時間與專注程度的整合為分析重要程度依據;最後則採用多模式(Multi-Modal)架構整合不同的評估結果並作為推薦的結論。
    In this thesis, we present the design and implementation of an intelligent personal digital photo browsing platform. The proposed system relies on multiple evidences inferred from image content as well as user behavior. Specifically, external EXIF data and face detection results are utilized to coarsely classify the digital images. Measures of image quality, including clarity and contrast, are calculated to further refine the search result. Moreover, we use web cameras to record and analyze the viewing behavior of the user and attempt to correlate the interest of the viewer to the effective viewing time. Finally, a multi-modal system is put in place to integrate the clues acquired from different modules.
    參考文獻: 【1】 Richard Shim.,「影像左右快閃記憶卡命運」,CNET新聞專區,2004年,http://taiwan.cnet.com/news/ce/0,2000062982,20087086,00.htm
    【2】 Kerry Rodden, Kenneth R. Wood. 2003. How Do People Manage Their Digital Photographs? CHI 2003: NEW HORIZONS. Volume No. 5, Issue No. 1
    【3】 Hyunmo Kang, Ben Shneiderman.2002. Visualization Methods for Personal Photo Collections:Browsing and Searching in the PhotoFinder. Department of Computer Science, Human-Computer Interaction Laboratory
    【4】 Adobe Systems Incorporated, http://www.pacific.adobe.com/products/photoshopalbum/overview.html
    【5】 Ullas Gargi, Yining Deng, Daniel R. Tretter. 2002. Managing and Searching Personal Photo Collections. HP Laboratories Palo Alto
    【6】 Lynette Hirschman.1999. Intelligent Human-Computer Interfaces. The Edge Volume 3, Number 4
    【7】 P. Maes, T. Darrell, B. Blumberg, A. Pentland. 1995. The ALIVE system: full-body interaction with autonomous agents. Computer Animation`95 .
    【8】 許聞廉、陳克健,「自然智慧型輸入系統的語意分析─脈絡會意法」,1993年,Proceedings of the 6th International Symposium on Cognitive Aspects of the Chinese Language, (1993), 527-540.
    【9】 Japan Electronics and Information Technology Industries Association . Exchangeable image file format for digital still cameras : Exif Version 2.2
    【10】 TsuruZohTachibanaya..Description of Exif file format. 2001. http://park2.wakwak.com/~tsuruzoh/Computer/Digicams/exif-e.html#AboutExif
    【11】 Stuart Russell ,Peter Norvig. 2002. Artificial Intelligence: A Modern Approach Second Edition. Prentice Hall.
    【12】 Sanjay Kr. Singh, D. S. Chauhan, Mayank Vatsa, Richa Singh. 2003. A Robust Skin Color Based Face Detection Algorithm. Tamkang Journal of Science and Engineering, Vol. 6, No. 4, pp. 227-234
    【13】 Y. Gong and M. Sakauchi, "Detection of regions matching specified chromatic features", Computer Vision and Image Understanding, vol. 61, no. 2, 1995, pp 263 - 269
    【14】 Goldennumer.Net, “The human face is based entirely on Phi”, http://www.goldennumber.net/face.htm
    【15】 Zhou Wang, Alan C. Bovik, 2002 “WHY IS IMAGE QUALITY ASSESSMENT SO DIFFICULT?”, IEEE International Conference on Acoustics, Speech, & Signal Processing
    【16】 Zhou Wang, Alan C. Bovik. 2002. A Universal Image Quality Index. IEEE Signal Processing Letters, vol. 9, no. 3, pp. 81-84
    【17】 Norbert Wiener. 1942. Extrapolation, Interpolation, and Smoothing of Stationary Time Series. MIT Express
    【18】 Claude E. Shannon. 1948 . A Mathematical Theory of Communication. Bell System Technical Journal, vol. 27, pp. 379-423 and 623-656
    【19】 Jiawei Han, Micheline Kamber. 2001. Data Mining: Concepts and Techniques
    【20】 Gordon S. Linoff, Michael J. A. Berry, Michael J. A. Berry . 2001. Mining the Web: Transforming Customer Data.
    【21】 Paul Viola, Michael Jones. 2001. Rapid Object Detection using a Boosted Cascade of Simple Features. Proceedings IEEE Conf. on Computer Vision and Pattern Recognition
    【22】 E.S. Bigun, J.Bigün, B. Duc, S. Fischer. 1997. Expert conciliation for multi modal person authentication systems by Bayesian statistics, Audio and Video based Person Authentication - AVBPA97
    【23】 P. Verlinde, G. Chollet, and M. Acheroy. 2000. Multi-modal identity verification using expert fusion. Information Fusion, 1:17--33
    【24】 Conrad Sanderson, 2002, “Information fusion and person verification using speech & face information”, IDIAP–RR 02-33
    【25】 Arun Ross, Anil Jain, Jian-Zhong Qian. 2001. Information Fusion in Biometrics. Lecture Notes in Computer Science
    【26】 Metropolis,N., A. Rosenbluth, M. Rosenbluth, A. Teller, E. Teller, 1953,"Equation of State Calculations by Fast Computing Machines", J. Chem. Phys.,21, 6, 1087-1092,
    描述: 碩士
    國立政治大學
    資訊科學學系
    90753012
    93
    資料來源: http://thesis.lib.nccu.edu.tw/record/#G0090753012
    数据类型: thesis
    显示于类别:[資訊科學系] 學位論文

    文件中的档案:

    档案 描述 大小格式浏览次数
    75301201.pdf96KbAdobe PDF2727检视/开启
    75301202.pdf103KbAdobe PDF2789检视/开启
    75301203.pdf101KbAdobe PDF2748检视/开启
    75301204.pdf60KbAdobe PDF2655检视/开启
    75301205.pdf118KbAdobe PDF2839检视/开启
    75301206.pdf118KbAdobe PDF2843检视/开启
    75301207.pdf95KbAdobe PDF2756检视/开启
    75301208.pdf93KbAdobe PDF2759检视/开启
    75301209.pdf1296KbAdobe PDF2958检视/开启
    75301210.pdf1314KbAdobe PDF22400检视/开启
    75301211.pdf1672KbAdobe PDF23326检视/开启
    75301212.pdf406KbAdobe PDF21475检视/开启
    75301213.pdf301KbAdobe PDF21162检视/开启
    75301214.pdf1357KbAdobe PDF2937检视/开启
    75301215.pdf808KbAdobe PDF2914检视/开启
    75301216.pdf159KbAdobe PDF2764检视/开启


    在政大典藏中所有的数据项都受到原著作权保护.


    社群 sharing

    著作權政策宣告 Copyright Announcement
    1.本網站之數位內容為國立政治大學所收錄之機構典藏,無償提供學術研究與公眾教育等公益性使用,惟仍請適度,合理使用本網站之內容,以尊重著作權人之權益。商業上之利用,則請先取得著作權人之授權。
    The digital content of this website is part of National Chengchi University Institutional Repository. It provides free access to academic research and public education for non-commercial use. Please utilize it in a proper and reasonable manner and respect the rights of copyright owners. For commercial use, please obtain authorization from the copyright owner in advance.

    2.本網站之製作,已盡力防止侵害著作權人之權益,如仍發現本網站之數位內容有侵害著作權人權益情事者,請權利人通知本網站維護人員(nccur@nccu.edu.tw),維護人員將立即採取移除該數位著作等補救措施。
    NCCU Institutional Repository is made to protect the interests of copyright owners. If you believe that any material on the website infringes copyright, please contact our staff(nccur@nccu.edu.tw). We will remove the work from the repository and investigate your claim.
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 回馈