Loading...
|
Please use this identifier to cite or link to this item:
https://nccur.lib.nccu.edu.tw/handle/140.119/37101
|
Title: | 基於注意力與多模式分析之 數位相片管理系統設計與實作 Design and implementation of a multi-modal attention-based photo manager |
Authors: | 孫新民 |
Contributors: | 廖文宏 孫新民 |
Keywords: | 電腦視覺 多模式 影像處理 人工智慧 |
Date: | 2004 |
Issue Date: | 2009-09-19 12:09:23 (UTC+8) |
Abstract: | 本論文敘述對於智慧型個人數位相片管理瀏覽平台之研究、設計與實作過程。系統設計上基於整合多重證據架構,採用影像內容與使用者瀏覽行為之分析作為自動分類,判斷影像重要性與推薦程度的依據。影像自動分類方面,包括外部給予的標準資訊-EXIF資訊與分析影像內容,以其中人物存在數量與面積比例為依據的影像分類。而在影像的推薦方面,則採用影像品質之分析-包括對焦品質分析、曝光品質分析-與分析使用者瀏覽相片時的行為-包括停留時間與專注程度的整合為分析重要程度依據;最後則採用多模式(Multi-Modal)架構整合不同的評估結果並作為推薦的結論。 In this thesis, we present the design and implementation of an intelligent personal digital photo browsing platform. The proposed system relies on multiple evidences inferred from image content as well as user behavior. Specifically, external EXIF data and face detection results are utilized to coarsely classify the digital images. Measures of image quality, including clarity and contrast, are calculated to further refine the search result. Moreover, we use web cameras to record and analyze the viewing behavior of the user and attempt to correlate the interest of the viewer to the effective viewing time. Finally, a multi-modal system is put in place to integrate the clues acquired from different modules. |
Reference: | 【1】 Richard Shim.,「影像左右快閃記憶卡命運」,CNET新聞專區,2004年,http://taiwan.cnet.com/news/ce/0,2000062982,20087086,00.htm 【2】 Kerry Rodden, Kenneth R. Wood. 2003. How Do People Manage Their Digital Photographs? CHI 2003: NEW HORIZONS. Volume No. 5, Issue No. 1 【3】 Hyunmo Kang, Ben Shneiderman.2002. Visualization Methods for Personal Photo Collections:Browsing and Searching in the PhotoFinder. Department of Computer Science, Human-Computer Interaction Laboratory 【4】 Adobe Systems Incorporated, http://www.pacific.adobe.com/products/photoshopalbum/overview.html 【5】 Ullas Gargi, Yining Deng, Daniel R. Tretter. 2002. Managing and Searching Personal Photo Collections. HP Laboratories Palo Alto 【6】 Lynette Hirschman.1999. Intelligent Human-Computer Interfaces. The Edge Volume 3, Number 4 【7】 P. Maes, T. Darrell, B. Blumberg, A. Pentland. 1995. The ALIVE system: full-body interaction with autonomous agents. Computer Animation`95 . 【8】 許聞廉、陳克健,「自然智慧型輸入系統的語意分析─脈絡會意法」,1993年,Proceedings of the 6th International Symposium on Cognitive Aspects of the Chinese Language, (1993), 527-540. 【9】 Japan Electronics and Information Technology Industries Association . Exchangeable image file format for digital still cameras : Exif Version 2.2 【10】 TsuruZohTachibanaya..Description of Exif file format. 2001. http://park2.wakwak.com/~tsuruzoh/Computer/Digicams/exif-e.html#AboutExif 【11】 Stuart Russell ,Peter Norvig. 2002. Artificial Intelligence: A Modern Approach Second Edition. Prentice Hall. 【12】 Sanjay Kr. Singh, D. S. Chauhan, Mayank Vatsa, Richa Singh. 2003. A Robust Skin Color Based Face Detection Algorithm. Tamkang Journal of Science and Engineering, Vol. 6, No. 4, pp. 227-234 【13】 Y. Gong and M. Sakauchi, "Detection of regions matching specified chromatic features", Computer Vision and Image Understanding, vol. 61, no. 2, 1995, pp 263 - 269 【14】 Goldennumer.Net, “The human face is based entirely on Phi”, http://www.goldennumber.net/face.htm 【15】 Zhou Wang, Alan C. Bovik, 2002 “WHY IS IMAGE QUALITY ASSESSMENT SO DIFFICULT?”, IEEE International Conference on Acoustics, Speech, & Signal Processing 【16】 Zhou Wang, Alan C. Bovik. 2002. A Universal Image Quality Index. IEEE Signal Processing Letters, vol. 9, no. 3, pp. 81-84 【17】 Norbert Wiener. 1942. Extrapolation, Interpolation, and Smoothing of Stationary Time Series. MIT Express 【18】 Claude E. Shannon. 1948 . A Mathematical Theory of Communication. Bell System Technical Journal, vol. 27, pp. 379-423 and 623-656 【19】 Jiawei Han, Micheline Kamber. 2001. Data Mining: Concepts and Techniques 【20】 Gordon S. Linoff, Michael J. A. Berry, Michael J. A. Berry . 2001. Mining the Web: Transforming Customer Data. 【21】 Paul Viola, Michael Jones. 2001. Rapid Object Detection using a Boosted Cascade of Simple Features. Proceedings IEEE Conf. on Computer Vision and Pattern Recognition 【22】 E.S. Bigun, J.Bigün, B. Duc, S. Fischer. 1997. Expert conciliation for multi modal person authentication systems by Bayesian statistics, Audio and Video based Person Authentication - AVBPA97 【23】 P. Verlinde, G. Chollet, and M. Acheroy. 2000. Multi-modal identity verification using expert fusion. Information Fusion, 1:17--33 【24】 Conrad Sanderson, 2002, “Information fusion and person verification using speech & face information”, IDIAP–RR 02-33 【25】 Arun Ross, Anil Jain, Jian-Zhong Qian. 2001. Information Fusion in Biometrics. Lecture Notes in Computer Science 【26】 Metropolis,N., A. Rosenbluth, M. Rosenbluth, A. Teller, E. Teller, 1953,"Equation of State Calculations by Fast Computing Machines", J. Chem. Phys.,21, 6, 1087-1092, |
Description: | 碩士 國立政治大學 資訊科學學系 90753012 93 |
Source URI: | http://thesis.lib.nccu.edu.tw/record/#G0090753012 |
Data Type: | thesis |
Appears in Collections: | [資訊科學系] 學位論文
|
All items in 政大典藏 are protected by copyright, with all rights reserved.
|