National Chengchi University Institutional Repository (NCCUR): Item 140.119/79209


    Please use this permanent URL to cite or link to this item: https://nccur.lib.nccu.edu.tw/handle/140.119/79209


    Title: 設計與實作一個臉書粉絲頁資料抓取器
    Design and Implementation of a Facebook Fan Page Data Crawler
    Author: 鄭博元
    Cheng, Po Yuan
    Contributors: 徐國偉
    Hsu, Kuo Wei
    鄭博元
    Cheng, Po Yuan
    Keywords: Facebook
    Web Crawler
    Parallel Processing
    Date: 2015
    Uploaded: 2015-11-02 14:50:44 (UTC+8)
    Abstract: With the spread of social networking services in recent years, Facebook has become a primary social tool for many people. Many celebrities and companies have followed the trend and set up fan pages on Facebook to interact with their fans. The mutual influence between the virtual world and the real world has opened up many new research topics, and using information technology to collect data from the virtual world can help humanities scholars and social scientists explore new phenomena at the intersection of digital technology and society.
    This thesis focuses on Facebook fan pages. We design and build a fan page data crawler to help scholars study and analyze fan page news feed data. The system lets researchers search for relevant fan pages and lists the results ordered by number of likes, which helps them pick popular pages. Researchers can then crawl the data of the fan pages they select; the crawled data is parsed into post messages, comment messages, and like messages, and the results are stored in a database. For fan pages that have already been crawled, the system automatically updates the stored data to the latest version on a timer.
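The parse-and-store step described in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: the feed layout, the field names, the table schema, and the functions `rank_pages`, `split_feed`, and `store` are all assumptions made for illustration, and SQLite merely stands in for whatever database the author used.

```python
import sqlite3

# Hypothetical feed layout (an assumption, not taken from the thesis):
# each post carries nested "comments" and "likes" lists.
SAMPLE_FEED = [
    {
        "id": "p1",
        "message": "Hello fans",
        "comments": [{"id": "c1", "from": "alice", "message": "Nice!"}],
        "likes": [{"user": "alice"}, {"user": "bob"}],
    },
    {"id": "p2", "message": "Second post", "comments": [], "likes": []},
]


def rank_pages(pages):
    """Order candidate fan pages by like count, most-liked first."""
    return sorted(pages, key=lambda page: page["likes"], reverse=True)


def split_feed(feed):
    """Split a parsed feed into post, comment, and like rows."""
    posts, comments, likes = [], [], []
    for post in feed:
        posts.append((post["id"], post.get("message", "")))
        for comment in post.get("comments", []):
            comments.append(
                (comment["id"], post["id"], comment["from"], comment["message"])
            )
        for like in post.get("likes", []):
            likes.append((post["id"], like["user"]))
    return posts, comments, likes


def store(conn, posts, comments, likes):
    """Write rows to SQLite; INSERT OR REPLACE keeps repeated updates idempotent."""
    conn.executescript(
        """
        CREATE TABLE IF NOT EXISTS posts (id TEXT PRIMARY KEY, message TEXT);
        CREATE TABLE IF NOT EXISTS comments (
            id TEXT PRIMARY KEY, post_id TEXT, author TEXT, message TEXT);
        CREATE TABLE IF NOT EXISTS likes (
            post_id TEXT, user_name TEXT, PRIMARY KEY (post_id, user_name));
        """
    )
    conn.executemany("INSERT OR REPLACE INTO posts VALUES (?, ?)", posts)
    conn.executemany("INSERT OR REPLACE INTO comments VALUES (?, ?, ?, ?)", comments)
    conn.executemany("INSERT OR REPLACE INTO likes VALUES (?, ?)", likes)
    conn.commit()
```

Because the rows are keyed by post, comment, and (post, user) identifiers, a timed update could call `store` again with a fresh feed without duplicating rows that were already saved.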
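    Description: Master's thesis
    National Chengchi University
    Department of Computer Science
    102753030
    Source: http://thesis.lib.nccu.edu.tw/record/#G0102753030
    Type: thesis
    Appears in Collections: [Department of Computer Science] Theses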

    Files in this item:

    File          Size      Format       Views
    303001.pdf    1545Kb    Adobe PDF    2343


    All items in NCCUR are protected by the original copyright.

