政大機構典藏-National Chengchi University Institutional Repository(NCCUR):Item 140.119/76423
English  |  正體中文  |  简体中文  |  Post-Print筆數 : 27 |  Items with full text/Total items : 113822/144841 (79%)
Visitors : 51789088      Online Users : 527
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version
    政大典藏 > College of Commerce > Department of MIS > Theses >  Item 140.119/76423
    Please use this identifier to cite or link to this item: https://nccur.lib.nccu.edu.tw/handle/140.119/76423


    Title: 基於文件相似度的標籤推薦-應用於問答型網站
    Applying Tag Recommendation base on Document Similarity in Question and Answer Website
    Authors: 葉早彬
    Tsao, Pin Yeh
    Contributors: 楊建民
    葉早彬
    Tsao, Pin Yeh
    Keywords: 文字探勘
    標籤推薦
    群眾智慧
    Text Mining
    Tag Recommendation
    Collective Intelligence
    Date: 2015
    Issue Date: 2015-07-13 11:07:04 (UTC+8)
    Abstract: 隨著人們習慣的改變,從網路上獲取新知漸漸取代傳統媒體,這也延伸產生許多新的行為。社群標籤是近幾年流行的一種透過使用者標記來分類與詮釋資訊的方式,相較於傳統分類學要求物件被分類到預先定義好的類別,社群標籤則沒有這樣的要求,因此容易因應內容的變動做出調整。
    問答型網站是近年來興起的一種個開放性的知識分享平台,例如quora、Stack Overflow、yahoo 奇摩知識+,使用者可以在平台上與網友做問答的互動,在問與答的討論中,結合大眾的經驗與專長,幫助使用者找到滿意的答案,使用單純的問答系統的好處是可以不必在不同且以分類為主的論壇花費時間尋找答案,和在關鍵字搜索中的結果花費時間尋找答案。
    本研究希望能針對問答型網站的文件做自動標籤分類,運用標籤推薦技術來幫助使用者能夠更有效率的找到需要的問題,也讓問答平台可以把這些由使用者所產生的大量問題分群歸類。
    在研究過程蒐集Stack Exchange問答網站共20638個問題,使用naïve Bayes演算法與文件相似度計算的方式,進行標籤推薦,推薦適合的標籤給新進文件。在研究結果中,推薦標籤的準確率有64.2%
    本研究希望透過自動分類標籤,有效地分類問題。幫助使用者有效率的找到需要的問題,也能把這些由使用者所產生的大量問題分群歸類。
    With User`s behavior change. User access to new knowledge from the internet instead of from the traditional media. This Change leads to a lot new behavior. Social tagging is popular in recent years through a user tag to classify and annotate information. Unlike traditional taxonomy requiring items are classified into predefined categories, Social tagging is more elastic to adjust through the content change.
    Q & A Website is the rise in recent years. Like Quora , Stack Overflow , yahoo Knowledge plus. User can interact with other people form this platform , in Q & A discussion, with People`s experience and expertise to help the user find a satisfactory answer.
    This study hopes to build a tag recommendation system for Q & A Website. The recommendation system can help people find the right problem efficiently , and let Q & A platform can put these numerous problems into the right place.
    We collect 20,638 questions from Stack Exchange. Use naïve Bayes algorithm and document similarity calculation to recommend tag for the new document. The result of the evaluation show we can effectively recommend relevant tags for the new question.
    Reference: 林倩妏, and 卜小蝶. "標籤雲在圖書資訊服務之應用初探." 海峽兩岸圖書資訊學學術研討會論文集 二-117 (2010).
    陳光華,莊雅蓁. 應用於資訊檢索的中文同義詞之建構. 資訊傳播與圖書館學, 8 (1), 2001 年 9 月, 2001, 59-75.
    陸明怡. "以群眾智慧觀念為基礎之群體意見結論推論模式" (碩士論文) 清華大學(2011)
    楊玉齡譯,Surowiecki, James 原著,2005,《群眾的智慧:如何讓個人、團隊、企業 與社會變得更聰明》,台北市:遠流公司出版社。
    戴瑋. "應用社會化協同標籤於網路資源搜尋." (碩士論文)中央大學(2008).
    http://sparc.nfu.edu.tw/~tchen/DataMining2/ch5.ppt
    http://zh.wikipedia.org/wiki/最近鄰居法
    http://zh.wikipedia.org/wiki/Yahoo!奇摩知識+
    http://zh.wikipedia.org/wiki/Web_2.0
    Budura, A., Michel, S., Cudré-Mauroux, P., & Aberer, K. (2009). Neighborhood-based tag prediction. In The semantic web: research and applications (pp. 608-622). Springer Berlin Heidelberg.
    Cao, H., Xie, M., Xue, L., Liu, C., Teng, F., & Huang, Y. (2009). Social tag prediction base on supervised ranking model. In Proceeding of ECML/PKDD 2009 Discovery Challenge Workshop (pp. 35-48).
    Heymann, P., Ramage, D., & Garcia-Molina, H. (2008, July). Social tag prediction. In Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (pp. 531-538). ACM.
    Huberman, S. G. B. A. The Structure of Collaborative Tagging Systems. No. cs. DL/0508082. cs/0508082, 2005.
    Jordan, A. (2002). On discriminative vs. generative classifiers: A comparison of logistic regression and naive bayes. Advances in neural information processing systems, 14, 841.
    Rodrigues, E. M., Milic-Frayling, N., & Fortuna, B. (2008, December). Social tagging behaviour in community-driven question answering. In Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology-Volume 01 (pp. 112-119). IEEE Computer Society.
    McCallum, A., & Nigam, K. (1998, July). A comparison of event models for naive bayes text classification. In AAAI-98 workshop on learning for text categorization (Vol. 752, pp. 41-48).
    Yin, D., Xue, Z., Hong, L., & Davison, B. D. (2010, July). A probabilistic model for personalized tag prediction. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 959-968). ACM.
    Rendle, S., Balby Marinho, L., Nanopoulos, A., & Schmidt-Thieme, L. (2009, June). Learning optimal ranking with tensor factorization for tag recommendation. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 727-736). ACM.
    Salton, G., McGill, M. (1983). Introduction to Modern Information Retrieval, New York: McGraw-Hill.
    Salton, G., Wong, A., & Yang, C. S. (1975). A vector space model for automatic indexing. Communications of the ACM, 18(11), 613-620.
    Sigurbjörnsson, B., & Van Zwol, R. (2008, April). Flickr tag recommendation based on collective knowledge. In Proceedings of the 17th international conference on World Wide Web (pp. 327-336). ACM.
    https://archive.org/details/stackexchange
    Description: 碩士
    國立政治大學
    資訊管理研究所
    102356003
    103
    Source URI: http://thesis.lib.nccu.edu.tw/record/#G0102356003
    Data Type: thesis
    Appears in Collections:[Department of MIS] Theses

    Files in This Item:

    File Description SizeFormat
    600301.pdf1363KbAdobe PDF2763View/Open
    600302.pdf1363KbAdobe PDF2421View/Open


    All items in 政大典藏 are protected by copyright, with all rights reserved.


    社群 sharing

    著作權政策宣告 Copyright Announcement
    1.本網站之數位內容為國立政治大學所收錄之機構典藏,無償提供學術研究與公眾教育等公益性使用,惟仍請適度,合理使用本網站之內容,以尊重著作權人之權益。商業上之利用,則請先取得著作權人之授權。
    The digital content of this website is part of National Chengchi University Institutional Repository. It provides free access to academic research and public education for non-commercial use. Please utilize it in a proper and reasonable manner and respect the rights of copyright owners. For commercial use, please obtain authorization from the copyright owner in advance.

    2.本網站之製作,已盡力防止侵害著作權人之權益,如仍發現本網站之數位內容有侵害著作權人權益情事者,請權利人通知本網站維護人員(nccur@nccu.edu.tw),維護人員將立即採取移除該數位著作等補救措施。
    NCCU Institutional Repository is made to protect the interests of copyright owners. If you believe that any material on the website infringes copyright, please contact our staff(nccur@nccu.edu.tw). We will remove the work from the repository and investigate your claim.
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - Feedback