政大機構典藏-National Chengchi University Institutional Repository(NCCUR):Item 140.119/131481
English  |  正體中文  |  简体中文  |  Post-Print筆數 : 27 |  全文笔数/总笔数 : 113451/144438 (79%)
造访人次 : 51243843      在线人数 : 884
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜寻范围 查询小技巧:
  • 您可在西文检索词汇前后加上"双引号",以获取较精准的检索结果
  • 若欲以作者姓名搜寻,建议至进阶搜寻限定作者字段,可获得较完整数据
  • 进阶搜寻
    政大機構典藏 > 商學院 > 統計學系 > 學位論文 >  Item 140.119/131481


    请使用永久网址来引用或连结此文件: https://nccur.lib.nccu.edu.tw/handle/140.119/131481


    题名: 使用評論文字改善轉換率預測
    Improving Conversion Rate Prediction with Review Text
    作者: 許振楡
    Hsu, Chen-Yu
    贡献者: 翁久幸
    Weng, Jiu-Xing
    許振楡
    Hsu, Chen-Yu
    关键词: 轉換率預測
    文字評論
    機器學習
    日期: 2020
    上传时间: 2020-09-02 11:43:53 (UTC+8)
    摘要: 隨著電商平台的出現,顧客消費習慣逐漸受到改變,「線上評論」成為左右消費者購買意願的重要因素,參考過去學者 Chevalier 和 Mayzlin [3]對此議題的探討,以銷售排名作為反應變數,建立迴歸模型觀察評論分數、其他特徵的顯著程度,並無直接從評論文字萃取特徵,本論文建立在 Chevalier 和 Mayzlin [3]所提出的特徵,研究加上評論文字資訊能否更有效的預測顧客消費行為,評論文字資訊以 TFIDF、CBOW、Skip-gram 詞嵌入向量為特徵。
    本文以某旅遊電商平台評論資料集為主,研究分成三部分,第一部分使用機器學習方法以文字特徵預測評論分數,預測分數與實際分數相關係數介於 0.2 到0.4 之間。第二部分以轉換率為預測目標,第三部分預測下期轉換率漲跌,分別比較加入文字特徵與僅以分數、其他評論特徵所建模型是否有更好的預測效果,實驗結果顯示,在此資料集上不包含前期轉換率時預測轉換率及下期漲跌,加入文字特徵皆有變好,若含前期轉換率時則僅有小幅的提升。
    With the showing of electronic commerce, consuming behavior has been changed.“Online Review”is an important factor that has big emphasis on customers’purchase intention. According to Chevalier and Mayzlin [3] s’research, they take sales number as response variable and build regression model to check the significance of score characteristics and other characteristics. However, they don’t consider the text review due to lack of natural language preprocessing methods. This research add review text information to see whether model has a better ability to predict customer behavior. We take two kinds of TFIDF、CBOW and Skip-gram as text characteristics.
    Based on a traveling e-commerce review data, this research spit into three sections. In Section 1, predicting review score by using machine learning methods at first. In order to compare the difference between text characteristics and review score, we calculate the correlation of predicted score and original review score and get the result between 0.2 and 0.4. In section 2 and 3, our predict target is conversion rate and the trend of next week conversion rate, which go up, down or keep constant. We comparing model with text characteristics and without text characteristics to see whether text can bring useful information. Result shows that adding text characteristics truly can help predict conversion rate and the trend of next week conversion rate when model don’t combine previous conversion rate but only has a little help with previous conversion rate.
    參考文獻: [1] Salton Gerard and Michael J. McGill. Introduction to Modern Information Retrieval, October 1986.
    [2] Greg Corrado, Jeffrey Dean, Kai Chen and Tomas Mikolov. Efficient Estimation of Word Representations in Vector Space, September 2013.
    [3] Dina Mayzlin and Judith A. Chevalier. The Effect of Word of Mouth on Sales:Online Book Reviews, August 2006.
    [4] Eric Clemons, Guodong Gao and Lorin M. Hitt. When Online Reviews Meet Hyperdifferentiation : A Study of Craft Beer Industry, February 2006.
    [5] Nan Hu, Ling Liu and Jie Zhang. Do Online Reviews Affect Product Sales? The Role of Reviewer Characteristics and Temporal Effects, September 2008.
    [6] Yong Liu. Word of Mouth for Movies:Its Dynamics and Impact on Box Office Revenue, July 2006.
    [7] Jerome H. Friedman. Greedy Function Approximation: A Gradient Boosting Machine. The Annals of Statistics Vol. 29 No.5, 2001.
    [8] Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye and Tie-Yan Liu. LightGBM:A Highly Efficient Gradient Boosting Decision Tree, December 2017.
    [9] Menno van Zaanen and Pieter Kanters. Automatic Mood Classification Using tf*idf Based on Lyrics. In J. Stephen Downie and Remco C. Veltkamp, 11th International Society for Music Information and Retrieval Conference, August 2010.
    [10] Hsin-His Chen and Lun-Wei Ku. Mining opinions from the Web: Beyond relevance retrieval. Journal of the American Society for Information Science and Technology, 58(12), 1838-1850, August 2007.
    描述: 碩士
    國立政治大學
    統計學系
    107354029
    資料來源: http://thesis.lib.nccu.edu.tw/record/#G0107354029
    数据类型: thesis
    DOI: 10.6814/NCCU202001226
    显示于类别:[統計學系] 學位論文

    文件中的档案:

    没有与此文件相关的档案.



    在政大典藏中所有的数据项都受到原著作权保护.


    社群 sharing

    著作權政策宣告 Copyright Announcement
    1.本網站之數位內容為國立政治大學所收錄之機構典藏,無償提供學術研究與公眾教育等公益性使用,惟仍請適度,合理使用本網站之內容,以尊重著作權人之權益。商業上之利用,則請先取得著作權人之授權。
    The digital content of this website is part of National Chengchi University Institutional Repository. It provides free access to academic research and public education for non-commercial use. Please utilize it in a proper and reasonable manner and respect the rights of copyright owners. For commercial use, please obtain authorization from the copyright owner in advance.

    2.本網站之製作,已盡力防止侵害著作權人之權益,如仍發現本網站之數位內容有侵害著作權人權益情事者,請權利人通知本網站維護人員(nccur@nccu.edu.tw),維護人員將立即採取移除該數位著作等補救措施。
    NCCU Institutional Repository is made to protect the interests of copyright owners. If you believe that any material on the website infringes copyright, please contact our staff(nccur@nccu.edu.tw). We will remove the work from the repository and investigate your claim.
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 回馈