    Please use this identifier to cite or link to this item: https://nccur.lib.nccu.edu.tw/handle/140.119/131481


    Title: Improving Conversion Rate Prediction with Review Text
    (使用評論文字改善轉換率預測)
    Authors: 許振楡
    Hsu, Chen-Yu
    Contributors: 翁久幸
    Weng, Jiu-Xing
    許振楡
    Hsu, Chen-Yu
    Keywords: Conversion rate prediction
    Review text
    Machine learning
    Date: 2020
    Issue Date: 2020-09-02 11:43:53 (UTC+8)
    Abstract: With the rise of e-commerce platforms, consumer habits have gradually changed, and online reviews have become an important factor in customers' purchase intention. Chevalier and Mayzlin [3] studied this question by taking sales rank as the response variable and building a regression model to examine the significance of review scores and other features. They did not extract features directly from the review text, owing to the lack of natural language preprocessing methods. Building on the features proposed by Chevalier and Mayzlin [3], this thesis investigates whether adding review text information predicts customer purchasing behavior more effectively. We use two kinds of TFIDF, CBOW, and Skip-gram word embedding vectors as text features.
    The study uses a review dataset from a travel e-commerce platform and is split into three parts. The first part uses machine learning methods to predict review scores from text features. To compare the text features with the review score, we compute the correlation between the predicted and actual scores, which falls between 0.2 and 0.4. The second part takes the conversion rate as the prediction target, and the third part predicts the trend of the next week's conversion rate (up, down, or constant). In both parts, models with text features are compared against models built only on review scores and other review features to see whether the text adds useful information. The results show that, on this dataset, adding text features improves the prediction of both the conversion rate and its next-period trend when the previous conversion rate is not included in the model, but gives only a small improvement when it is included.
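    The building blocks named in the abstract can be illustrated with a minimal sketch. It is not the thesis's implementation: the TF-IDF variant (tf = count / document length, idf = log(N / df)), the toy reviews, the scores, and the function names tfidf, pearson, and trend_label are all assumptions made for illustration. It shows one way to turn tokenized reviews into TF-IDF weights, to measure the correlation between predicted and actual review scores, and to label the next-period conversion rate trend.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return one {term: weight} dict per tokenized, non-empty document.

    Assumed variant: tf = count / document length, idf = log(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def trend_label(previous_rate, next_rate, tol=1e-9):
    """Label the next-period conversion rate as 'up', 'down', or 'constant'."""
    if next_rate > previous_rate + tol:
        return "up"
    if next_rate < previous_rate - tol:
        return "down"
    return "constant"


# Hypothetical toy reviews, already tokenized.
reviews = [
    ["great", "tour", "friendly", "guide"],
    ["bad", "tour", "late", "bus"],
    ["great", "view", "great", "guide"],
]
vectors = tfidf(reviews)

# Hypothetical actual and predicted review scores.
actual = [5, 1, 4, 3]
predicted = [4.2, 2.5, 3.9, 3.6]
r = pearson(actual, predicted)
```

    In a fuller experiment, the TF-IDF weights (or CBOW and Skip-gram embeddings) would be fed to a regression or classification model alongside the score features, and the model would be compared against one trained without them.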
    Reference: [1] Gerard Salton and Michael J. McGill. Introduction to Modern Information Retrieval, October 1986.
    [2] Tomas Mikolov, Kai Chen, Greg Corrado and Jeffrey Dean. Efficient Estimation of Word Representations in Vector Space, September 2013.
    [3] Judith A. Chevalier and Dina Mayzlin. The Effect of Word of Mouth on Sales: Online Book Reviews, August 2006.
    [4] Eric Clemons, Guodong Gao and Lorin M. Hitt. When Online Reviews Meet Hyperdifferentiation: A Study of the Craft Beer Industry, February 2006.
    [5] Nan Hu, Ling Liu and Jie Zhang. Do Online Reviews Affect Product Sales? The Role of Reviewer Characteristics and Temporal Effects, September 2008.
    [6] Yong Liu. Word of Mouth for Movies: Its Dynamics and Impact on Box Office Revenue, July 2006.
    [7] Jerome H. Friedman. Greedy Function Approximation: A Gradient Boosting Machine. The Annals of Statistics, Vol. 29, No. 5, 2001.
    [8] Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye and Tie-Yan Liu. LightGBM: A Highly Efficient Gradient Boosting Decision Tree, December 2017.
    [9] Menno van Zaanen and Pieter Kanters. Automatic Mood Classification Using tf*idf Based on Lyrics. In J. Stephen Downie and Remco C. Veltkamp (eds.), 11th International Society for Music Information Retrieval Conference, August 2010.
    [10] Lun-Wei Ku and Hsin-Hsi Chen. Mining Opinions from the Web: Beyond Relevance Retrieval. Journal of the American Society for Information Science and Technology, 58(12), 1838-1850, August 2007.
    Description: Master's
    National Chengchi University
    Department of Statistics
    107354029
    Source URI: http://thesis.lib.nccu.edu.tw/record/#G0107354029
    Data Type: thesis
    DOI: 10.6814/NCCU202001226
    Appears in Collections: [Department of Statistics] Theses
