政大機構典藏-National Chengchi University Institutional Repository(NCCUR):Item 140.119/112274

English | 正體中文 | 简体中文 | Post-Print筆數 : 27 | Items with full text/Total items : 115256/146303 (79%)
Visitors : 54539197 Online Users : 319

RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.

Scope

please add "double quotation mark" for query phrases to get precise results

please goto advance search for comprehansive author search

Adv. Search

Home ‧ Login ‧ Upload ‧ Help ‧ About ‧ Administer

Goto mobile version

政大機構典藏 > 商學院 > 資訊管理學系 > 學位論文 > Item 140.119/112274

Please use this identifier to cite or link to this item: https://nccur.lib.nccu.edu.tw/handle/140.119/112274

Title:	零售藥妝顧客購買頻率與利潤之分析 Analysis of Customer Purchase Frequency and Profitability in Retail Pharmacy Stores
Authors:	黃兆椿
Contributors:	莊皓鈞黃兆椿
Keywords:	零售業 RFM 集中度廣度資料分析 Retailing RFM Clumpiness Breadth Data Analytics
Date:	2017
Issue Date:	2017-08-28 13:38:41 (UTC+8)
Abstract:	本研究主要探討藥妝零售產業提升預測顧客行為的模型與方法，並以RFM模型為基礎進行延伸。RFM模型在行銷領域中是廣泛被使用的模型，具有良好預測和分群顧客的能力，本研究在此模型中加入了兩項新指標：集中度 (C) 和廣度 (B)，並針對顧客的「交易頻率」和「交易利潤」進行分析，藉此找出優於RFM的指標組合。首先將RFM、C、B共五項指標進行排列組合，並以迴歸分析驗證新增的兩項指標能顯著提升模型解釋能力，接著將RFM指標組合及RFMCB指標組合分別作為機器學習方法的解釋變數以預測顧客行為。對顧客交易頻率而言，C和B兩項指標的加入能顯著提升其預測能力，對顧客交易利潤而言，新指標的加入，平均而言對於預測精準度有所提升，但在部分資料中會使誤差值增加以致整體誤差的最大值有所提升。 This research proposes modeling techniques to better predict customer behaviors in the retail industry. Extending the widely-adopted RFM model in marketing, we introduce two new metrics – clumpiness (C) and breadth (B). Using more than two million transaction records from over 100 retail pharmacy stores in Taiwan, we fit a set of regression models, in which we assess the explanatory power of different combinations of RFMCB for customer purchase frequency and profitability. Our analysis shows that the RFM model is significantly inferior to models with C and/or B, suggesting that C and B are indeed promising metrics. In the next stage, we will apply machine learning methods to incorporate C and B into predictive models and assess their out-of-sample prediction performance. On Average, RFMCB outperforms RFM in predicting Frequency & Profit. However, there are some cases where RFMCB leads to larger prediction error.
Reference:	Akaike, H. (1974). A new look at the statistical model identification. IEEE transactions on automatic control, 19(6), 716-723. Auria, L., & Moro, R. A. (2008). Support vector machines (SVM) as a technique for solvency analysis. Bell, D. R., & Lattin, J. M. (1998). Shopping behavior and consumer preference for store price format: Why “large basket” shoppers prefer EDLP. Marketing Science, 17(1), 66-88. Berger, P., & Magliozzi, T. (1992). The effect of sample size and proportion of buyers in the sample on the performance of list segmentation equations generated by regression analysis. Journal of Direct Marketing, 6(1), 13-22. Bhattacharyya, S. (1999). Direct marketing performance modeling using genetic algorithms. INFORMS Journal on Computing, 11(3), 248-257. Breiman, L. (1996). Bagging predictors. Machine learning, 24(2), 123-140. Chai, T., & Draxler, R. R. (2014). Root mean square error (RMSE) or mean absolute error (MAE)?–Arguments against avoiding RMSE in the literature. Geoscientific Model Development, 7(3), 1247-1250. Chen, T., & Guestrin, C. (2016, August). Xgboost: A scalable tree boosting system. In Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 785-794). ACM. Colombo, R., & Jiang, W. (1999). A stochastic RFM model. Journal of Interactive Marketing, 13(3), 2-12. Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine learning, 20(3), 273-297 Coussement, K., Van den Bossche, F. A., & De Bock, K. W. (2014). Data accuracy`s impact on segmentation performance: Benchmarking RFM analysis, logistic regression, and decision trees. Journal of Business Research, 67(1), 2751-2758. Cui, G., Wong, M. L., & Lui, H. K. (2006). Machine learning for direct marketing response models: Bayesian networks with evolutionary programming. Management Science, 52(4), 597-612. Drucker, H., Burges, C. J., Kaufman, L., Smola, A., & Vapnik, V. (1997). Support vector regression machines. Advances in neural information processing systems, 9, 155-161. Elith, J., Leathwick, J. R., & Hastie, T. (2008). A working guide to boosted regression trees. Journal of Animal Ecology, 77(4), 802-813. Friedman, J. H. (2002). Stochastic gradient boosting. Computational Statistics & Data Analysis, 38(4), 367-378. Hastie, T. J., & Tibshirani, R. J. (1990). Generalized additive models (Vol. 43). CRC press. Haughton, D., & Oulabi, S. (1997). Direct marketing modeling with CART and CHAID. Journal of Interactive Marketing, 11(4), 42-52. Hosseini, Seyed Mohammad Seyed, Anahita Maleki, and Mohammad Reza Gholamian. "Cluster analysis using data mining approach to develop CRM methodology to assess the customer loyalty." Expert Systems with Applications 37.7 (2010): 5259-5264. Jean Halliday. (2002). Database Marketing: GM plays cards right. Retrieved January 14, 2017, from http://adage.com/article/interactive-media-marketing/database-marketing-gm-plays-cards/52084/ Jiang, W. (2002). On weak base hypotheses and their implications for boosting regression and classification. Annals of statistics, 51-73. Johnson, N. L. (1949). Systems of frequency curves generated by methods of translation. Biometrika, 36(1/2), 149-176. Kahan, R. (1998). Using database marketing techniques to enhance your one-to-one marketing initiatives. Journal of Consumer Marketing, 15(5), 491-493. Khajvand, M., Zolfaghar, K., Ashoori, S., & Alizadeh, S. (2011). Estimating customer lifetime value based on RFM analysis of customer purchase behavior: Case study. Procedia Computer Science, 3, 57-63. Kohavi, R. (1995, August). A study of cross-validation and bootstrap for accuracy estimation and model selection. In Ijcai (Vol. 14, No. 2, pp. 1137-1145). Kumar, V., Srinivasan, K., Rao, V. R., Zhang, Y., Bradlow, E. T., & Small, D. S. (2015). Commentaries and Reply on “Predicting Customer Value Using Clumpiness: From RFM to RFMC” by Yao Zhang, Eric T. Bradlow, and Dylan S. Small. Marketing Science, 34(2), 209-217. Ling, C. X., & Li, C. (1998, August). Data Mining for Direct Marketing: Problems and Solutions. In KDD (Vol. 98, pp. 73-79). Marcus, C. (1998). A practical yet meaningful approach to customer segmentation. Journal of consumer marketing, 15(5), 494-504. McCarty, J. A., & Hastak, M. (2007). Segmentation approaches in data-mining: A comparison of RFM, CHAID, and logistic regression. Journal of business research, 60(6), 656-662. Natekin, A., & Knoll, A. (2013). Gradient boosting machines, a tutorial. Frontiers in neurorobotics, 7, 21. Netzer, O., Lattin, J. M., & Srinivasan, V. (2008). A hidden Markov model of customer relationship dynamics. Marketing Science, 27(2), 185-204. Petrison, L. A., Blattberg, R. C., & Wang, P. (1997). Database marketing: Past, present, and future. Journal of Interactive Marketing, 11(4), 109-125. Prasad, A. M., Iverson, L. R., & Liaw, A. (2006). Newer classification and regression tree techniques: bagging and random forests for ecological prediction. Ecosystems, 9(2), 181-199. Ridgeway, G. (2002). Looking for lumps: Boosting and bagging for density estimation. Computational Statistics & Data Analysis, 38(4), 379-392. Ridgeway, G. (2007). Generalized Boosted Models: A guide to the gbm package. Update, 1(1), 2007. Schweidel, D. A., Bradlow, E. T., & Fader, P. S. (2011). Portfolio dynamics for customers of a multiservice provider. Management Science, 57(3), 471-486. Sohrabi, B., & Khanlari, A. (2007). Customer lifetime value (CLV) measurement based on RFM model. Iranian Accounting & Auditing Review, 14(47), 7-20. Verhoef, P. C., Spring, P. N., Hoekstra, J. C., & Leeflang, P. S. (2003). The commercial use of segmentation and predictive modeling techniques for database marketing in the Netherlands. Decision Support Systems, 34(4), 471-481. Wagenmakers, E. J., & Farrell, S. (2004). AIC model selection using Akaike weights. Psychonomic bulletin & review, 11(1), 192-196. Yeh, I. C., Yang, K. J., & Ting, T. M. (2009). Knowledge discovery on RFM model using Bernoulli sequence. Expert Systems with Applications, 36(3), 5866-5871. Zhang, Y., Bradlow, E. T., & Small, D. S. (2013). New measures of clumpiness for incidence data. Journal of Applied Statistics, 40(11), 2533-2548. Zhang, Y., Bradlow, E. T., & Small, D. S. (2014). Predicting customer value using clumpiness: From RFM to RFMC. Marketing Science, 34(2), 195-208. Zwilling M. L. (2013), “Negative Binomial Regression,” The Mathematica Journal, dx.doi.org/10.3888/tmj.15-6 林軒田 (民104年12月8日)。Machine Learning Foundation (機器學習基石) 【部落格影音資料】取自https://www.youtube.com/playlist?list=PLXVfgk9fNX2I7tB6oIINGBmW50rrmFTqf
Description:	碩士國立政治大學資訊管理學系 104356032
Source URI:	http://thesis.lib.nccu.edu.tw/record/#G0104356032
Data Type:	thesis
Appears in Collections:	[資訊管理學系] 學位論文

Files in This Item:

File	Size	Format
603201.pdf	911Kb	Adobe PDF2	38	View/Open

All items in 政大典藏 are protected by copyright, with all rights reserved.

社群 sharing

著作權政策宣告 Copyright Announcement

1.本網站之數位內容為國立政治大學所收錄之機構典藏，無償提供學術研究與公眾教育等公益性使用，惟仍請適度，合理使用本網站之內容，以尊重著作權人之權益。商業上之利用，則請先取得著作權人之授權。
The digital content of this website is part of National Chengchi University Institutional Repository. It provides free access to academic research and public education for non-commercial use. Please utilize it in a proper and reasonable manner and respect the rights of copyright owners. For commercial use, please obtain authorization from the copyright owner in advance.

2.本網站之製作，已盡力防止侵害著作權人之權益，如仍發現本網站之數位內容有侵害著作權人權益情事者，請權利人通知本網站維護人員(nccur@nccu.edu.tw)，維護人員將立即採取移除該數位著作等補救措施。
NCCU Institutional Repository is made to protect the interests of copyright owners. If you believe that any material on the website infringes copyright, please contact our staff(nccur@nccu.edu.tw). We will remove the work from the repository and investigate your claim.

DSpace Software Copyright © 2002-2004 MIT & Hewlett-Packard / Enhanced by NTU Library IR team Copyright © - Feedback