政大機構典藏-National Chengchi University Institutional Repository(NCCUR):Item 140.119/153145

English | 正體中文 | 简体中文 | Post-Print筆數 : 27 | Items with full text/Total items : 115581/146615 (79%)
Visitors : 55643659 Online Users : 320

RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.

Scope

please add "double quotation mark" for query phrases to get precise results

please goto advance search for comprehansive author search

Adv. Search

Home ‧ Login ‧ Upload ‧ Help ‧ About ‧ Administer

Goto mobile version

政大機構典藏 > 商學院 > 資訊管理學系 > 學位論文 > Item 140.119/153145

Please use this identifier to cite or link to this item: https://nccur.lib.nccu.edu.tw/handle/140.119/153145

Title:	高維多變量分布的近似學習 High-Dimensional Multivariate Distributions: Approximation and Learning
Authors:	林盈盈 Lin, Ying-Ying
Contributors:	周彥君莊皓鈞 Chou, Yen-Chun Chuang, Hao-Chun 林盈盈 Lin, Ying-Ying
Keywords:	多變量分布混合密度網路低秩近似法高斯 Copula 報童問題 Multivariate Distribution Mixture Density Network Low-rank approximation Gaussian Copula Newsvendor problem
Date:	2024
Issue Date:	2024-09-04 14:02:43 (UTC+8)
Abstract:	本研究深入探討高維多變量分布的近似學習問題，並提出兩種創新方法以應對高維度配適的挑戰：結合多變量混合密度網絡（MDN）與低秩近似法，及基於高斯Copula和低秩近似的長短期記憶模型（LSTM）。研究結果表明，這些方法在處理複雜的多變量分布和高維共變異矩陣估計方面均表現出顯著效果。在高維度的模擬實驗中，低秩近似法顯著提升MDN模型在高維數據配適中的準確性與穩定性。此外，本研究模擬高維度的自回歸離散時間序列數據，並針對多品項報童問題進行條件風險價值（CVaR）最佳化，發現使用經典高斯分布作為邊際分布的Copula LSTM模型表現不如預期，因此採用更具彈性的指數修正高斯分布，此方法在時間序列聯合分布學習和風險管理中的利潤優化上展現更優的性能。 This study investigates the approximation of high-dimensional multivariate distributions and introduces two innovative methods to tackle the challenges of high-dimensional distribution fitting: combining Mixture Density Networks (MDN) with low-rank approximation and developing a Copula LSTM model based on Gaussian Copula, Long Short-Term Memory (LSTM), and low-rank approximation. The results indicate that these methods are effective in managing complex multivariate distributions and estimating high-dimensional covariance matrices. In high-dimensional simulation experiments, the low-rank approximation notably enhances the accuracy and stability of the MDN model. Furthermore, this study simulates high-dimensional autoregressive discrete time series data and performs Conditional Value at Risk (CVaR) optimization for multi-item newsvendor problems. It reveals that the Copula LSTM model using classical Gaussian distributions as marginal distributions underperforms. Consequently, an exponentially modified Gaussian distribution is adopted, demonstrating superior performance in time series joint distribution learning and risk-averse profit optimization.
Reference:	Bishop, C. M. (1994). Mixture density networks. Technical Report. Aston University, Birmingham. (Unpublished). Bishop, C. M. (2006). Pattern Recognition and Machine Learning (Information Science and Statistics). Springer-Verlag. Brando Guillaumes, A. (2017). Mixture density networks for distribution and uncertainty estimation. Universitat Politècnica de Catalunya. Carruthers, J., & Finnie, T. (2023). Using mixture density networks to emulate a stochastic within-host model of Francisella tularensis infection. PLOS Computational Biology, 19(12), e1011266. Charpentier, A., Fermanian, J.-D., & Scaillet, O. (2007). The estimation of copulas: Theory and practice. Copulas: From theory to application in finance, 35. Chen, D., Xue, Y., & Gomes, C. (2018). End-to-end learning for the deep multivariate probit model. International Conference on Machine Learning, Chen, J., & Tan, X. (2009). Inference for multivariate normal mixtures. Journal of Multivariate Analysis, 100(7), 1367-1383. Chen, Y., Xu, M., & Zhang, Z. G. (2009). A risk-averse newsvendor model under the CVaR criterion. Operations research, 57(4), 1040-1044. Danaher, P. J., & Smith, M. S. (2011). Modeling multivariate distributions using copulas: Applications in marketing. Marketing science, 30(1), 4-21. Elidan, G. (2013). Copulas in machine learning. Copulae in Mathematical and Quantitative Finance: Proceedings of the Workshop Held in Cracow, 10-11 July 2012 (pp. 39-60). Berlin, Heidelberg: Springer Berlin Heidelberg. Gonçalves, J. N., Cortez, P., Carvalho, M. S., & Frazao, N. M. (2021). A multivariate approach for multi-step demand forecasting in assembly industries: Empirical evidence from an automotive supply chain. Decision Support Systems, 142, 113452. Haney, S. (2011). Practical applications and properties of the Exponentially Modified Gaussian (EMG) distribution Drexel University]. Huber, J., Müller, S., Fleischmann, M., & Stuckenschmidt, H. (2019). A data-driven newsvendor problem: From data to decision. European Journal of Operational Research, 278(3), 904-915. Jammernegg, W., & Kischka, P. (2012). Newsvendor problems with VaR and CVaR consideration. Handbook of newsvendor problems: models, extensions and applications, 197-216. Kruse, J. (2020). Technical report: Training mixture density networks with full covariance matrices. arXiv preprint arXiv:2003.05739. Liboschik, T., Fokianos, K., & Fried, R. (2017). tscount: An R package for analysis of count time series following generalized linear models. Journal of Statistical Software, 82, 1-51. Madan, D. B. (2020). Multivariate distributions for financial returns. International Journal of Theoretical and Applied Finance, 23(06), 2050041. Makansi, O., Ilg, E., Cicek, O., & Brox, T. (2019). Overcoming limitations of mixture density networks: A sampling and fitting framework for multimodal future prediction. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 7144-7153). Norwood, B., Roberts, M. C., & Lusk, J. L. (2004). Ranking crop yield models using out‐of‐sample likelihood functions. American Journal of Agricultural Economics, 86(4), 1032-1043. Peerlings, D. E., van den Brakel, J. A., Baştürk, N., & Puts, M. J. (2022). Multivariate density estimation by neural networks. IEEE Transactions on Neural Networks and Learning Systems. Reynolds, D. A. (2009). Gaussian mixture models. Encyclopedia of biometrics, 741(659-663). Salinas, D., Bohlke-Schneider, M., Callot, L., Medico, R., & Gasthaus, J. (2019). High-dimensional multivariate forecasting with low-rank gaussian copula processes. Advances in neural information processing systems, 32. Trentin, E., Lusnig, L., & Cavalli, F. (2018). Parzen neural networks: Fundamentals, properties, and an application to forensic anthropology. Neural Networks, 97, 137-151. Vapnik, V. (1995). The nature of statistical learning theory. Springer. Wang, J., & Taaffe, M. R. (2015). Multivariate mixtures of normal distributions: properties, random vector generation, fitting, and as models of market daily changes. INFORMS Journal on Computing, 27(2), 193-203. Wang, T., Cho, K., & Wen, M. (2019). Attention-based mixture density recurrent networks for history-based recommendation. Proceedings of the 1st International Workshop on Deep Learning Practice for High-Dimensional Sparse Data (pp. 1-9). Wilson, A. G., & Ghahramani, Z. (2010). Copula processes. Advances in neural information processing systems, 23. Xu, J., & Cao, L. (2023). Copula variational LSTM for high-dimensional cross-market multivariate dependence modeling. IEEE Transactions on Neural Networks and Learning Systems. Yu, J.-b., & Xi, L.-f. (2009). A neural network ensemble-based model for on-line monitoring and diagnosis of out-of-control signals in multivariate manufacturing processes. Expert systems with applications, 36(1), 909-921. Zeldes, Y., Theodorakis, S., Solodnik, E., Rotman, A., Chamiel, G., & Friedman, D. (2017). Deep density networks and uncertainty in recommender systems. arXiv preprint arXiv:1711.02487. Zhou, Y., Gao, J., & Asfour, T. (2020). Movement primitive learning and generalization: Using mixture density networks. IEEE Robotics & Automation Magazine, 27(2), 22-32.
Description:	碩士國立政治大學資訊管理學系 111356005
Source URI:	http://thesis.lib.nccu.edu.tw/record/#G0111356005
Data Type:	thesis
Appears in Collections:	[資訊管理學系] 學位論文

Files in This Item:

File	Description	Size	Format
600501.pdf		2742Kb	Adobe PDF	0	View/Open

All items in 政大典藏 are protected by copyright, with all rights reserved.

社群 sharing

著作權政策宣告 Copyright Announcement

1.本網站之數位內容為國立政治大學所收錄之機構典藏，無償提供學術研究與公眾教育等公益性使用，惟仍請適度，合理使用本網站之內容，以尊重著作權人之權益。商業上之利用，則請先取得著作權人之授權。
The digital content of this website is part of National Chengchi University Institutional Repository. It provides free access to academic research and public education for non-commercial use. Please utilize it in a proper and reasonable manner and respect the rights of copyright owners. For commercial use, please obtain authorization from the copyright owner in advance.

2.本網站之製作，已盡力防止侵害著作權人之權益，如仍發現本網站之數位內容有侵害著作權人權益情事者，請權利人通知本網站維護人員(nccur@nccu.edu.tw)，維護人員將立即採取移除該數位著作等補救措施。
NCCU Institutional Repository is made to protect the interests of copyright owners. If you believe that any material on the website infringes copyright, please contact our staff(nccur@nccu.edu.tw). We will remove the work from the repository and investigate your claim.

DSpace Software Copyright © 2002-2004 MIT & Hewlett-Packard / Enhanced by NTU Library IR team Copyright © - Feedback