Loading...
|
Please use this identifier to cite or link to this item:
https://nccur.lib.nccu.edu.tw/handle/140.119/130959
|
Title: | 基於 BERT 與 GRU 深度學習模型 - 建構新聞情緒下 Black-Litterman 投資組合 Based on BERT and GRU Deep Learning Model - Constructing a Black-Litterman Portfolio under News Sentiment |
Authors: | 夏秉宏 Hsia, Ping-Hung |
Contributors: | 翁久幸 林士貴 Weng, Chiu-Hsing Lin, Shih-Kuei 夏秉宏 Hsia, Ping-Hung |
Keywords: | 投資組合理論 情感分析 股票投資預測 自然語言處理 深度學習 Portfolio theory Sentiment analysis Stock price prediction Natural language processing Deep learning |
Date: | 2020 |
Issue Date: | 2020-08-03 17:32:00 (UTC+8) |
Abstract: | Black-Litterman模型(Black et al., 1990)試圖通過投資者觀點分配的建構來解決 Markowitz Portfolio模型(Markowitz, 1952)所遇到的問題。然而,建立投資者觀點分配需要對投資資產的未來報酬進行預測,由於我們針對股票進行投資,故可被視為一個股價預測的問題。 在本研究中,我們使用深度學習的方法來預測我們的資產價格,除了以資產的股票價格和交易量作為特徵之外,同時也認為新聞情緒是影響股票走勢的重要因素之一。
首先,我們使用 BERT(Devlin et al., 2018)衡量新聞情緒。將之定義為一個二元分類問題,並透過 BERT 模型進行情感分析訓練來判斷新聞資料帶來消息的好與壞。接著,利用三種不同的深度學習模型,分別為 vanilla RNN(Rumelhart et al., 1985),LSTM(Hochreiter et al., 1997)和 GRU(Cho et al., 2014)對股票價格進行預測,觀察不同模型的預測能力是否會影響 Black-Litterman 模型之表現結果。為了擁有夠多之新聞資料數量訓練BERT 模型,我們以美國標準普爾500指數(S&P 500) 中之七檔成分股作為投資標的,目標在於建構績效良好之投資組合。因此,我們將以四種財務指標衡量基於三種不同深度學習模型建構出之Black-Litterman模型之績效,並以其他三種投資組合作為我們的基準模型。
從本研究實證分析,我們可得到以下之結果:
1. 在三種深度學習模型中,我們以均方誤差 (Mean Square Error) 比較模型預測結果的好壞。GRU 模型 在七項投資股票資產中的表現皆優於其餘兩個模型,更能夠有效捕捉到股票未來之走勢及價格。而 LSTM 模型的表現也比 RNN 模型來得更佳。
2. 在投資組合的模型比較中,以 BERT 判斷新聞情緒並以 GRU 模型預測股價所建構出之 Black-Litterman 模型擁有最高的 46.6% 年化報酬率。同時,其擁有最高的 13.0% Sharpe Ratio 與 17.9% 之 Sortino Ratio,代表其在一定風險程度下,仍較其他建構出之投資組合來得更加優異。 The Black-Litterman Model (Black et al., 1990) attempts to solve the problems encountered by the Markowitz Portfolio Model (Markowitz, 1952) through the construction of investor view distribution. However, the construction of an investor`s point of view distribution requires future returns on investment assets. In this study, we use deep learning methods to predict our asset prices. In addition to the asset’s stock price and trading volume, we also assume that sentiment from news is one of the important factors that affect the stock trend.
First, we use BERT (Devlin et al., 2018) to measure news sentiment. It is defined as a binary classification problem, and sentiment analysis training is conducted through the BERT model to judge whether the stock news bring good news or bad news. Then, use three different deep learning models, namely vanilla RNN (Rumelhart et al., 1985), LSTM (Hochreiter et al., 1997) and GRU (Cho et al., 2014) to predict the stock price and observe whether the predictive ability of the different models will affect the performance of the Black-Litterman model. In order to have enough news materials to train the BERT model, we use the seven stocks in the S&P500 as investment assets. The goal is to build a portfolio with good performance. Therefore, we will use four financial metrics to measure the performance of the Black-Litterman model constructed based on these three different deep learning models. At the same time, there are three benchmark models with the other portfolio methods.
From the empirical analysis of our study, we can get the following results:
1. Among the three deep learning models, we use mean square error to compare the model prediction results. The GRU model outperforms the other two models in the performance of seven investment stock assets, and can more effectively capture the future trend and price of the stock. The LSTM model performs better than the RNN model.
2. In the comparison of the portfolio models, the Black-Litterman model constructed by using BERT to measure news sentiment and using the GRU model to predict stock prices has the highest annualized return rate of 46.6%. At the same time, it has the highest 13.0% Sharpe Ratio and 17.9% Sortino Ratio, which means that it is still better than other constructed portfolios under a certain degree of risk. |
Reference: | Akita, R., Yoshihara, A., Matsubara, T., Uehara, K. (2016). Deep learning for stock prediction using numerical and textual information. International Conference on Computer and Information Science IEEE, (June), 1-6.
Ariyo, A. A., Adewumi, A. O., Ayo, C. K. (2014). Stock price prediction using the ARIMA model. International Conference on Computer Modelling and Simulation IEEE, (March), 106-112.
Avramov, D., & Zhou, G. (2010). Bayesian portfolio analysis. Annual Review Financial Economics, 2(1), 25-47.
Bahdanau, D., Cho, K., & Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473.
Bengio, Y., Ducharme, R., Vincent, P., Jauvin, C. (2003). A neural probabilistic language model. Journal of machine learning research, 3(Feb), 1137-1155.
Best, M. J., & Grauer, R. R. (1991). On the sensitivity of mean-variance-efficient portfolios to changes in asset means: some analytical and computational results. The review of financial studies, 4(2), 315-342.
Birbeck, E., & Cliff, D. (2018). Using Stock Prices as Ground Truth in Sentiment Analysis to Generate Profitable Trading Signals. IEEE Symposium Series on Computational Intelligence, (November), 1868-1875.
Black, F., & Litterman, R. (1990). Asset allocation: combining investor views with market equilibrium. Goldman Sachs Fixed Income Research, 115.
Black, F., \\& Litterman, R. (1992). Global portfolio optimization. Financial analysts journal, 48(5), 28-43.
Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. Journal of machine Learning research, 3(January), 993-1022.
Cambria, E., Speer, R., Havasi, C., & Hussain, A. (2010). Senticnet: A publicly available semantic resource for opinion mining. AAAI Fall Symposium Series., (November).
Chang, C. C., & Lin, C. J. (2011). LIBSVM: A library for support vector machines. ACM transactions on intelligent systems and technology (TIST), 2(3), 1-27.
Cho, K., Van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., & Bengio, Y. (2014). Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078.
Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2018). Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.
Ding, X., Zhang, Y., Liu, T., & Duan, J. (2014). Using structured events to predict stock price movement: An empirical investigation. Empirical Methods in Natural Language Processing, (October), 1415-1425.
Fama, E. F. (1995). Random walks in stock market prices. Financial analysts journal, 51(1), 75-80.
He, G., & Litterman, R. (2002). The intuition behind Black-Litterman model portfolios. Social Science Research Network 334304.
Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural computation, 9(8), 1735-1780.
Idzorek, T. (2007). A step-by-step guide to the Black-Litterman model: Incorporating user-specified confidence levels. Forecasting expected returns in the financial markets, 17-38.
Le, Q., & Mikolov, T. (2014). Distributed representations of sentences and documents. International conference on machine learning, (January), 1188-1196.
Lee, W. (2000). Advanced Theory and Methodology of Tactical Asset Allocation. New York: John Wiley & Sons.
Litterman, B. (2004). Modern investment management: an equilibrium approach (Vol. 246). John Wiley & Sons.
Malkiel, B. G., & Fama, E. F. (1970). Efficient capital markets: A review of theory and empirical work. The journal of Finance, 25(2), 383-417.
Markowitz, H.M. (1952). “Portfolio Selection.” The Journal of Finance, (March), 77-91.
Michaud, R. O. (1989). The Markowitz optimization enigma: Is ‘optimized’optimal?. Financial Analysts Journal, 45(1), 31-42.
Mikolov, T., Karafiát, M., Burget, L., Černocký, J., & Khudanpur, S. (2010). Recurrent neural network based language model.Eleventh annual conference of the international speech communication association.
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013). Distributed representations of words and phrases and their compositionality. Advances in neural information processing systems, 3111-3119.
Satchell, S., & Scowcroft, A. (2000). A demystification of the Black–Litterman model: Managing quantitative and traditional portfolio construction. Journal of Asset Management, 1}(2), 138-150.
Peters, M. E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., & Zettlemoyer, L. (2018). Deep contextualized word representations. arXiv preprint arXiv:1802.05365.
Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1985). Learning internal representations by error propagation (No. ICS-8506). California Univ San Diego La Jolla Inst for Cognitive Science.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 5998-6008.
Wang, J. H., Liu, T. W., Luo, X., & Wang, L. (2018). An lstm approach to short text sentiment classification with word embeddings. Proceedings of the 30th conference on computational linguistics and speech processing, (October), 214-223.
Wu, Y., Schuster, M., Chen, Z., Le, Q. V., Norouzi, M., Macherey, W., & Klingner, J. (2016). Google`s neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144.
Xing, F. Z., Cambria, E., & Welsch, R. E. (2018). Intelligent asset allocation via market sentiment views. IEEE ComputatioNal iNtelligeNCe magaziNe, 13(4), 25-34. |
Description: | 碩士 國立政治大學 統計學系 107354017 |
Source URI: | http://thesis.lib.nccu.edu.tw/record/#G0107354017 |
Data Type: | thesis |
DOI: | 10.6814/NCCU202000816 |
Appears in Collections: | [統計學系] 學位論文
|
Files in This Item:
File |
Description |
Size | Format | |
401701.pdf | | 1370Kb | Adobe PDF2 | 0 | View/Open |
|
All items in 政大典藏 are protected by copyright, with all rights reserved.
|