Loading...
|
Please use this identifier to cite or link to this item:
https://nccur.lib.nccu.edu.tw/handle/140.119/32680
|
Title: | 以文件分類技術預測股價趨勢 Predicting Trends of Stock Prices with Text Classification Techniques |
Authors: | 陳俊達 Chen, Jiun-da |
Contributors: | 王台平 劉昭麟 Wang, Tai-Ping Liu, Chao-Lin 陳俊達 Chen, Jiun-da |
Keywords: | 股價預測 文字探勘 簡易貝氏模型 k最近鄰居模型 混合模型 Stock Price Prediction text mining naïve Bayesian models k-nearest neighbors models hybrid models |
Date: | 2006 |
Issue Date: | 2009-09-17 14:02:52 (UTC+8) |
Abstract: | 股價的漲跌變化是由於證券市場中眾多不同投資人及其投資決策後所產生的結果。然而,影響股價變動的因素眾多且複雜,新聞也屬於其中一種,新聞事件不但是投資人用來得知該股票上市公司的相關營運資訊的主要媒介,同時也是影響投資人決定或變更其股票投資策略的主要因素之一。本研究提出以新聞文件做為股價漲跌預測系統的基礎架構,透過文字探勘技術及分類技術來建置出能預測當日個股收盤股價漲跌趨勢之系統。 本研究共提出三種分類模型,分別是簡易貝氏模型、k最近鄰居模型以及混合模型,並設計了三組實驗,分別是分類器效能的比較、新聞樣本資料深度的比較、以及新聞樣本資料廣度的比較來檢驗系統的預測效能。實驗結果顯示,本研究所提出的分類模型可以有效改善相關研究中整體正確率高但各個類別的預測效能卻差異甚大的情況。而對於影響投資人獲利與否的關鍵類別"漲"及類別"跌"的平均預測效能上,本研究所提出的這三種分類模型亦同時具有良好的成效,可以做為投資人進行投資決策時的有效參考依據。 Stocks` closing price levels can provide hints about investors` aggregate demands and aggregate supplies in the stock trading markets. If the level of a stock`s closing price is higher than its previous closing price, it indicates that the aggregate demand is stronger than the aggregate supply in this trading day. Otherwise, the aggregate demand is weaker than the aggregate supply. It would be profitable if we can predict the individual stock`s closing price level. For example, in case that one stock`s current price is lower than its previous closing price. We can do the proper strategies(buy or sell) to gain profit if we can predict the stock`s closing price level correctly in advance. In this thesis, we propose and evaluate three models for predicting individual stock`s closing price in the Taiwan stock market. These models include a naïve Bayes model, a k-nearest neighbors model, and a hybrid model. Experimental results show the proposed methods perform better than the NewsCATS system for the "UP" and "DOWN" categories. |
Reference: | [1] Yahoo!奇摩股市,http://tw.stock.yahoo.com/。 [2] 中文斷詞系統,http://ckipsvr.iis.sinica.edu.tw/。 [3] 中央研究院資訊科學所中文組實驗室中文詞知識庫小組,http://godel.iis.sinica.edu.tw/CKIP/index.htm。 [4] 中華民國證券櫃檯買賣中心,http://www.otc.org.tw/。 [5] 方世榮,統計學導論,華泰書局,頁39-81、215-231,1993。 [6] 王春笙,以技術指標預測台灣股市股價漲跌之實證研究-以類神經網路與複迴歸模式建構,台灣大學資訊管理研究所碩士論文,1996。 [7]王疏艷,基於決策樹方法的分類規則的挖掘,海鼎出版,2002,http://hd123.com/asprun/Message/MessageList.asp?gid=17658。 [8] 杜金龍,基本分析在台灣股市應用的訣竅,財訊出版社,頁9-30,2002。 [9] 邱浩政,量化研究與統計分析,五南圖書,頁3-11,2000。 [10] 施正宏,結合總體經濟指標及個股財報資料以預測個股漲跌-以台灣電子類股為例,中原大學資訊管理學系碩士論文,2004。 [11] 淺井涌二郎,投資劃線原理,投資月刊社,頁6-108,1978。 [12] 曾元顯,"關鍵詞自動擷取技術與相關詞回饋",中國圖書館學會會報59期,頁59-64,1997。 [13] 曾龍,資料採礦-概念與技術,維科圖書,頁279-330,2003。 [14] 臺灣證券交易所,http://www.tse.com.tw/。 [15] 謝劍平,現代投資學,智勝文化,頁402-519,1998。 [16] 謝德宗,投資學,華泰書局,頁235-253、324、403-418,1997。 [17] 鍾任明,運用文字探勘於日內股價漲跌趨勢預測之研究,中原大學資訊管理研究所碩士論文,2005。 [18] 鐘朝宏,投資學,五南圖書,頁243-368、400-441,1992。 [19] Helmut Braun and John S. Chandler, "Predicting Stock Market Behavior through Rule Induction: An Application of the Learning-from-Example Approach," Decision Sciences, volume 18, number 3, pp. 415-429, 1987. [20] Man-Chung Chan, Chi-Cheong Wong, W. F. Tse, Bernard K.-S. Cheung, Gordon Y.-N. Tang, "Artificial Intelligence in Portfolio Management," Intelligent Data Engineering and Automated Learning, volume 2412 , pp. 403-409, 2002. [21] Corinna Cortes and Vladimir Vapnik, "Support-Vector Networks," Machine Learning, Volume 20, Number 13, 1995. [22] Eugene Fama, "Efficient Capital Markets: A Review of Theory and Empirical Work," The Journal of Finance Papers and Proceedings of the Twenty-Eighth Annual Meeting of the American Finance Association New York, volume 25, number 2, pp. 383-417, 1969. [23] Gabriel Pui Cheong Fung, Jeffrey Xu Yu and Wai Lam, "News Sensitive Stock Trend Prediction," Proceedings of the Sixth Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 289-296, 2002. [24] Győző Gidófalvi, "Using News Articles to Predict Stock Price Movements," Technical Report: CSE 254, Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA, 2001. [25] Jiawei Han and Micheline Kamber, Data Mining: Concepts and Techniques, Second Edition, Morgan Kaufmann, pp. 614-626, 2006. [26] John H. Holland, "Adaptation in Natural and Artificial Systems," University of Michigan Press, Ann Arbor, 1975. [27] Hans Peter Luhn, "The Automatic Creation of Literature Abstracts," IBM of Research and Development, pp. 159-165, 1958. [28] Wei-Yun Ma and Keh-Jiann Chen, "Introduction to CKIP Chinese Word Segmentation System for the First International Chinese Word Segmentation Bakeoff," Proceedings of ACL, Second SIGHAN Workshop on Chinese Language Processing, pages 168-171, 2003. [29] MarketThoughts.com,http://www.marketthoughts.com/dow_theory.html. [30] Marc-André Mittermayer, "Forecasting Intraday Stock Price Trends with Text Mining Techniques," Proceedings of the Thirty-Seventh Annual Hawaii International Conference on System Sciences, Track 3, p. 30064b, 2004. [31] Gerard Salton, Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer, Addison-Wesley, 1989. [32] Gerard Salton, A. Wong, and C. S. Yang, "A Vector Space Model for Automatic Indexing," Communications of the ACM, volume 18, pp. 613-620, 1975. [33] Robert P. Schumaker and Hsinchun Chen, "Textual Analysis of Stock Market Prediction Using Financial News Articles," Proceedings of the Twelfth Americas Conference on Information Systems, Acapulco, Mexico, 2006. [34] Sholom Weiss, Nitin Indurkhya, Tong Zhang and Fred Damerau, Text mining: predictive methods for analyzing unstructured information, Springer, pp. 35-91, 2005. [35] Wikipedia,http://www.wikipedia.org/. [36] Ian H. Witten and Eibe Frank, Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations, Morgan Kaufmann, pp. 76-80, pp. 88-96, pp. 149-151, pp. 244-252, pp. 296-304, 2000. [37] Beat Wüthrich, Vincent Cho, S. Leung, D. Permunetilleke, K. Sankaran, and J. Zhang, "Daily Stock Market Forecast from Textual Web Data," Proceedings of the 1998 IEEE International Conference on Systems, Man, and Cybernetics, pp. 2720-2725, 1998. |
Description: | 碩士 國立政治大學 資訊科學學系 94753014 95 |
Source URI: | http://thesis.lib.nccu.edu.tw/record/#G0094753014 |
Data Type: | thesis |
Appears in Collections: | [資訊科學系] 學位論文
|
All items in 政大典藏 are protected by copyright, with all rights reserved.
|