政大機構典藏-National Chengchi University Institutional Repository(NCCUR):Item 140.119/124147
English  |  正體中文  |  简体中文  |  Post-Print筆數 : 27 |  全文笔数/总笔数 : 113656/144643 (79%)
造访人次 : 51733018      在线人数 : 597
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜寻范围 查询小技巧:
  • 您可在西文检索词汇前后加上"双引号",以获取较精准的检索结果
  • 若欲以作者姓名搜寻,建议至进阶搜寻限定作者字段,可获得较完整数据
  • 进阶搜寻
    政大機構典藏 > 商學院 > 金融學系 > 學位論文 >  Item 140.119/124147


    请使用永久网址来引用或连结此文件: https://nccur.lib.nccu.edu.tw/handle/140.119/124147


    题名: 強化學習應用於美式選擇權評價
    Applying Reinforcement Learning to American Option Pricing
    作者: 許琳
    Xu, Lin
    贡献者: 江彌修
    Chiang, Mi-Hsiu
    許琳
    Xu, Lin
    关键词: 美式選擇權
    定價
    強化學習
    最小平方策略迭代
    最小平方蒙地卡羅法
    American option
    Pricing
    Reinforcement learning
    LSPI
    FQI
    LSM
    日期: 2019
    上传时间: 2019-07-01 10:48:51 (UTC+8)
    摘要: 本文研究了強化學習應用於美式選擇權定價問題,首先,使用 Li, Szepesvari and Schuurmans 提出之最小平方策略迭代(LSPI)演算法學習美式賣權履約策略並進行定價,將蘋果公司美式股票選擇權之真實市場數據處理後套用於 LSPI 方法,並將 LSPI 方法與 Tsitsiklis and Van Roy提出之FQI方法和傳統最小平方蒙地卡羅法比較定價準確性。其次,使用符合金融市場之分析方式,將賣權分價內外不同情況分析,並進行敏感度分析,觀察強化學習使用之參數對於定價結果之影響。模擬結果表示,LSPI 方法與 FQI 方法 總體優於 LSM 方法,強化學習對於愈價內之賣權定價愈準確。本文發現強化學習在商品定價領域仍有很大研究潛力,特別是模擬路徑方式與執行動作多樣性方面值得進一步討論。
    In this paper we apply the reinforcement learning method to American options pricing. We mainly consider the least squares policy iteration (LSPI) proposed by Li, Szepesvari and Schuurmans(2009) to learn the exercise policy and pricing method of American put options. We price AAPL American stock option with processed real market data, and compare the accuracy between LSPI, FQI proposed by Tsitsiklis and Van Roy(2001), and the standard least square Monte Carlo method (LSM). In order to investigate the influence of parameters used in LSPI on pricing results, the analysis method in financial market, sensitivity analysis is carried out under different situations which are divided according to whether the put option is in-the-money or out-of-the-money. The simulation result shows that LSPI and FQI are superior to LSM in general, and LSPI is more accurate in pricing deeper in-the-money put option. We also find that the reinforcement learning method still has great research potential in the field of derivatives pricing. In particular, there is a need for further investigation on simulation method of price path or selecting action variety.
    參考文獻: [1] Barone-Adesi, G. and Whaley, R. (1987). Efficient Analytical Approximation of American Option Values. Journal of Finance, Vol. 42, 301-320.
    [2] Bellman, R. (1957). A Markovian Decision Process. Journal of Mathematics and Mechanics, Vol. 6, 679–684.
    [3] Black, F. and Scholes, M. (1973). The Pricing of Options and Corporate Liabilities. Journal of Political Economy, Vol. 81, 637-659.
    [4] Boyle, P. P. (1977). Options: A Monte Carlo Approach. Journal of Financial Economics, Vol. 4, 323–338.
    [5] Boyle, P. P. (1986). A lattice framework for option pricing with two state variables, Journal of Financial and Quantitative Analysis, Vol. 23(1), 1-12.
    [6] Brennan, M. and Schwartz, E. (1977). The Valuation of American Put Options. Journal of Finance, Vol. 32, 449-462.
    [7] Cox, J. C., Ross S. A. and Rubinstein, M. (1979). Option Pricing: A simplified Approach. Journal of Financial Economics, Vol. 7, 229-264.
    [8] Dubrov, B. (2015). Monte Carlo Simulation with Machine Learning for Pricing American Options and Convertible Bonds. SSRN.
    [9] Geske, R. (1979). The Valuation of Compound Options. Journal of Financial Economics, Vol. 7, 63–81.
    [10] Geske, R. (1979). A Note on an Analytic Valuation Formula for Unprotected American Call Options on Stocks with Known Dividends. Journal of Financial Economics, Vol. 7, 275–380.
    [11] Geske, R. (1981). Comments on Whaley’s Note. Journal of Financial Economics, Vol. 9, 213–215.
    [12] Geske, R. and Johnson, H. E. (1984). The American Put Valued Analytically. Journal of Finance, Vol. 39, 1511-1524.
    [13] Haug, E.G., Haug, J. and Lewis, A. (2003). Back to Basics: a New Approach to the Discrete Dividend Problem. Wilmott Magazine, 37–47.
    [14] Howard, R. A. (1960). Dynamic Programming and Markov Processes. Cambridge, Mass: MIT Press.
    [15] Hull, J. C. (2011). Options, Futures, and Other Derivatives, 8th edition. United States of America: Prentice Hall.
    [16] Johnson, H. (1983). An Analytical Approximation for the American Put Price. Journal of Financial and Quantitative Analysis, Vol. 18, 141-148.
    [17] Ju., N. and Zhong, R. (1998). An Approximate Formula for Pricing American Options. Review of Financial Studies, Vol. 11, 627-646.
    [18] Lagoudakis, M. G. and Parr, R. (2003). Least-Squares Policy Iteration. Journal of Machine Learning Research , Vol. 4, 1107 – 1149.
    [19] Li, Y., Szepesvari, C. and Schuurmans, D. (2009). Learning Exercise Policies for American Options. In Proc. of the Twelfth International Conference on Artificial Intelligence and Statistics, JMLR: W&CP, Vol. 5, 352-359.
    [20] Longstaff, F. A. and Schwartz, E. S. (2001). Valuing American options by simulation: a simple Least-Squares approach. Review Financial Studies, Vol. 14, 113-147.
    [21] Medvedev, A. and Scaillet, O. (2010). Pricing American options under stochastic volatility and stochastic interest rates. Journal of Financial Economics, Vol. 98, 145–159.
    [22] Merton, R. (1976). Option pricing when underlying stock returns are discontinuous. Journal of Financial Economics, Vol. 2, 125–144.
    [23] Roll, R. (1977). An Analytical Valuation Formula for Unprotected American Call Options on Stocks with Known Dividends. Journal of Financial Economics, Vol. 5, 251-258.
    [24] Tilley, J. A. (1993). Valuing American options in a path simulation model. Transactions of the Society of Actuaries, Vol. 45, 83–104.
    [25] Tsitsiklis, J. N. and Van Roy, B. (2001). Regression Methods for Pricing Complex American-style Options. IEEE Transactions on Neural Networks(special issue on computational finance), Vol. 12(4), 694–703.
    [26] Whaley, R. E. (1981). On the Valuation of American Call Options on Stocks with Known Dividends. Journal of Financial Economics, Vol. 10, 207–211.
    [27]陳戚光,(2001)。選擇權:理論.實務與應用。台灣:智勝文化。
    描述: 碩士
    國立政治大學
    金融學系
    106352047
    資料來源: http://thesis.lib.nccu.edu.tw/record/#G0106352047
    数据类型: thesis
    DOI: 10.6814/NCCU201900058
    显示于类别:[金融學系] 學位論文

    文件中的档案:

    档案 大小格式浏览次数
    204701.pdf2399KbAdobe PDF2138检视/开启


    在政大典藏中所有的数据项都受到原著作权保护.


    社群 sharing

    著作權政策宣告 Copyright Announcement
    1.本網站之數位內容為國立政治大學所收錄之機構典藏,無償提供學術研究與公眾教育等公益性使用,惟仍請適度,合理使用本網站之內容,以尊重著作權人之權益。商業上之利用,則請先取得著作權人之授權。
    The digital content of this website is part of National Chengchi University Institutional Repository. It provides free access to academic research and public education for non-commercial use. Please utilize it in a proper and reasonable manner and respect the rights of copyright owners. For commercial use, please obtain authorization from the copyright owner in advance.

    2.本網站之製作,已盡力防止侵害著作權人之權益,如仍發現本網站之數位內容有侵害著作權人權益情事者,請權利人通知本網站維護人員(nccur@nccu.edu.tw),維護人員將立即採取移除該數位著作等補救措施。
    NCCU Institutional Repository is made to protect the interests of copyright owners. If you believe that any material on the website infringes copyright, please contact our staff(nccur@nccu.edu.tw). We will remove the work from the repository and investigate your claim.
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 回馈