English  |  正體中文  |  简体中文  |  Post-Print筆數 : 27 |  Items with full text/Total items : 113822/144841 (79%)
Visitors : 51828155      Online Users : 511
RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version
    政大機構典藏 > 商學院 > 統計學系 > 學位論文 >  Item 140.119/136767
    Please use this identifier to cite or link to this item: https://nccur.lib.nccu.edu.tw/handle/140.119/136767


    Title: 結合spline及分箱方式之廣義線性模型預測
    Generalized linear model prediction combined with spline and binning method
    Authors: 楊翔宇
    Yang, Shiang-Yu
    Contributors: 黃子銘
    Huang, Tzee-Ming
    楊翔宇
    Yang, Shiang-Yu
    Keywords: 無母數方法
    變數選取
    分段多項式
    節點選取
    分箱方法
    B-spline
    Nonparametric method
    Piecewise polynomial
    Variable selection
    Knot selection
    WOE of binning
    Binning method
    Date: 2021
    Issue Date: 2021-08-05 10:12:41 (UTC+8)
    Abstract: 在日常生活中,總是要面臨許多資料。大部分的資料都是夾雜著類別型變數以及連續型變數的資料。針對這種資料,提出了一個方式可以對自變數稍作些許處理,並以處理後的自變數加以預測資料,達到不錯的效果。
    本研究方法將會使用R語言以針對銀行信用卡違約付款的資料作為主要的研究對象。以下個月是否有違約行為作為反應變數,其反應變數以1(有違約行為)、0(無違約行為)做為表示。利用模型可以從中了解信用卡用戶的基本訊息影響違約行為與否的機率,供以衡量信用卡用戶未來將會違約的機率,以幫助銀行對這些客戶進行限制,以降低銀行虧損的風險。
    In our daily lives, we always have to face a great amount of large datasets. Most of them are combined with categorical variables and continuous variables. Regarding this type of data, we proposed a method for model construction and prediction.
    The proposed method is applied to the data of bank credit card default payments as the main research object. The response variable is the payment situation in the following months. “1” means the user with breach of contract and “0” means without breach of contract. Using the model, we can understand the association between the basic information of credit card users and their default behavior, which can be used to measure the probabilities that credit card users will default in the future, so as to help banks monitor customers and reduce the risk of bank losses.
    Reference: [1] L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone. Classification and Regression Trees. Wadsworth and Brooks, Monterey, CA, 1984.
    [2] C. de Boor. A Practical Guide to Splines. Springer Verlag, New York, 1978.
    [3] J. F. Gamble. Asbestos and colon cancer: A weight-of-the-evidence review. Environmental Health Perspectives, 102:1038-1050, 1994.
    [4] I. Guyon and A. Elisseeff. An introduction to variable and feature selection. Journal of Machine Learning Research, 3:1157-1182, 2003.
    [5] T. K. Ho. Random decision forests. In Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 1), pages 278-282,
    Montreal, Que.,Canada, 1995. IEEE Computer Society.
    [6] T. M Huang. A knot selection algorithm for splines in logistic regression. In Proceedings of the 2020 3rd International Conference on Mathematics and Statistics,
    page 29-33, New York, NY, USA, 2020. Association for Computing Machinery.
    [7] J. Jinot and S. Bayard. Dissent respiratory health effects of passive smoking: Epa’s weight-of-evidence analysis. Journal of Clinical Epidemiology, 47(4):339-349, 1994.
    [8] R. Kerber. Chimerge: Discretization of numeric attributes. In Proceedings of the Tenth National Conference on Artificial Intelligence, AAAI’92, page 123-128.
    AAAI Press, 1992.
    [9] N. Shaltout, M. Elhefnawi, A. Rafea, and A. Moustafa. Information gain as a feature selection method for the efficient classification of influenza based on viral hosts. Lecture Notes in Engineering and Computer Science, 1:625-631, 2014.
    [10] D. Weed. Weight of evidence: A review of concept and methods. Risk analysis : an official publication of the Society for Risk Analysis, 25:1545-1557, 2005.
    [11] G. Zeng. A necessary condition for a good binning algorithm in credit scoring. Applied Mathematical Sciences, Vol. 8:3229-3242, 2014.
    Description: 碩士
    國立政治大學
    統計學系
    108354008
    Source URI: http://thesis.lib.nccu.edu.tw/record/#G0108354008
    Data Type: thesis
    DOI: 10.6814/NCCU202100841
    Appears in Collections:[統計學系] 學位論文

    Files in This Item:

    File Description SizeFormat
    400801.pdf1738KbAdobe PDF268View/Open


    All items in 政大典藏 are protected by copyright, with all rights reserved.


    社群 sharing

    著作權政策宣告 Copyright Announcement
    1.本網站之數位內容為國立政治大學所收錄之機構典藏,無償提供學術研究與公眾教育等公益性使用,惟仍請適度,合理使用本網站之內容,以尊重著作權人之權益。商業上之利用,則請先取得著作權人之授權。
    The digital content of this website is part of National Chengchi University Institutional Repository. It provides free access to academic research and public education for non-commercial use. Please utilize it in a proper and reasonable manner and respect the rights of copyright owners. For commercial use, please obtain authorization from the copyright owner in advance.

    2.本網站之製作,已盡力防止侵害著作權人之權益,如仍發現本網站之數位內容有侵害著作權人權益情事者,請權利人通知本網站維護人員(nccur@nccu.edu.tw),維護人員將立即採取移除該數位著作等補救措施。
    NCCU Institutional Repository is made to protect the interests of copyright owners. If you believe that any material on the website infringes copyright, please contact our staff(nccur@nccu.edu.tw). We will remove the work from the repository and investigate your claim.
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - Feedback