Loading...
|
Please use this identifier to cite or link to this item:
https://nccur.lib.nccu.edu.tw/handle/140.119/146902
|
Title: | 基於主成分分析之Spline迴歸節點選取演算法 An Algorithm for Knot Selection in Spline Regression Based on Principal Component Analysis |
Authors: | 蔡耀德 Tsai, Yao-De |
Contributors: | 黃子銘 Huang, Tzee-Ming 蔡耀德 Tsai, Yao-De |
Keywords: | 函數近似 無母數迴歸 Spline 函數 主成分分析 節點選取 非固定節點 Function approximation Nonparametric regression Spline function Principal component analysis Knot selection Free knots |
Date: | 2023 |
Issue Date: | 2023-09-01 14:56:48 (UTC+8) |
Abstract: | 在函數近似的方法中,我們可以使用無母數迴歸進行近似,其中常以Spline函數為做為基底函數進行函數估計。而在Spline函數的建構中,節點的選取相當重要,其位置與個數將直接影響函數的估計 效果。因此本文以冪基底及截斷冪基底作為Spline函數的基底,提出基於主成分分析(PCA)的節點選取方法。在選取節點時,依照設定間距δ,將測試節點分為重要節點及次要節點,其後將次要節點效應進行PCA,並排除應變數受次要節點之主成分的影響,依照貝氏資訊量準則(BIC)逐步挑選重要節點,最後設定不同的δ找出能使模型BIC最小的重要節點列表。在實驗數據中,將本文方法與向前選取法、Huang(2019)的方法在B-spline函數以及四個訊號處理的函數進行比較,透過Wilcoxon配對秩檢定可知本文提出的方法在B-spline函數中樣本數小時優於另外兩種方法,且在四個訊號函數中不論樣本數大小優於另外兩種方法,然而在樣本數多時明顯需要耗費較多時間。 In nonparametric regression, it is common to first approximate the regression function using functions known up to a finite dimensional parameter and then estimate the unknown parameter to obtain the regression function estimator. One commonly used class of functions for function approximation is the class of splines. When using a spline for function approximation, the position and number of knots of the spline crucially affect the approximation performance. Therefore, in this thesis, a knot selection method based on principal component analysis (PCA) is proposed. The proposed knot selection method involves testing whether knots at specific locations need to be placed. During the knot selection process, splines are expressed as linear combinations of power basis functions and truncated power basis functions based on the test knots, and the test knots are divided into important and minor knots based on a specified spacing parameter δ. Subsequently, PCA is applied to the effects of minor knots, and a test is performed to test the significance of a test knot after the influence of principal components of minor knots on the dependent variable is eliminated. Important knots are then selected stepwise according to the Bayesian Information Criterion (BIC). Finally, finding the set of important knots that minimizes the BIC of the model by considering a set of δ`s. Simulation experiments have been carried out to compare the proposed method with forward selection and Huang`s method (2019), where the regression function is either a simulated spline function or one of four specific functions used in signal processing. The experiment results indicate that the proposed method outperforms the other two methods when the sample size is small and the regression function is a spline, or when the regression function is one of the four given functions, regardless of the sample size. The comparison is based on Wilcoxon`s matched pairs rank test. However, the proposed method is time-consuming when the sample size is large. |
Reference: | Bartels, R. H., Beatty, J. C., and Barsky, B. A. (1995). An introduction to splines for use in computer graphics and geometric modeling. Morgan Kaufmann. Dani, A., Ratnasari, V., and Budiantara, I. (2021). Optimal knots point and bandwidth selection in modeling mixed estimator nonparametric regression. IOP Conference Series: Materials Science and Engineering, 1115(1):012020. Donoho, D. L. and Johnstone, I. M. (1994). Ideal spatial adaptation by wavelet shrinkage. biometrika, 81(3):425–455. Eubank, R. L. (1999). Nonparametric regression and spline smoothing. CRC press. Fan, J., Lou, Z., and Yu, M. (2023). Are latent factor regression and sparse regression adequate? Journal of the American Statistical Association, pages 1–13. Friedman, J. H. (1991). Multivariate adaptive regression splines. The annals of statistics, 19(1):1–67. George, E. I. and McCulloch, R. E. (1993). Variable selection via Gibbs sampling. Journal of the American Statistical Association, 88(423):881–889. Huang, T. M. (2019). A knot selection algorithm for regression splines. Proceedings of the 62th ISI World Statistics Congress, Contributed Paper Session, 2:372–377. Ratnasari, V., Budiantara, I., Ratna, M., and Zain, I. (2016). Estimation of nonparametric regression curve using mixed estimator of multivariable truncated spline and multivariable kernel. Global Journal of Pure and Applied Mathematics, 12(6):5047–5057. Wang, R.-H. (2013). Multivariate spline functions and their applications, volume 529. Springer Science & Business Media. Zhou, S. and Shen, X. (2001). Spatially adaptive regression splines and accurate knot selection schemes. Journal of the American Statistical Association, 96(453):247–259. |
Description: | 碩士 國立政治大學 統計學系 110354016 |
Source URI: | http://thesis.lib.nccu.edu.tw/record/#G0110354016 |
Data Type: | thesis |
Appears in Collections: | [統計學系] 學位論文
|
Files in This Item:
File |
Description |
Size | Format | |
401601.pdf | | 3529Kb | Adobe PDF2 | 0 | View/Open |
|
All items in 政大典藏 are protected by copyright, with all rights reserved.
|