Functional data; Non parametric regression; Semimetric; Wavelet; Misalignment; Cross-Validation
Abstract :
[en] Our goal is to predict a scalar value or a group membership from the discretized observation of curves with sharp local features that might vary both vertically and horizontally. To this aim, we propose to combine the use of the nonparametric functional regression estimator developed by Ferraty and Vieu (2006) [18] with the Bagidis semimetric developed by Timmermans and von Sachs (submitted for publication) [36] with a view of efficiently measuring dissimilarities between curves with sharp patterns. This association is revealed as powerful. Under quite general conditions, we first obtain an asymptotic expansion for the small ball probability indicating that Bagidis induces a fractal topology on the functional space. We then provide the rate of convergence of the nonparametric regression estimator in this case, as a function of the parameters of the Bagidis semimetric. We propose to optimize those parameters using a cross-validation procedure, and show the optimality of the selected vector. This last result has a larger scope and concerns the optimization of any vector parameter characterizing a semimetric used in this context. The performances of our methodology are assessed on simulated and real data examples. Results are shown to be superior to those obtained using competing semimetrics as soon as the variations of the significant sharp patterns in the curves have a horizontal component.
Ait-Saidi A., Ferraty F., Kassa R., Vieu P. Cross-validated estimations in the single-functional index model. Statistics 2008, 42:475-494.
Aneiros-Pérez G., Cardot H., Estévez-Pérez G., Vieu P. Maximum ozone concentration forecasting by functional non-parametric approaches. Environmetrics 2004, 15:675-685.
Aneiros-Pérez G., Vieu P. Nonparametric time series prediction: a semi-functional partial linear modeling. Journal of Multivariate Analysis 2008, 99:834-857.
Aneiros-Pérez G., Vieu P. Nonparametric time series prediction: a semi-functional partial linear modeling. Statistical Inference for Stochastic Processes 2008, 99:834-857.
Bosq D. Linear Processes in Function Spaces 2000, Springer, Berlin.
Burba F., Ferraty F., Vieu P. k-nearest neighbour method in functional nonparametric regression. Journal of Nonparametric Statistics 2009, 21:453-469.
Cai T.T., Hall P. Prediction in functional linear regression. Annals of Statistics 2006, 34:2159.
Crambes C., Kneip A., Sarda P. Smoothing splines estimators for functional linear regression. Annals of Statistics 2008.
Delsol L., Ferraty F., Vieu P. Structural test in regression on functional variables. Journal of Multivariate Analysis 2011, 102:422-447.
Ferraty F., Laksaci A., Tadj A., Vieu P. Rate of uniform consistency for nonparametric estimates with functional variables. Journal of Statistical Planning and Inference 2010, 140:335-352.
Ferraty F., Laksaci A., Tadj A., Vieu P. Kernel regression with functional response. Electronic Journal of Statistics 2011, 5:159-171.
Ferraty F., Laksaci A., Vieu P. Estimating some characteristics of the conditional distribution in nonparametric functional models. Statistical Inference for Stochastic Processes 2005, 9:47-76.
Ferraty F., Laksaci A., Vieu P. Functional time series prediction via conditional mode estimation. Comptes Rendus de l'Academie des Sciences, Paris 2005, 340:389-392.
Ferraty F., Rabhi A., Vieu P. Conditional quantiles for dependent functional data with application to the climatic El nino phenomenon. Sankhyā: The Indian Journal of Statistics 2005, 67:378-398. (special issue on Quantile regression and related methods).
Ferraty F., Rabhi A., Vieu P. Estimation non-paramétrique de la fonction de hasard avec variable explicative fonctionelle. Revue Roumaine de Mathématiques Pures et Appliquées 2006, 53:1-18.
Ferraty F., Romain Y. The Oxford Handbook of Functional Data Analysis 2010, Oxford University Press, USA, City.
Ferraty F., Vieu P. Nonparametric models for functional data, with applications in regression, time series prediction and curve discrimination. Journal of Nonparametric Statistics 2004, 16:111-127.
Ferraty F., Vieu P. Nonparametric Functional Data Analysis: Theory and Practice. Springer Series in Statistics 2006, Springer-Verlag New York, Inc., Secaucus, NJ, USA.
Ferraty F., Vieu P. Additive prediction and boosting for functional data. Computational Statistics and Data Analysis 2009, 53:1400-1413.
Fryzlewicz P. Unbalanced Haar technique for non parametric function estimation. Journal of the American Statistical Association 2007, 102:1318-1327.
Girardi M., Sweldens W. A new class of unbalanced Haar wavelets that form an unconditional basis for Lp on general measure spaces. Journal of Fourier Analysis and Applications 1997, 3:457-474.
Guyon I., Elisseeff A. An introduction to variable and feature selection. Journal of Machine Learning Research 2003, 3:1157-1182.
Hardle W., Marron J. Optimal bandwidth selection in nonparametric regression function estimation. Annals of Statistics 1985, 13:1465-1481.
Laksaci A., Mechab B. Nonparametric estimation of the hazard function with functional regressor: the case of spatial data. Revue Roumaine de Mathématiques Pures et Appliquées 2010, 55:35-51. (estimation non-paramétrique de la fonction de hasard avec variable explicative fonctionnelle: cas des données spatiales).
Marron J., Hardle W. Random approximations to some measures of accuracy in nonparametric curve estimation. Journal of Multivariate Analysis 1986, 20:91-113.
Masry E. Nonparametric regression estimation for dependent functional data: asymptotic normality. Stochastic Processes and their Applications 2005, 115:155-177.
Quintela-Del-Rio A. Hazard function given a functional variable: non-parametric estimation under strong mixing conditions. Journal of Nonparametric Statistics 2008, 20:413-430.
R Development Core Team, R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, 2008. ISBN: 3-900051-07-0.
Rachdi M., Vieu P. Nonparametric regression for functional data: automatic smoothing parameter selection. Journal of Statistical Planning and Inference 2007, 137:2784-2801.
Ramsay J., Dalzell C. Some tools for functional data analysis (with discussion). Journal of the Royal Statistical Society, Series B 1991, 53:539-572.
Ramsay J., Silverman B. Applied Functional Data Analysis: Methods and Case Studies. Springer Series in Statistics 2002, Springer-Verlag, New York.
Ramsay J., Silverman B. Functional Data Analysis. Springer Series in Statistics 2005, Springer. second ed.
Sood A., James G.M., Tellis G.J. Functional regression: a new model for predicting market penetration of new products. Marketing Science 2009, 28:36-51.
Stone C.J. Optimal global rates of convergence for nonparametric regression. The Annals of Statistics 1982, 10:1040-1053.
C. Timmermans, P. Fryzlewicz, Shah: shape-adaptive haar wavelet transform for images with applications to classification, ISBA Discussion Paper 2012/15, 2012 (submitted for publication).
C. Timmermans, R. von Sachs, Bagidis, a new method for statistical analysis of differences between curves with sharp local patterns, ISBA Discussion Paper 2010/30, 2010 (submitted for publication).