FABS algorithm; Gini coefficient; Lorenz regression; SCAD penalty; single-index models; Statistics and Probability; Statistics, Probability and Uncertainty
Abstract :
[en] The Lorenz regression estimates the explained Gini coefficient, a quantity with a natural application in the measurement of inequality of opportunity. Assuming a single-index model, it corresponds to the Gini coefficient of the conditional expectation of a response given some covariates and it can be estimated without having to estimate the link function. However, it is prone to overestimation when many covariates are included. In this paper, we propose a penalised bootstrap procedure which selects the relevant covariates and produces valid inference for the explained Gini coefficient. The obtained estimator achieves the Oracle property. Numerically, it is computed by the SCAD-FABS algorithm, an adaptation of the FABS algorithm to the SCAD penalty. The performance of the procedure is ensured by theoretical guarantees and assessed via Monte-Carlo simulations. Finally, a real data example is presented.
Disciplines :
Quantitative methods in economics & management
Author, co-author :
Jacquemain, Alexandre ; ISBA, UCLouvain, Belgium
Heuchenne, Cédric ; Université de Liège - ULiège > HEC Liège : UER > UER Opérations : Statistique appliquée à la gestion et à l'économie
Pircalabelu, Eugen; HEC Liege, University of Liège, Belgium
Language :
English
Title :
A penalised bootstrap estimation procedure for the explained Gini coefficient
Publication date :
2024
Journal title :
Electronic Journal of Statistics
eISSN :
1935-7524
Publisher :
Institute of Mathematical Statistics
Volume :
18
Issue :
1
Pages :
247 - 300
Peer reviewed :
Peer Reviewed verified by ORBi
Funding text :
Computational resources have been provided by the supercomputing facilities of the UCLouvain (CISM/UCL) and the Consortium des Équipements de Calcul Intensif en Fédération Wallonie Bruxelles (CÉCI) funded by the Fond de la Recherche Scientifique de Belgique (F.R.S.-FNRS) under convention 2.5020.11.
Bourguignon, F., Ferreira, F. H. G. and Menéndez, M. (2007). Inequality of Opportunity in Brazil. Review of Income and Wealth 53 585–618..
Brunori, P., Hufe, P. and Mahler, D. G. (2018). The Roots of Inequality: Estimating Inequality of Opportunity from Regression Trees Working Paper, World Bank, Washington, DC, available at SSRN: https://ssrn. com/abstract=3127234. https://doi.org/10.1596/1813-9450-8349.
Cavanagh, C. and Sherman, R. P. (1998). Rank Estimators for Monotonic Index Models. Journal of Econometrics 84 351–381. MR1630210.
Cowell, F. A. and Flachaire, E. (2015). Chapter 6 – Statistical Methods for Distributional Analysis. In Handbook of Income Distribution, (A. B. Atkinson and F. Bourguignon, eds.). Handbook of Income Distribution 2 359-465. Elsevier. https://doi.org/10.1016/B978-0-444-59428- 0.00007-2.
Escanciano, J. C. and Terschuur, J. R. (2023). Debiased Semiparametric U-Statistics: Machine Learning Inference on Inequality of Opportunity..
Fan, J. and Li, R. (2001). Variable Selection via Nonconcave Penalized Likelihood and Its Oracle Properties. Journal of the American Statistical Association 96 1348–1360. https://doi.org/10.1198/ 016214501753382273. MR1946581.
Ferreira, F. H. G. and Gignoux, J. (2011). The Measurement of Inequality of Opportunity: Theory and an Application to Latin America. Review of Income and Wealth 57 622–657. https://doi.org/10.1111/j. 1475-4991.2011.00467.x.
Friedman, J., Hastie, T., Höfling, H. and Tibshirani, R. (2007). Pathwise Coordinate Optimization. The Annals of Applied Statistics 1 302–332. https://doi.org/10.1214/07-AOAS131. MR2415737.
Han, A. K. (1987). Non-Parametric Analysis of a Generalized Regression Model: The Maximum Rank Correlation Estimator. Journal of Econometrics 35 303–316. https://doi.org/10.1016/0304-4076(87)90030-3. MR0903188.
Heuchenne, C. and Jacquemain, A. (2022). Inference for Monotone Single-Index Conditional Means: A Lorenz Regression Approach. Computational Statistics & Data Analysis 167 107347. MR4317417.
Lin, H. and Peng, H. (2013). Smoothed Rank Correlation of the Linear Transformation Regression Model. Computational Statistics & Data Analysis 57 615–630. https://doi.org/10.1016/j.csda.2012.07.012. MR2981113.
Mack, Y. P. and Müller, H.-G. (1989). Derivative Estimation in Nonparametric Regression with Random Predictor Variable. Sankhy¯a: The Indian Journal of Statistics, Series A (1961–2002) 51 59–72. MR1065559.
Mack, Y. P. and Silverman, B. W. (1982). Weak and Strong Uniform Consistency of Kernel Regression Estimates. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete 61 405–415. https://doi.org/ 10.1007/BF00539840. MR0679685.
Roemer, J. E. and Trannoy, A. (2015). Chapter 4 – Equality of Opportunity. In Handbook of Income Distribution, (Atkinson, Antony, Bourguignon and François, eds.). Handbook of Income Distribution 2 217–300. Elsevier..
Shi, X., Huang, Y., Huang, J. and Ma, S. (2018). A Forward and Backward Stagewise Algorithm for Nonconvex Loss Functions with Adaptive Lasso. Computational Statistics & Data Analysis 124 235–251. https:// doi.org/10.1016/j.csda.2018.03.006. MR3787624.
Silverman, B. W. (1978). Weak and Strong Uniform Consistency of the Kernel Estimate of a Density and Its Derivatives. The Annals of Statistics 6 177–184. MR0471166.
van der Vaart, A. W. (1998). Asymptotic Statistics. Cambridge University Press. MR1652247.
van der Vaart, A. W. and Wellner, J. A. (1996). Weak Convergence and Empirical Processes: With Applications to Statistics. Springer Series in Statistics. Springer New York, NY. https://doi.org/10.1007/978- 1-4757-2545-2_15. MR1385671.
Yamato, H. (1973). Uniform Convergence of an Estimator of a Distribution Function. Bulletin of Mathematical Statistics 15 69–78. https:// doi.org/info:doi/10.5109/13073. MR0329113.
Zhang, C.-H. and Zhang, S. S. (2014). Confidence Intervals for Low Dimensional Parameters in High Dimensional Linear Models. Journal of the Royal Statistical Society. Series B (Statistical Methodology) 76 217–242. MR3153940