Transformation To Normality Based On Empirical Distribution Functions
DOI:
https://doi.org/10.32871/rmrj1402.02.12Keywords:
transformation to normality, Box-Cox method, Johnson method, inequalities, Dvoretzky-Kiefer-WolfowitzAbstract
The paper examines an effi cient alternative to the Box-Cox and Yeo-Johnson’s
transformation to normality procedures which works under very general conditions. The method hinges on two fundamental results : the fact that the cumulative distribution function F(x) of a random variable X always has a U(0,1) distribution and the Box-Mueller transformation of uniform random variables to standard normal random variables. Given two observations x and y, we computed Fn(x) and Fn(y) , which for large n, are approximately uniform random variables. These values are then inputted into the Box-Mueller transformations. Bounds for the Kolmogorov-Smirnov statistic between the distribution of the transformed observations and the normal distribution are provided through numerical simulation and by appealing to the Dvoretzky-Kiefer-Wolfowitz inequality.
References
Craig, W. & Hogg , An Introduction to Mathematical Statistics, (Wiley and Sons, New York, 2000)
Dudley, R. M. (1999). “Uniform Central Limit Theoremsâ€, Cambridge University Press. ISBN 0 521 46102.
Durrett, R. (1991). Probability: Theory and Examples. Pacific Grove, CA: Wadsworth & Brooks/Cole.
Esseen, C. (1956). “A moment inequality with an application to the central limit theoremâ€. Skand. Aktuarietidskr. 39: 160–170.
Feller, W. (1972). An Introduction to Probability Theory and Its Applications, Volume II (2nd ed.). New York: John Wiley & Sons.
Graybill, J. An Introductory Course in Mathematical Statistics (Wiley Series, New York, 1987)
Huber, P. (1985). Projection pursuit. The annals of Statistics, 13(2):435 – 475.
Johnson, R & Wichern, Applied Multivariate Statistical Analysis (Wiley and Sons, New York, 2000)
Manoukian, E. B. (1986). Modern Concepts and Theorems of Mathematical Statistics. New York: Springer-Verlag.
Serfling, R. J. (1980). Approximation Theorems of Mathematical Statistics. New York: John Wiley & Sons.
Shevtsova, I. G. (2007). “Sharpening of the upper bound of the absolute constant in the Berry–Esseen inequalityâ€. Theory of Probability and its Applications 51 (3): 549–553.
Shevtsova, I. G. (2008). “On the absolute constant in the Berry-Esseen inequalityâ€. The Collection of Papers of Young Scientists of the Faculty of Computational Mathematics and CyberneticsTheory of Probability and
its Applications (5): 101-110.
Shiganov, I.S. (1986). “Refinement of the upper bound of a constant in the remainder term of the central limit theoremâ€. Journal of Soviet mathematics 35: 109–115.
Shorack, G.R., Wellner J.A. (1986) Empirical Processes with Applications to Statistics, Wiley.
Tyurin, I.S. (2009). “On the accuracy of the Gaussian approximationâ€. Doklady Mathematics 80 (3): 840-843.
A. W. van der Vaart (1998), Asymptotic Statistics. Cambridge Series in Probabilistic Mathematics.
Vapnik, V.N. and Chervonenkis, A. Ya (1971). On uniform convergence of the frequencies of events to their probabilities. Theor. Prob. Appl. 16, 264-280.
Yeo, I. & Johnson, R. (2000). A new family of power transformations to improve normality or symmetry. Biometrika, 87, 954-959.
Downloads
Published
How to Cite
Issue
Section
License
Copyright of the Journal belongs to the University of San Jose-Recoletos