Transformation To Normality Based On Empirical Distribution Functions

Authors

  • Mark S. Borres University of San Jose-Recoletos
  • Efren O. Barabat University of San Jose-Recoletos

DOI:

https://doi.org/10.32871/rmrj1402.02.12

Keywords:

transformation to normality, Box-Cox method, Johnson method, inequalities, Dvoretzky-Kiefer-Wolfowitz

Abstract

The paper examines an effi cient alternative to the Box-Cox and Yeo-Johnson’s
transformation to normality procedures which works under very general conditions. The method hinges on two fundamental results : the fact that the cumulative distribution function F(x) of a random variable X always has a U(0,1) distribution and the Box-Mueller transformation of uniform random variables to standard normal random variables. Given two observations x and y, we computed Fn(x) and Fn(y) , which for large n, are approximately uniform random variables. These values are then inputted into the Box-Mueller transformations. Bounds for the Kolmogorov-Smirnov statistic between the distribution of the transformed observations and the normal distribution are provided through numerical simulation and by appealing to the Dvoretzky-Kiefer-Wolfowitz inequality.

Author Biographies

Mark S. Borres, University of San Jose-Recoletos

graduated Bachelor of Science in Mathematics–major in Pure Mathematics at the University of the Philippines, Cebu College. Since 2009, he worked for the University of San Jose- Recoletos as a faculty member of the College of Arts and Sciences and handled Mathematics subjects such as College Algebra, Advanced Algebra, Abstract Algebra, Analytical Geometry, Euclidean geometry, Trigonometry, Business Mathematics, Linear Programming, Mathematics of Investment, Discrete Structure, and Statistics across colleges. Currently, he is one of the Research Staff of CPRDS doing research on Fractal Statistics and Fractal Geometry and also assumes the position of Secretary in the Recoletos Multidisciplinary Research Journal.

Efren O. Barabat, University of San Jose-Recoletos

an Electronics Engineer, graduated from the University of San Jose-Recoletos in 2010, Cum Laude honors. He ranked as top 9 examinee in the April 2011 ECE Licensure Examination. He worked as Field Engineer in SMART Communications, Inc. from 2011 to 2012. Currently, a full-time faculty member of the Electronics Engineering Department of USJ-R College of Engineering, handling Mathematics and Major Subjects of ECE.

References

Cook, R. D. & Weisberg, S. (1999). Applied Regression Including Computing and Graphics, New York: Wiley.

Craig, W. & Hogg , An Introduction to Mathematical Statistics, (Wiley and Sons, New York, 2000)

Dudley, R. M. (1999). “Uniform Central Limit Theorems”, Cambridge University Press. ISBN 0 521 46102.

Durrett, R. (1991). Probability: Theory and Examples. Pacific Grove, CA: Wadsworth & Brooks/Cole.

Esseen, C. (1956). “A moment inequality with an application to the central limit theorem”. Skand. Aktuarietidskr. 39: 160–170.

Feller, W. (1972). An Introduction to Probability Theory and Its Applications, Volume II (2nd ed.). New York: John Wiley & Sons.

Graybill, J. An Introductory Course in Mathematical Statistics (Wiley Series, New York, 1987)

Huber, P. (1985). Projection pursuit. The annals of Statistics, 13(2):435 – 475.

Johnson, R & Wichern, Applied Multivariate Statistical Analysis (Wiley and Sons, New York, 2000)

Manoukian, E. B. (1986). Modern Concepts and Theorems of Mathematical Statistics. New York: Springer-Verlag.

Serfling, R. J. (1980). Approximation Theorems of Mathematical Statistics. New York: John Wiley & Sons.

Shevtsova, I. G. (2007). “Sharpening of the upper bound of the absolute constant in the Berry–Esseen inequality”. Theory of Probability and its Applications 51 (3): 549–553.

Shevtsova, I. G. (2008). “On the absolute constant in the Berry-Esseen inequality”. The Collection of Papers of Young Scientists of the Faculty of Computational Mathematics and CyberneticsTheory of Probability and
its Applications (5): 101-110.

Shiganov, I.S. (1986). “Refinement of the upper bound of a constant in the remainder term of the central limit theorem”. Journal of Soviet mathematics 35: 109–115.

Shorack, G.R., Wellner J.A. (1986) Empirical Processes with Applications to Statistics, Wiley.

Tyurin, I.S. (2009). “On the accuracy of the Gaussian approximation”. Doklady Mathematics 80 (3): 840-843.

A. W. van der Vaart (1998), Asymptotic Statistics. Cambridge Series in Probabilistic Mathematics.

Vapnik, V.N. and Chervonenkis, A. Ya (1971). On uniform convergence of the frequencies of events to their probabilities. Theor. Prob. Appl. 16, 264-280.

Yeo, I. & Johnson, R. (2000). A new family of power transformations to improve normality or symmetry. Biometrika, 87, 954-959.

Downloads

Published

2014-12-28

How to Cite

Borres, M. S., & Barabat, E. O. (2014). Transformation To Normality Based On Empirical Distribution Functions. Recoletos Multidisciplinary Research Journal, 2(2). https://doi.org/10.32871/rmrj1402.02.12

Issue

Section

Articles

Most read articles by the same author(s)

1 2 3 > >>