Approximating the Distribution of Pareto Sums

Ilya Zaliapin, Yan Y. Kagan, & Frederic P. Schoenberg

Published June 2005, SCEC Contribution #770

Heavy-tailed random variables (rvs) have proven to be an essential element in modeling a wide variety of natural and human-induced processes, and sums of heavy-tailed rvs represent a particularly important construction in such models. Oriented toward both geophysical and statistical audiences, this paper discusses the appearance of the Pareto law in seismology and addresses the problem of statistical approximation for sums of independent rvs with the common Pareto distribution F(x) = 1 − x^(−α) for 1/2 < α < 2. Such variables have an infinite second moment, which prevents one from using the Central Limit Theorem to solve the problem. This paper presents five approximation techniques for Pareto sums and discusses their respective accuracy. The main focus is on the median and the upper and lower quantiles of the sum's distribution. Two of the proposed approximations are based on the Generalized Central Limit Theorem, which establishes the general limit for sums of independent identically distributed rvs in terms of stable distributions; these approximations work well for large numbers of summands. Another approximation, which replaces the sum with its maximal summand, has less than 10% relative error for the upper quantiles when α < 1. A more elaborate approach considers the two largest observations separately from the rest of the observations, and yields a relative error under 1% for the upper quantiles and less than 5% for the median. The last approximation is specially tailored for the lower quantiles and involves reducing the non-Gaussian problem to its Gaussian equivalent; it too yields errors of less than 1%. Approximation of the observed cumulative seismic moment in California illustrates the developed methods.
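
As an illustration of the maximal-summand idea mentioned in the abstract, the following is a minimal Python sketch, not the authors' implementation. It assumes the Pareto form F(x) = 1 − x^(−α) for x ≥ 1 and compares a Monte Carlo estimate of an upper quantile of the sum with the exact quantile of the maximum, which follows from P(max ≤ x) = (1 − x^(−α))^n. The function names and the parameter values (α = 0.7, n = 50, q = 0.98) are hypothetical choices made only for this example.

```python
import math
import random


def pareto_sample(alpha, rng):
    """Draw one Pareto variate with F(x) = 1 - x**(-alpha), x >= 1 (inverse transform)."""
    u = 1.0 - rng.random()  # uniform on (0, 1], avoids a zero base
    return u ** (-1.0 / alpha)


def max_quantile(q, n, alpha):
    """Exact q-quantile of the maximum of n iid Pareto variates.

    Solves (1 - x**(-alpha))**n = q for x.
    """
    return (1.0 - q ** (1.0 / n)) ** (-1.0 / alpha)


def empirical_sum_quantile(q, n, alpha, trials, seed=0):
    """Monte Carlo estimate of the q-quantile of the sum of n iid Pareto variates."""
    rng = random.Random(seed)
    sums = sorted(
        sum(pareto_sample(alpha, rng) for _ in range(n)) for _ in range(trials)
    )
    idx = min(trials - 1, int(q * trials))
    return sums[idx]


if __name__ == "__main__":
    # Hypothetical illustration values, not taken from the paper.
    alpha, n, q = 0.7, 50, 0.98
    s_q = empirical_sum_quantile(q, n, alpha, trials=20000)
    m_q = max_quantile(q, n, alpha)
    rel_diff = abs(s_q - m_q) / s_q
    print(f"sum quantile (Monte Carlo): {s_q:.4g}")
    print(f"max quantile (exact):       {m_q:.4g}")
    print(f"relative difference:        {rel_diff:.2%}")
```

Because the tail is heavy, the Monte Carlo estimate of a high quantile is itself noisy, so the printed relative difference should be read as indicative only; a careful accuracy study would need many more trials or the analytical tools described in the paper.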

Key Words
United States, California, seismicity, seismic moment, Pareto sums, human activity, statistical analysis, effects, algorithms, earthquakes

Citation
Zaliapin, I., Kagan, Y. Y., & Schoenberg, F. P. (2005). Approximating the Distribution of Pareto Sums. Pure and Applied Geophysics, 162(6-7), 1187-1228. doi: 10.1007/s00024-004-2666-3.