Back to Search Start Over

Efficient computation of multivariate empirical distribution functions at the observed values

Authors :
David Lee
Harry Joe
Source :
Computational Statistics. 33:1413-1428
Publication Year :
2017
Publisher :
Springer Science and Business Media LLC, 2017.

Abstract

Consider the evaluation of model-based functions of cumulative distribution functions that are integrals. When the cumulative distribution function does not have a tractable form but simulation of the multivariate distribution is easily feasible, we can evaluate the integral via a Monte Carlo sample, replacing the model-based distribution function by the empirical distribution function. Given a simulation sample of size N, the naive method uses $$O(N^{2})$$ comparisons to compute the empirical distribution function at all N sample vectors. To obtain faster computational speed when N needs to be large to achieve a desired accuracy, we propose methods modified from the popular merge sort and quicksort algorithms that preserve their average $$O(N\log _{2}N)$$ complexity in the bivariate case. The modified merge sort algorithm can be extended to the computation of a d-dimensional empirical distribution function at the observed values with $$O(N\log _{2}^{d-1}N)$$ complexity. Simulation studies suggest that the proposed algorithms provide substantial time savings when N is large.

Details

ISSN :
16139658 and 09434062
Volume :
33
Database :
OpenAIRE
Journal :
Computational Statistics
Accession number :
edsair.doi...........f686bb601d4633a0d57fe715ec8f9327
Full Text :
https://doi.org/10.1007/s00180-017-0771-x