Skip to Main Content
 
Translator disclaimer

Xiaoming Huoa & Gábor J. Székelybc

a School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA 30332 ()

b National Science Foundation Arlington, VA 22203

c Alfréd Rényi Institute of Mathematics Hungarian Academy of Sciences Budapest, Hungary ()

Color versions of one or more of the figures in the article can be found online at www.tandfonline.com/r/tech.

Supplementary Material

Distance covariance and distance correlation have been widely adopted in measuring dependence of a pair of random variables or random vectors. If the computation of distance covariance and distance correlation is implemented directly accordingly to its definition then its computational complexity is O(n2), which is a disadvantage compared to other faster methods. In this article we show that the computation of distance covariance and distance correlation of real-valued random variables can be implemented by an O(nlog n) algorithm and this is comparable to other computationally efficient algorithms. The new formula we derive for an unbiased estimator for squared distance covariance turns out to be a U-statistic. This fact implies some nice asymptotic properties that were derived before via more complex methods. We apply the fast computing algorithm to some synthetic data. Our work will make distance correlation applicable to a much wider class of problems. A supplementary file to this article, available online, includes a Matlab and C-based software that realizes the proposed algorithm.

Login options

Purchase * Save for later
Online

Article Purchase 24 hours to view or download: USD 51.00 Add to cart

Issue Purchase 30 days to view or download: USD 105.00 Add to cart

* Local tax will be added as applicable