Hellinger Distance of Probability Distributions
Definition
The following distance function defined on probability distributions themselves is called the Hellinger distance.
Discrete 1
Let be a probability mass function. The Hellinger distance of is defined as:
Continuous 2
Let be a probability density function. The Hellinger distance of is defined as:
Explanation
The Hellinger distance is, by definition, a distance function that directly compares probability mass functions or probability density functions. It is bounded by , being when they are identical and when they are completely different. While the Kullback-Leibler divergence is widely used for comparing probability distributions, the Hellinger distance has the distinguishing feature of being a proper distance function, allowing the discussion of metric spaces.
See Also
Gingold, J.A., Coakley, E.S., Su, J. et al. Distribution Analyzer, a methodology for identifying and clustering outlier conditions from single-cell distributions, and its application to a Nanog reporter RNAi screen. BMC Bioinformatics 16, 225 (2015). https://doi.org/10.1186/s12859-015-0636-7 ↩︎
Wibisono. (2024). Optimal score estimation via empirical Bayes smoothing. https://doi.org/10.48550/arXiv.2402.07747 ↩︎