Bandwidth Selection for Phylogenetic Tree Density Estimation

Published:Dec 29, 2025 13:01
1 min read
ArXiv

Analysis

This paper addresses the problem of bandwidth selection for kernel density estimation (KDE) applied to phylogenetic trees. It proposes a likelihood cross-validation (LCV) method for selecting the optimal bandwidth in a tropical KDE, a KDE variant using a specific distance metric for tree spaces. The paper's significance lies in providing a theoretically sound and computationally efficient method for density estimation on phylogenetic trees, which is crucial for analyzing evolutionary relationships. The use of LCV and the comparison with existing methods (nearest neighbors) are key contributions.

Reference

The paper demonstrates that the LCV method provides a better-fit bandwidth parameter for tropical KDE, leading to improved accuracy and computational efficiency compared to nearest neighbor methods, as shown through simulations and empirical data analysis.