The availability of whole genome sequences was expected to supply essentially unlimited data for phylogenetics. However, strict reliance on single-copy genes for this purpose has drastically limited the amount of data that can be used. Here, we review several approaches for increasing the amount of data used for phylogenetic inference, focusing on methods that allow for the inclusion of duplicated genes (paralogs). Recently developed methods that are robust to high levels of incomplete lineage sorting also appear to be robust to the inclusion of paralogs, suggesting a promising way to take full advantage of genomic data. We discuss the pitfalls of these approaches, as well as further avenues for research. Copyright © 2020 Elsevier Ltd. All rights reserved.
Megan L Smith, Matthew W Hahn. New Approaches for Inferring Phylogenies in the Presence of Paralogs. Trends in genetics : TIG. 2021 Feb;37(2):174-187
PMID: 32921510
View Full Text