Correlation Engine 2.0
Clear Search sequence regions

  • fungi (1)
  • metazoa (1)
  • plants (1)
  • protein level (1)
  • proteomes (5)
  • Sizes of these terms reflect their relevance to your search.

    To study unknown proteins on a large scale, a reference system has been set up for the three better studied eukaryotic kingdoms, built with 36 proteomes as taxonomically diverse as possible. Proteins from 362 other eukaryotic proteomes with no known homologue in this set were then analyzed, focusing noteworthy on singletons, that is, on such proteins with no known homologue in their own proteome. Consistently, for a given species, no more than 12% of the singletons thus found are known at the protein level, according to Uniprot. In addition, since they rely on the information found in the alignment of homologous sequences, predictions of AlphaFold2 for their tridimensional structure are poor. In the case of metazoan species, the number of singletons rarely exceeds 1000 for the species the closest to the reference system (divergence times below 75 Myr). Interestingly, in the cases of viridiplantae and fungi, larger amounts of singletons are found for such species, as if the timescale on which singletons are added to proteomes were different in metazoa and in other eukaryotic kingdoms. In order to confirm this phenomenon, further studies of proteomes closer to those of the reference system are, however, needed. © 2023. The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.


    Yves-Henri Sanejouand. On the Unknown Proteins of Eukaryotic Proteomes. Journal of molecular evolution. 2023 Aug;91(4):492-501

    Expand section icon Mesh Tags

    Expand section icon Substances

    PMID: 37219573

    View Full Text