Article (Scientific journals)
Broadly sampled orthologous groups of eukaryotic proteins for the phylogenetic study of plastid-bearing lineages.
Van Vlierberghe, Mick; Philippe, Hervé; Baurain, Denis
2021In BMC Research Notes, 14 (1), p. 143
Peer Reviewed verified by ORBi
 

Files


Full Text
Van_Vlierberghe_et_al_2021a_BMC_Res_Notes_postprint_editor.pdf
Publisher postprint (1.05 MB)
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
Eukaryota/genetics; Evolution, Molecular; Genome; Phylogeny; Plants; Plastids/genetics; Algae; CASH; Contamination; Endosymbiotic gene transfer (EGT); Eukaryotic evolution; Horizontal or lateral gene transfer (HGT/LGT); Kleptoplasty; Organelles; Orthology; Phylogenomics; Proteomes
Abstract :
[en] OBJECTIVES: Identifying orthology relationships among sequences is essential to understand evolution, diversity of life and ancestry among organisms. To build alignments of orthologous sequences, phylogenomic pipelines often start with all-vs-all similarity searches, followed by a clustering step. For the protein clusters (orthogroups) to be as accurate as possible, proteomes of good quality are needed. Here, our objective is to assemble a data set especially suited for the phylogenomic study of algae and formerly photosynthetic eukaryotes, which implies the proper integration of organellar data, to enable distinguishing between several copies of one gene (paralogs), taking into account their cellular compartment, if necessary. DATA DESCRIPTION: We submitted 73 top-quality and taxonomically diverse proteomes to OrthoFinder. We obtained 47,266 orthogroups and identified 11,775 orthogroups with at least two algae. Whenever possible, sequences were functionally annotated with eggNOG and tagged after their genomic and target compartment(s). Then we aligned and computed phylogenetic trees for the orthogroups with IQ-TREE. Finally, these trees were further processed by identifying and pruning the subtrees exclusively composed of plastid-bearing organisms to yield a set of 31,784 clans suitable for studying photosynthetic organism genome evolution.
Disciplines :
Genetics & genetic processes
Author, co-author :
Van Vlierberghe, Mick ;  Université de Liège - ULiège > InBioS
Philippe, Hervé
Baurain, Denis  ;  Université de Liège - ULiège > Département des sciences de la vie > Phylogénomique des eucaryotes
Language :
English
Title :
Broadly sampled orthologous groups of eukaryotic proteins for the phylogenetic study of plastid-bearing lineages.
Publication date :
2021
Journal title :
BMC Research Notes
eISSN :
1756-0500
Publisher :
BioMed Central, London, United Kingdom
Volume :
14
Issue :
1
Pages :
143
Peer reviewed :
Peer Reviewed verified by ORBi
Funders :
FRIA - Fonds pour la Formation à la Recherche dans l'Industrie et dans l'Agriculture [BE]
F.R.S.-FNRS - Fonds de la Recherche Scientifique [BE]
ULiège - Université de Liège [BE]
Available on ORBi :
since 10 August 2021

Statistics


Number of views
39 (7 by ULiège)
Number of downloads
37 (4 by ULiège)

Scopus citations®
 
4
Scopus citations®
without self-citations
1
OpenCitations
 
4

Bibliography


Similar publications



Contact ORBi