Article (Scientific journals)
Consensus assessment of the contamination level of publicly available cyanobacterial genomes.
Cornet, Luc; Meunier, Loïc; Van Vlierberghe, Mick et al.
2018In PLoS ONE, 13 (7), p. 0200323
Peer Reviewed verified by ORBi
 

Files


Full Text
Cornet_et_al_2018_PLOS_ONE_postprint_editor.pdf
Publisher postprint (6.96 MB)
Download
Annexes
Cornet_et_al_2018_PLOS_ONE_suppl_data.zip
(3.39 MB)
Supplemental Data
Download

All documents in ORBi are protected by a user license.

Send to



Details



Abstract :
[en] Publicly available genomes are crucial for phylogenetic and metagenomic studies, in which contaminating sequences can be the cause of major problems. This issue is expected to be especially important for Cyanobacteria because axenic strains are notoriously difficult to obtain and keep in culture. Yet, despite their great scientific interest, no data are currently available concerning the quality of publicly available cyanobacterial genomes. As reliably detecting contaminants is a complex task, we designed a pipeline combining six methods in a consensus strategy to assess the contamination level of 440 genome assemblies of Cyanobacteria. Two methods are based on published reference databases of ribosomal genes (SSU rRNA 16S and ribosomal proteins), one is indirectly based on a reference database of marker genes (CheckM), and three are based on complete genome analysis. Among those genome-wide methods, Kraken and DIAMOND blastx share the same reference database that we derived from Ensembl Bacteria, whereas CONCOCT does not require any reference database, instead relying on differences in DNA tetramer frequencies. Given that all the six methods appear to have their own strengths and limitations, we used the consensus of their rankings to infer that >5% of cyanobacterial genome assemblies are highly contaminated by foreign DNA (i.e., contaminants were detected by 5 or 6 methods). Our results will help researchers to check the quality of publicly available genomic data before use in their own analyses. Moreover, we argue that journals should make mandatory the submission of raw read data along with genome assemblies in order to facilitate the detection of contaminants in sequence databases.
Disciplines :
Genetics & genetic processes
Biochemistry, biophysics & molecular biology
Microbiology
Author, co-author :
Cornet, Luc ;  Université de Liège - ULiège > Département des sciences de la vie > Phylogénomique des eucaryotes
Meunier, Loïc ;  Université de Liège - ULiège > Département des sciences de la vie > Phylogénomique des eucaryotes
Van Vlierberghe, Mick ;  Université de Liège - ULiège > Département des sciences de la vie > Phylogénomique des eucaryotes
Léonard, Raphaël  ;  Université de Liège - ULiège > Département des sciences de la vie > Cristallographie des macromolécules biologiques
Durieu, Benoit ;  Université de Liège - ULiège > Département des sciences de la vie > Centre d'ingénierie des protéines
Lara, Yannick  ;  Université de Liège - ULiège > Département de géologie > Paléobiogéologie - Paléobotanique - Paléopalynologie (PPP)
Misztak, Agnieszka
Sirjacobs, Damien ;  Université de Liège - ULiège > Département des sciences de la vie > Phylogénomique des eucaryotes
Javaux, Emmanuelle  ;  Université de Liège - ULiège > Département de géologie > Paléobiogéologie - Paléobotanique - Paléopalynologie (PPP)
Philippe, Herve
Wilmotte, Annick  ;  Université de Liège - ULiège > Département des sciences de la vie > Physiologie et génétique bactériennes
Baurain, Denis  ;  Université de Liège - ULiège > Département des sciences de la vie > Phylogénomique des eucaryotes
Language :
English
Title :
Consensus assessment of the contamination level of publicly available cyanobacterial genomes.
Publication date :
2018
Journal title :
PLoS ONE
eISSN :
1932-6203
Publisher :
Public Library of Science, United States - California
Volume :
13
Issue :
7
Pages :
e0200323
Peer reviewed :
Peer Reviewed verified by ORBi
Tags :
CÉCI : Consortium des Équipements de Calcul Intensif
European Projects :
FP7 - 308074 - ELITE - Early Life Traces, Evolution, and Implications for Astrobiology
Funders :
F.R.S.-FNRS - Fonds de la Recherche Scientifique [BE]
BELSPO - Politique scientifique fédérale [BE]
FRIA - Fonds pour la Formation à la Recherche dans l'Industrie et dans l'Agriculture [BE]
ULiège - Université de Liège [BE]
ANR - Agence Nationale de la Recherche [FR]
WBI - Wallonie-Bruxelles International [BE]
CE - Commission Européenne [BE]
Available on ORBi :
since 14 August 2018

Statistics


Number of views
163 (41 by ULiège)
Number of downloads
103 (16 by ULiège)

Scopus citations®
 
35
Scopus citations®
without self-citations
24
OpenCitations
 
33

Bibliography


Similar publications



Contact ORBi