Article (Scientific journals)
AMAW: automated gene annotation for non-model eukaryotic genomes
Meunier, Loïc; Baurain, Denis; Cornet, Luc
2023In F1000Research, 12, p. 186
Peer Reviewed verified by ORBi
 

Files


Full Text
Meunier_et_al_2023_F1000Research_preprint_editor.pdf
Author preprint (841.62 kB) Creative Commons License - Attribution
Preprint editor
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
automation; evidence data acquisition; gene prediction; Genome annotation; non-model unicellular eukaryotes; Singularity container; Biochemistry, Genetics and Molecular Biology (all); Immunology and Microbiology (all); Pharmacology, Toxicology and Pharmaceutics (all); General Pharmacology, Toxicology and Pharmaceutics; General Immunology and Microbiology; General Biochemistry, Genetics and Molecular Biology; General Medicine
Abstract :
[en] Background: The annotation of genomes is a crucial step regarding the analysis of new genomic data and resulting insights, and this especially for emerging organisms which allow researchers to access unexplored lineages, so as to expand our knowledge of poorly represented taxonomic groups. Complete pipelines for eukaryotic genome annotation have been proposed for more than a decade, but the issue is still challenging. One of the most widely used tools in the field is MAKER2, an annotation pipeline using experimental evidence (mRNA-seq and proteins) and combining different gene prediction tools. MAKER2 enables individual laboratories and small-scale projects to annotate non-model organisms for which pre-existing gene models are not available. The optimal use of MAKER2 requires gathering evidence data (by searching and assembling transcripts, and/or collecting homologous proteins from related organisms), elaborating the best annotation strategy (training of gene models) and efficiently orchestrating the different steps of the software in a grid computing environment, which is tedious, time-consuming and requires a great deal of bioinformatic skills. Methods: To address these issues, we present AMAW (Automated MAKER2 Annotation Wrapper), a wrapper pipeline for MAKER2 that automates the above-mentioned tasks. Importantly, AMAW also exists as a Singularity container recipe easy to deploy on a grid computer, thereby overcoming the tricky installation of MAKER2. Use case: The performance of AMAW is illustrated through the annotation of a selection of 32 protist genomes, for which we compared its annotations with those produced with gene models directly available in AUGUSTUS. Conclusions: Importantly, AMAW also exists as a Singularity container recipe easy to deploy on a grid computer, thereby overcoming the tricky installation of MAKER2
Research center :
InBios - Integrative Biological Sciences - ULiège
Disciplines :
Genetics & genetic processes
Microbiology
Author, co-author :
Meunier, Loïc ;  Université de Liège - ULiège > Integrative Biological Sciences (InBioS)
Baurain, Denis  ;  Université de Liège - ULiège > Département des sciences de la vie > Phylogénomique des eucaryotes
Cornet, Luc  ;  Université de Liège - ULiège > Département des sciences de la vie > Phylogénomique des eucaryotes ; Mycology and Aerobiology, Sciensano, Ixelles, Belgium
Language :
English
Title :
AMAW: automated gene annotation for non-model eukaryotic genomes
Publication date :
2023
Journal title :
F1000Research
eISSN :
2046-1402
Publisher :
F1000 Research Ltd
Volume :
12
Pages :
186
Peer reviewed :
Peer Reviewed verified by ORBi
Tags :
CÉCI : Consortium des Équipements de Calcul Intensif
Funders :
BELSPO - Politique scientifique fédérale [BE]
F.R.S.-FNRS - Fonds de la Recherche Scientifique [BE]
Funding text :
This work was supported by the F.R.S-FNRS. Computational resources were provided by the Consortium des Équipements de Calcul Intensif (CÉCI) funded by the F.R.S.-FNRS (2.5020.11), and through two research grants to DB: B2/191/P2/BCCM GEN-ERA (Belgian Science Policy Office - BELSPO) and CDR J.0008.20 (F.R.S.-FNRS). LC was also supported by the GEN-ERA research grant.
Data Set :
Zenodo

Archived source code (see publications for additional and up-to-date links)

Available on ORBi :
since 01 July 2023

Statistics


Number of views
20 (4 by ULiège)
Number of downloads
28 (1 by ULiège)

Scopus citations®
 
0
Scopus citations®
without self-citations
0
OpenCitations
 
0

Bibliography


Similar publications



Contact ORBi