[en] The tetraploid genome and clonal propagation of the cultivated potato (Solanum tuberosum L.)1,2 dictate a slow, non-accumulative breeding mode of the most important tuber crop. Transitioning potato breeding to a seed-propagated hybrid system based on diploid inbred lines has the potential to greatly accelerate its improvement3. Crucially, the development of inbred lines is impeded by manifold deleterious variants; explaining their nature and finding ways to eliminate them is the current focus of hybrid potato research4-10. However, most published diploid potato genomes are unphased, concealing crucial information on haplotype diversity and heterozygosity11-13. Here we develop a phased potato pangenome graph of 60 haplotypes from cultivated diploids and the ancestral wild species, and find evidence for the prevalence of transposable elements in generating structural variants. Compared with the linear reference, the graph pangenome represents a broader diversity (3,076 Mb versus 742 Mb). Notably, we observe enhanced heterozygosity in cultivated diploids compared with wild ones (14.0% versus 9.5%), indicating extensive hybridization during potato domestication. Using conservative criteria, we identify 19,625 putatively deleterious structural variants (dSVs) and reveal a biased accumulation of deleterious single nucleotide polymorphisms (dSNPs) around dSVs in coupling phase. Based on the graph pangenome, we computationally design ideal potato haplotypes with minimal dSNPs and dSVs. These advances provide critical insights into the genomic basis of clonal propagation and will guide breeders to develop a suite of promising inbred lines.
Disciplines :
Agriculture & agronomy
Author, co-author :
Cheng, Lin ; Université de Liège - ULiège > TERRA Research Centre ; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
Wang, Nan; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China ; National Key Laboratory of Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
Bao, Zhigui ; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China ; Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany
Zhou, Qian; School of Agriculture and Biotechnology, Sun Yat-Sen University, Shenzhen, China
Guarracino, Andrea ; Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
Yang, Yuting; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
Wang, Pei; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
Zhang, Zhiyang ; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
Tang, Dié; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China ; Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
Zhang, Pingxian ; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
Wu, Yaoyao; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China ; College of Horticulture, Nanjing Agricultural University, Nanjing, China
Zhou, Yao ; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China ; Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Beijing, China
Zheng, Yi; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
Hu, Yong; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
Lian, Qun; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
Ma, Zhaoxu ; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
Zhang, Chunzhi ; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
Lucas, William J; Department of Plant Biology, College of Biological Sciences, University of California, Davis, Davis, CA, USA
Garrison, Erik ; Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
Stein, Nils ; Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany ; Crop Plant Genetics, Institute of Agricultural and Nutritional Sciences, Martin-Luther-University of Halle-Wittenberg, Halle (Saale), Germany
Städler, Thomas; Institute of Integrative Biology and Zurich-Basel Plant Science Center, ETH Zurich, Zurich, Switzerland
Zhou, Yongfeng ; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China ; National Key Laboratory of Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
Huang, Sanwen ; National Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China. huangsanwen@caas.cn ; National Key Laboratory of Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou, China. huangsanwen@caas.cn
E. Stokstad The new potato Science 363 574 577 2019Sci..363.574S 1:CAS:528:DC%2BC1MXhsVOhurbL 30733400
D.M. Spooner M. Ghislain R. Simon S.H. Jansky T. Gavrilenko Systematics, diversity, genetics, and evolution of wild and cultivated potatoes Bot. Rev. 80 283 383
C. Zhang et al. Genome design of hybrid potato Cell 184 3873 3883 1:CAS:528:DC%2BB3MXhsVWhtbjO 34171306
The Potato Genome Sequencing Consortium. Genome sequence and analysis of the tuber crop potato Nature 475 189 195
Q. Zhou et al. Haplotype-resolved genome analyses of a heterozygous diploid potato Nat. Genet. 52 1018 1023 1:CAS:528:DC%2BB3cXhvFegurvO 32989320 7527274
Q. Lian et al. Acquisition of deleterious mutations during potato polyploidization J. Integr. Plant Biol. 61 7 11 1:CAS:528:DC%2BC1MXhsFynt74%3D 30474354
Z. Bao et al. Genome architecture and tetrasomic inheritance of autotetraploid potato Mol. Plant 15 1211 1226 1:CAS:528:DC%2BB38Xhs1SmtL%2FN 35733345
S.H. Jansky et al. Reinventing potato as a diploid inbred line-based crop Crop Sci. 56 1412 1422 1:CAS:528:DC%2BC2sXns1ens7w%3D
C. Zhang et al. The genetic basis of inbreeding depression in potato Nat. Genet. 51 374 378 1:CAS:528:DC%2BC1MXlvFSgtbk%3D 30643248
Y. Wu et al. Phylogenomic discovery of deleterious mutations facilitates hybrid potato breeding Cell 186 2313 2328.e15 1:CAS:528:DC%2BB3sXptlWnt7c%3D 37146612
N. van Lieshout et al. Solyntus, the new highly contiguous reference genome for potato (Solanum tuberosum) G3 10 3489 3495 32759330 7534448
R. Freire et al. Chromosome-scale reference genome assembly of a diploid potato clone derived from an elite variety G3 11 1:CAS:528:DC%2BB38XhvVOlu73K 34534288 8664475 jkab330
D. Tang et al. Genome evolution and diversity of wild and cultivated potatoes Nature 606 535 541 2022Natur.606.535T 1:CAS:528:DC%2BB38XhsFWhu7bN 35676481 9200641
H. Sun et al. Chromosome-scale and haplotype-resolved genome assembly of a tetraploid potato cultivar Nat. Genet. 54 342 348 1:CAS:528:DC%2BB38XmtVehsbc%3D 35241824 8920897
M. Ye et al. Generation of self-compatible diploid potato by knockout of S-RNase Nat. Plants 4 651 654 1:CAS:528:DC%2BC1cXhsFSqtb%2FM 30104651
S. Chun J.C. Fay Identification of deleterious mutations within three human genomes Genome Res. 19 1553 1561 1:CAS:528:DC%2BD1MXhtFCjsLrJ 19602639 2752137
C.D. Marsden et al. Bottlenecks and selective sweeps during domestication have increased deleterious genetic variation in dogs Proc. Natl Acad. Sci. USA 113 152 157 2016PNAS.113.152M 1:CAS:528:DC%2BC2MXitVylsr%2FJ 26699508
Q. Liu Y. Zhou P.L. Morrell B.S. Gaut Deleterious variants in Asian rice and the potential cost of domestication Mol. Biol. Evol. 34 908 924 1:CAS:528:DC%2BC1cXhvV2ms7nP 28087781
X. Zhang et al. Haplotype-resolved genome assembly provides insights into evolutionary history of the tea plant Camellia sinensis Nat. Genet. 53 1250 1259 1:CAS:528:DC%2BB3MXhsFOntLnJ 34267370 8346365
E.D. Jarvis et al. Semi-automated assembly of high-quality diploid human reference genomes Nature 611 519 531 2022Natur.611.519J 1:CAS:528:DC%2BB38Xis1Onu7zO 36261518 9668749
M. Schreiber M. Jayakodi N. Stein M. Mascher Plant pangenomes for crop improvement, biodiversity and evolution Nat. Rev. Genet. 25 563 577 1:CAS:528:DC%2BB2cXktVyjsLo%3D 38378816 7616794
M.A. Hardigan et al. Genome diversity of tuber-bearing Solanum uncovers complex evolutionary history and targets of domestication in the cultivated potato Proc. Natl Acad. Sci. USA 114 E9999 E10008 1:CAS:528:DC%2BC2sXhslemsrzL 29087343 5699086
F.A. Simão R.M. Waterhouse P. Ioannidis E.V. Kriventseva E.M. Zdobnov BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs Bioinformatics 31 3210 3212 26059717
W.-W. Liao et al. A draft human pangenome reference Nature 617 312 324 2023Natur.617.312L 1:CAS:528:DC%2BB3sXpvVansLk%3D 37165242 10172123
G.M. Pham et al. Construction of a chromosome-scale long-read reference genome assembly for potato GigaScience 9 32964225 7509475 giaa100
E. Garrison et al. Building pangenome graphs Nat. Methods 21 2008 2012 1:CAS:528:DC%2BB2cXitlant77L 39433878
G. Hickey et al. Pangenome graph construction from genome alignments with Minigraph-Cactus Nat. Biotechnol. 42 663 673 1:CAS:528:DC%2BB3sXpvVaisbk%3D 37165083
E. Garrison A. Guarracino Unbiased pangenome graphs Bioinformatics 39 btac743 1:CAS:528:DC%2BB3sXhsFKis77F 36448683
Z. Gong et al. Repeatless and repeat-based centromeres in potato: implications for centromere evolution Plant Cell 24 3559 3574 1:CAS:528:DC%2BC38Xhs1ygs7nM 22968715 3480287
I. Bozan et al. Pangenome analyses reveal impact of transposable elements and ploidy on the evolution of potato species Proc. Natl Acad. Sci. USA 120 2023PNAS.12017119B 1:CAS:528:DC%2BB3sXhvVCksLrP 37487084 10401005 e2211117120
M. Domínguez et al. The impact of transposable elements on tomato diversity Nat. Commun. 11 2020NatCo.11.4058D 32792480 7426864 4058
T. Wicker et al. A detailed look at 7 million years of genome evolution in a 439 kb contiguous sequence at the barley Hv‐eIF4E locus: recombination, rearrangements and repeats Plant J. 41 184 194 1:CAS:528:DC%2BD2MXhtVWitrY%3D 15634196
P. Balachandran et al. Transposable element-mediated rearrangements are prevalent in human genomes Nat. Commun. 13 2022NatCo.13.7115B 1:CAS:528:DC%2BB38XivFOms77L 36402840 9675761 7115
Z. Fang et al. Megabase-scale inversion polymorphism in the wild ancestor of maize Genetics 191 883 894 22542971 3389981
E.L. Berdan A. Blanckaert R.K. Butlin C. Bank Deleterious mutation accumulation and the long-term fate of chromosomal inversions PLoS Genet. 17 e1009411 1:CAS:528:DC%2BB3MXmvFanu7s%3D 33661924 7963061
Y. Zhou et al. The population genetics of structural variants in grapevine domestication Nat. Plants 5 965 979 31506640
E. Roumeliotis B. Kloosterman M. Oortwijn R.G. Visser C.W. Bachem The PIN family of proteins in potato and their putative role in tuberization Front. Plant Sci. 4 524 24391658 3867687
S.K. Cho et al. Polypyrimidine tract-binding proteins of potato mediate tuberization through an interaction with StBEL5 RNA J. Exp. Bot. 66 6835 6847 1:CAS:528:DC%2BC28XpvFGlurs%3D 26283046 4623692
B. Wang et al. De novo genome assembly and analyses of 12 founder inbred lines provide insights into maize heterosis Nat. Genet. 55 312 323 1:CAS:528:DC%2BB3sXhslKhtrY%3D 36646891
H. Xiao et al. Adaptive and maladaptive introgression in grapevine domestication Proc. Natl Acad. Sci. USA 120 e2222041120 1:CAS:528:DC%2BB3sXhsVWhtrvF 37276420 10268302
P.C. Bethke et al. Diploid potatoes as a catalyst for change in the potato industry Am. J. Potato Res. 99 337 357
S. Mezmouk J. Ross-Ibarra The pattern and distribution of deleterious mutations in maize G3 4 163 171 24281428
Y. Li et al. Resequencing of 200 human exomes identifies an excess of low-frequency non-synonymous coding variants Nat. Genet. 42 969 972 1:CAS:528:DC%2BC3cXht1elu7rP 20890277
N. Wang et al. Structural variation and parallel evolution of apomixis in citrus during domestication and diversification Natl Sci. Rev. 9 nwac114 1:CAS:528:DC%2BB3sXnsFCjs70%3D 36415319 9671666
B.E. Harcourt Reflecting on the subject: a critique of the social influence conception of deterrence, the broken windows theory, and order-maintenance policing New York style Mich. Law Rev. 97 291
A.P. Morgan et al. Structural variation shapes the landscape of recombination in mouse Genetics 206 603 619 1:CAS:528:DC%2BC1cXhvVOltr3M 28592499 5499175
D. Porubsky et al. Recurrent inversion polymorphisms in humans associate with genetic instability and genomic disorders Cell 185 1986 2005 1:CAS:528:DC%2BB38Xht1ant7rM 35525246 9563103
B.A. Rowan et al. An ultra high-density Arabidopsis thaliana crossover map that refines the influences of structural variation and epigenetic features Genetics 213 771 787 1:CAS:528:DC%2BB3cXovVWru7s%3D 31527048 6827372
P. Ramu et al. Cassava haplotype map highlights fixation of deleterious mutations during clonal propagation Nat. Genet. 49 959 963 1:CAS:528:DC%2BC2sXmtFGhtL4%3D 28416819
Y. Jiao et al. Regulation of OsSPL14 by OsmiR156 defines ideal plant architecture in rice Nat. Genet. 42 541 544 1:CAS:528:DC%2BC3cXmsVWntbY%3D 20495565
X. Jiang et al. Genomic features of meiotic crossovers in diploid potato Hortic. Res. 10 uhad079 1:CAS:528:DC%2BB2cXhsVyntrzL 37323232 10261879
B.J. Hayes et al. Potential approaches to create ultimate genotypes in crops and livestock Nat. Genet. 56 2310 2317 1:CAS:528:DC%2BB2cXit1altLvM 39402155
S. Filler-Hayut K. Kniazev C. Melamed-Bessudo A.A. Levy Targeted inter-homologs recombination in Arabidopsis euchromatin and heterochromatin Int. J. Mol. Sci. 22 12096 1:CAS:528:DC%2BB3MXis1Cisb%2FJ 34829981 8622013
C. Schmidt P. Schindele H. Puchta From gene editing to genome engineering: restructuring plant chromosomes via CRISPR/Cas Abiotech 1 21 31 36305002
G. Rakocevic et al. Fast and accurate genomic analyses using genome graphs Nat. Genet. 51 354 362 1:CAS:528:DC%2BC1MXlvFSgtLo%3D 30643257
M. Alonge et al. Major impacts of widespread structural variation on gene expression and crop improvement in tomato Cell 182 145 161. e23 1:CAS:528:DC%2BB3cXhtFykt7fN 32553272 7354227
E.M. Leffler et al. Resistance to malaria through structural variation of red blood cell invasion receptors Science 356 eaam6393 28522690 5575826
H. Yan et al. Pangenomic analysis identifies structural variation associated with heat tolerance in pearl millet Nat. Genet. 55 507 518 1:CAS:528:DC%2BB3sXkt1yntLw%3D 36864101 10011142
D.F. Conrad et al. Origins and functional impact of copy number variation in the human genome Nature 464 704 712 1:CAS:528:DC%2BD1MXht1CisLrL 19812545
J.R. Xue et al. The functional and evolutionary impacts of human-specific deletions in conserved elements Science 380 eabn2253 1:CAS:528:DC%2BB3sXovFehu7o%3D 37104592 10202372
H. Li et al. The sequence alignment/map format and SAMtools Bioinformatics 25 2078 2079 19505943 2723002
J.N. Burton et al. Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions Nat. Biotechnol. 31 1119 1125 1:CAS:528:DC%2BC3sXhslWjtLvM 24185095 4117202
N. Kaplan J. Dekker High-throughput genome scaffolding from in vivo DNA interaction frequency Nat. Biotechnol. 31 1143 1147 1:CAS:528:DC%2BC3sXhvVaitLbP 24270850 3880131
G. Marçais C. Kingsford A fast, lock-free approach for efficient parallel counting of occurrences of k-mers Bioinformatics 27 764 770 21217122 3051319
T.R. Ranallo-Benavidez K.S. Jaron M.C. Schatz GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes Nat. Commun. 11 2020NatCo.11.1432R 1:CAS:528:DC%2BB3cXlt1Wisb0%3D 32188846 7080791 1432
H. Cheng G.T. Concepcion X. Feng H. Zhang H. Li Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm Nat. Methods 18 170 175 1:CAS:528:DC%2BB3MXis1OntL0%3D 33526886 7961889
O. Dudchenko et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds Science 356 92 95 2017Sci..356..92D 1:CAS:528:DC%2BC2sXlsVymsbo%3D 28336562 5635820
N.C. Durand et al. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom Cell Syst. 3 99 101 1:CAS:528:DC%2BC2sXhtFKks7w%3D 27467250 5596920
Martin, M. et al. WhatsHap: fast and accurate read-based phasing. Preprint at BioRxivhttps://doi.org/10.1101/085050 (2016).
D. Mapleson G. Garcia Accinelli G. Kettleborough J. Wright B.J. Clavijo KAT: a K-mer analysis toolkit to quality control NGS datasets and genome assemblies Bioinformatics 33 574 576 1:CAS:528:DC%2BC1cXhvFagtrvL 27797770
S. Ou et al. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline Genome Biol. 20 1:CAS:528:DC%2BC1MXisVSntb3O 31843001 6913007 275
J.M. Flynn et al. RepeatModeler2 for automated genomic discovery of transposable element families Proc. Natl Acad. Sci. USA 117 9451 9457 2020PNAS.117.9451F 1:CAS:528:DC%2BB3cXnvFeqt74%3D 32300014 7196820
F. Delehelle S. Cussat-Blanc J.-M. Alliot H. Luga P. Balaresque ASGART: fast and parallel genome scale segmental duplications mapping Bioinformatics 34 2708 2714 1:CAS:528:DC%2BC1MXhtV2gt7fK 30101303
G. Benson Tandem repeats finder: a program to analyze DNA sequences Nucleic Acids Res. 27 573 580 1:CAS:528:DyaK1MXhtVKmtrg%3D 9862982 148217
D. Kim B. Langmead S.L. Salzberg HISAT: a fast spliced aligner with low memory requirements Nat. Methods 12 357 360 1:CAS:528:DC%2BC2MXjvFOnsL0%3D 25751142 4655817
M. Pertea et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads Nat. Biotechnol. 33 290 295 1:CAS:528:DC%2BC2MXivFais70%3D 25690850 4643835
T. Brůna K.J. Hoff A. Lomsadze M. Stanke M. Borodovsky BRAKER2: automatic eukaryotic genome annotation with GeneMark-EP+ and AUGUSTUS supported by a protein database NAR Genomics Bioinformatics 3 lqaa108 33575650 7787252
M. Stanke et al. AUGUSTUS: ab initio prediction of alternative transcripts Nucleic Acids Res. 34 W435 W439 1:CAS:528:DC%2BD28Xps1yiu78%3D 16845043 1538822
A.V. Lukashin M. Borodovsky GeneMark. hmm: new solutions for gene finding Nucleic Acids Res. 26 1107 1115 1:CAS:528:DyaK1cXhvVWksr4%3D 9461475 147337
The Tomato Genome Consortium. The tomato genome sequence provides insights into fleshy fruit evolution Nature 485 635 641 2012Natur.485.635T
W. Li A. Godzik Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences Bioinformatics 22 1658 1659 1:CAS:528:DC%2BD28XmsVent7s%3D 16731699
C. Holt M. Yandell MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects BMC Bioinformatics 12 491 22192575 3280279
L. Venturini S. Caim G.G. Kaithakottil D.L. Mapleson D. Swarbreck Leveraging multiple transcriptome assembly methods for improved gene structure annotation GigaScience 7 30052957 6105091 giy093
B.J. Haas et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies Nucleic Acids Res. 31 5654 5666 1:CAS:528:DC%2BD3sXns1Cntbs%3D 14500829 206470
P. Jones et al. InterProScan 5: genome-scale protein function classification Bioinformatics 30 1236 1240 1:CAS:528:DC%2BC2cXmvFCjsr4%3D 24451626 3998142
S. Marco-Sola J.C. Moure M. Moreto A. Espinosa Fast gap-affine pairwise alignment using the wavefront algorithm Bioinformatics 37 456 463 1:CAS:528:DC%2BB3MXhvFeksrjN 32915952
A. Guarracino S. Heumos S. Nahnsen P. Prins E. Garrison ODGI: understanding pangenome graphs Bioinformatics 38 3319 3326 1:CAS:528:DC%2BB38XisVKmtrrJ 35552372 9237687
Parmigiani, L, Garrison, E, Stoye, J, Marschall, T. & Doerr, D. Panacus: fast and exact pangenome growth and core size estimation. Bioinformatics https://doi.org/10.1093/bioinformatics/btae720 (2024).
Y. Zhou et al. Graph pangenome captures missing heritability and empowers tomato breeding Nature 606 527 534 2022Natur.606.527Z 1:CAS:528:DC%2BB38XhsFWhu7bO 35676474 9200638
B. Buchfink C. Xie D.H. Huson Fast and sensitive protein alignment using DIAMOND Nat. Methods 12 59 60 1:CAS:528:DC%2BC2cXhvFKlsrzN 25402007
D.M. Emms S. Kelly OrthoFinder: phylogenetic orthology inference for comparative genomics Genome Biol. 20 31727128 6857279 238
L.-T. Nguyen H.A. Schmidt A. Von Haeseler B.Q. Minh IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies Mol. Biol. Evol. 32 268 274 1:CAS:528:DC%2BC2MXivFGltrs%3D 25371430
G. Yu D.K. Smith H. Zhu Y. Guan T.T. Lam Y. ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data Methods Ecol. Evol. 8 28 36
J.T. Lovell et al. GENESPACE tracks regions of interest and gene copy number variation across multiple genomes eLife 11 e78526 1:CAS:528:DC%2BB3sXisVOltrfO 36083267 9462846
H. Li New strategies to improve minimap2 alignment accuracy Bioinformatics 37 4572 4574 1:CAS:528:DC%2BB38XhsFKjs7o%3D 34623391 8652018
M. Goel H. Sun W.-B. Jiao K. Schneeberger SyRI: finding genomic rearrangements and local sequence differences from whole-genome assemblies Genome Biol. 20 31842948 6913012 277
P. Danecek et al. Twelve years of SAMtools and BCFtools GigaScience 10 33590861 7931819 giab008
S. Purcell et al. PLINK: a tool set for whole-genome association and population-based linkage analyses Am. J. Hum. Genet. 81 559 575 1:CAS:528:DC%2BD2sXhtVSqurrL 17701901 1950838