Soybean (Glycine max) is a leguminous plant with a broad range of applications, particularly in agriculture and food production, where its seed composition, especially oil and protein content, is highly valued. Improving these traits is a primary focus of soybean breeding programs. In this study, we conducted a genome-wide association study (GWAS) to identify genetic loci linked to seed oil and protein content, using imputed genotype data for 180 Eurasian soybean varieties and the novel "genotypic twins" approach. The dataset comprised 87 Russian and European cultivars and 93 breeding lines from Western Siberia. We identified 11 novel loci significantly associated with seed oil or protein content (p-value < 1.5 × 10⁻⁶): one locus on chromosome 11 linked to protein content and 10 loci associated with oil content (chromosomes 1, 5, 11, 16, 17, and 18). The protein-associated locus lies near a gene encoding a CBL-interacting protein kinase, which is involved in key biological processes, including responses to stresses such as drought and osmotic stress. The oil-associated loci were linked to genes with diverse functions, including lipid transport, nutrient reservoir activity, and stress responses; examples are genes encoding Sec14p-like phosphatidylinositol transfer proteins and Germin-like proteins. These findings suggest that the identified loci not only influence oil and protein content but may also contribute to plant resilience under environmental stress. The results provide genetic markers that can be used in breeding programs to optimize oil and protein content, particularly in varieties adapted to Russian climates, and can contribute to the development of high-yielding, nutritionally enhanced soybean cultivars.
Disciplines :
Genetics & genetic processes
Author, co-author :
Potapova, Nadezhda A ; Kurchatov Genomics Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia
Zorkoltseva, Irina V ; Kurchatov Genomics Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia ; The Federal Research Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia
Zlobin, Alexander ; Université de Liège - ULiège > Département des sciences biomédicales et précliniques ; Kurchatov Genomics Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia
Shcherban, Andrey B ; Kurchatov Genomics Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia ; The Federal Research Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia
Fedyaeva, Anna V ; The Federal Research Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia
Salina, Elena A ; Kurchatov Genomics Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia ; The Federal Research Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia
Svishcheva, Gulnara R ; Kurchatov Genomics Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia ; The Federal Research Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia ; Institute of General Genetics RAS, Gubkin St. 3, 119333 Moscow, Russia
Aksenovich, Tatiana I ; The Federal Research Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia
Tsepilov, Yakov A ; Kurchatov Genomics Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia ; The Federal Research Center, Institute of Cytology and Genetics SB RAS, Lavrentiev Av. 10, 630090 Novosibirsk, Russia
Language :
English
Title :
Genome-Wide Association Study on Imputed Genotypes of 180 Eurasian Soybean Glycine max Varieties for Oil and Protein Contents in Seeds.