[en] With the rapid development of animal phenomics and deep phenotyping, we can obtain thousands of traditional (but also molecular) phenotypes per individual. However, there is still a lack of exploration regarding how to handle this huge amount of data in the context of animal breeding, presenting a challenge that we are likely to encounter more and more in the future. This study aimed to (1) explore the use of the mega-scale linear mixed model (MegaLMM), a factor model-based approach that is able to simultaneously estimate (co)variance components and genetic parameters in the context of thousands of milk traits, hereafter called thousand-trait (TT) models; (2) compare the phenotype values and genomic breeding value (u) predictions for focal traits (i.e., traits that are targeted for prediction, compared with secondary traits that are helping to evaluate), from single-trait (ST) and TT models, respectively; (3) propose a new approximate method of GEBV (U) prediction with TT models and MegaLMM. We used a total of 3,421 milk mid-infrared (MIR) spectra wavepoints (called secondary traits) and 3 focal traits (average fat percentage [AFP], average methane production [ACH4], and average SCS [ASCS]) collected on 3,302 first-parity Holstein cows. The 3,421 milk MIR wavepoint traits were composed of 311 wavepoints in 11 classes (months in lactation). Genotyping information of 564,439 SNPs was available for all animals and was used to calculate the genomic relationship matrix. The MegaLMM was implemented in the framework of the Bayesian sparse factor model and solved through Gibbs sampling (Markov chain Monte Carlo). The heritabilities of the studied 3,421 milk MIR wavepoints gradually increased and then decreased in units of 311 wavepoints throughout the lactation. The genetic and phenotypic correlations between the first 311 wavepoints and the other 3,110 wavepoints were low. The accuracies of phenotype predictions from the ST model were lower than those from the TT model for AFP (0.51 vs. 0.93), ACH4 (0.30 vs. 0.86), and ASCS (0.14 vs. 0.33). The same trend was observed for the accuracies of u predictions for AFP (0.59 vs. 0.86), ACH4 (0.47 vs. 0.78), and ASCS (0.39 vs. 0.59). The average correlation between U predicted from the TT model and the new approximate method was 0.90. The new approximate method used for estimating U in MegaLMM will enhance the suitability of MegaLMM for applications in animal breeding. This study conducted an initial investigation into the application of thousands of traits in animal breeding and showed that the TT model is beneficial for the prediction of focal traits (phenotype and breeding values), especially for difficult-to-measure traits (e.g., ACH4).
Precision for document type :
Review article
Disciplines :
Zoology
Author, co-author :
Chen, Yansen ; Université de Liège - ULiège > Département GxABT > Animal Sciences (AS)
Atashi, Hadi ; Université de Liège - ULiège > Département GxABT > Animal Sciences (AS)
Qu, Jiayi; Department of Animal Science, University of California Davis, Davis, CA 95616
Delhez, Pauline ; Université de Liège - ULiège > Département GxABT > Animal Sciences (AS)
Runcie, Daniel; Department of Plant Sciences, University of California Davis, Davis, CA 95616
Soyeurt, Hélène ; Université de Liège - ULiège > Département GxABT > Modélisation et développement
Gengler, Nicolas ; Université de Liège - ULiège > TERRA Research Centre > Animal Sciences (AS)
Language :
English
Title :
Exploring a Bayesian sparse factor model-based strategy for the genetic analysis of thousands of mid-infrared spectra traits for animal breeding.
The China Scholarship Council (Beijing) is acknowledged for funding the PhD project of Yansen Chen (no. 201906760007). Yansen Chen acknowledges the support of the Fonds de la Recherche Scientifique (FNRS, Brussels, Belgium) under grant no. T.0095.19 (PDR \u201CDEEPSELECT\u201D). The authors also acknowledge the support of the Walloon Government (Service Public de Wallonie\u2013Direction G\u00E9n\u00E9rale Op\u00E9rationnelle Agriculture, Ressources Naturelles et Environnement, SPW-DGARNE; Namur, Belgium) and the use of the computation resources of the Consortium des \u00C9quipements de Calcul Intensif (C\u00C9CI) funded by the FNRS under grant no. 2.5020.11. The University of Li\u00E8ge\u2013Gembloux Agro-Bio Tech (Li\u00E8ge, Belgium) supported computations through the technical platform Calcul et Mod\u00E9lisation Informatique (CAMI) of the TERRA Teaching and Research Centre supported by the FNRS under grant no. T.0095.19 (PDR \u201CDEEPSELECT\u201D). Genotyping was facilitated through the support of the FNRS under grant no. J.0174.18 (CDR \u201CPREDICT-2\u201D). D. Runcie was supported by Agriculture and Food Research Initiative grant nos. 2020-67013-30904 and 2018-67015-27957 from the USDA National Institute of Food and Agriculture (NIFA) and by USDA NIFA Hatch project 1010469. Supplemental material for this article is available at https://github.com/Yansen0515/MegaLMM_for_Animal. No human or animal subjects were used, so this analysis did not require approval by an Institutional Animal Care and Use Committee or Institutional Review Board. The authors have not stated any conflicts of interest. Nonstandard abbreviations used: AI-REML = average information REML; ACH4 = average methane production; AFP = average fat percentage; ASCS = average SCS; FP = fat percentage; HTP = high-throughput phenotyping; MCMC = Markov chain Monte Carlo; MegaLMM = mega-scale linear mixed model; MIR = mid-infrared; MT = multitrait; NA = not applicable; PC = principal components; ST = single-trait; TT = thousand-trait; \u00DB = GEBV of multiple traits from new approximated methods; U = GEBV of multiple traits; u = GEBV of a single trait.The China Scholarship Council (Beijing) is acknowledged for funding the PhD project of Yansen Chen (no. 201906760007). Yansen Chen thanks the support by the Fonds de la Recherche Scientifique ( FNRS , Brussels, Belgium) under grant no. T.0095.19 (PDR \u201CDEEPSELECT\u201D). The authors also acknowledged the support of the Walloon Government (Service Public de Wallonie \u2013 Direction G\u00E9n\u00E9rale Op\u00E9rationnelle Agriculture, Ressources Naturelles et Environnement, SPW-DGARNE; Namur, Belgium) and the use of the computation resources of the Consortium des \u00C9quipements de Calcul Intensif (C\u00C9CI) funded by the FNRS under Grant no. 2.5020.11. The University of Li\u00E8ge\u2013Gembloux Agro-Bio Tech (Li\u00E8ge, Belgium) supported computations through the technical platform Calcul et Mod\u00E9lisation Informatique (CAMI) of the TERRA Teaching and Research Centre supported by the FNRS under grant no. T.0095.19 (PDR \u201CDEEPSELECT\u201D). Genotyping was facilitated through the support of the FNRS under grant no. J.0174.18 (CDR \u201CPREDICT-2\u201D). D. Runcie was supported by Agriculture and Food Research Initiative grants no. 2020-67013-30904 and 2018-67015-27957 from the USDA National Institute of Food and Agriculture and by the United States Department of Agriculture (USDA) National Institute of Food and Agriculture (NIFA), Hatch project 1010469. The authors declare that they have no competing interests.
Bonfatti, V., Vicario, D., Degano, L., Lugo, A., Carnier, P., Comparison between direct and indirect methods for exploiting Fourier transform spectral information in estimation of breeding values for fine composition and technological properties of milk. J. Dairy Sci. 100 (2017), 2057–2067 https://doi.org/10.3168/jds.2016-11951 28109603.
Brito, L.F., Bedere, N., Douhard, F., Oliveira, H.R., Arnal, M., Peñagaricano, F., Schinckel, A.P., Baes, C.F., Miglior, F., Review: Genetic selection of high-yielding dairy cattle toward sustainable farming systems in a rapidly changing world. Animal, 15, 2021, 100292 https://doi.org/10.1016/j.animal.2021.100292 34294547.
Chen, Y., Atashi, H., Grelet, C., Mota, R.R., Vanderick, S., Hu, H., GplusE Consortium. Gengler, N., Genome-wide association study and functional annotation analyses for nitrogen efficiency index and its composition traits in dairy cattle. J. Dairy Sci. 106 (2023), 3397–3410 https://doi.org/10.3168/jds.2022-22351 36894424.
Chen, Y., Delhez, P., Atashi, H., Soyeurt, H., Gengler, N., Genetic analyses of principal components of milk mid-infrared spectra from Holstein cows. The 74th Annual Meeting of the European Federation of Animal Science. Lyon, France. https://orbi.uliege.be/bitstream/2268/307832/1/EAAP_Yansen_Chen_Genetic_PCs_.pdf, 2023. (Accessed 14 August 2024)
Cole, J.B., Dürr, J.W., Nicolazzi, E.L., Invited review: The future of selection decisions and breeding programs—What are we breeding for, and who decides?. J. Dairy Sci. 104 (2021), 5111–5124 https://doi.org/10.3168/jds.2020-19777 33714581.
Cole, J.B., Eaglen, S.A.E., Maltecca, C., Mulder, H.A., Pryce, J.E., The future of phenomics in dairy cattle breeding. Anim. Front. 10 (2020), 37–44 https://doi.org/10.1093/af/vfaa007 32257602.
Delhez, P., Colinet, F., Vanderick, S., Bertozzi, C., Gengler, N., Soyeurt, H., Predicting milk mid-infrared spectra from first-parity Holstein cows using a test-day mixed model with the perspective of herd management. J. Dairy Sci. 103 (2020), 6258–6270 https://doi.org/10.3168/jds.2019-17717 32418684.
Du, C., Nan, L., Yan, L., Bu, Q., Ren, X., Zhang, Z., Sabek, A., Zhang, S., Genetic analysis of milk production traits and mid-infrared spectra in Chinese Holstein population. Animals (Basel), 10, 2020, 139 https://doi.org/10.3390/ani10010139 31952258.
Ducrocq, V., Chapuis, H., Generalizing the use of the canonical transformation for the solution of multivariate mixed model equations. Genet. Sel. Evol. 29 (1997), 205–224 https://doi.org/10.1186/1297-9686-29-2-205.
Gengler, N., Soyeurt, H., Dehareng, F., Bastin, C., Colinet, F., Hammami, H., Vanrobays, M.L., Lainé, A., Vanderick, S., Grelet, C., Vanlierde, A., Froidmont, E., Dardenne, P., Capitalizing on fine milk composition for breeding and management of dairy cows. J. Dairy Sci. 99 (2016), 4071–4079 https://doi.org/10.3168/jds.2015-10140 26778306.
Grelet, C., Dardenne, P., Soyeurt, H., Fernandez, J.A., Vanlierde, A., Stevens, F., Gengler, N., Dehareng, F., Large-scale phenotyping in dairy sector using milk MIR spectra: Key factors affecting the quality of predictions. Methods 186 (2021), 97–111 https://doi.org/10.1016/j.ymeth.2020.07.012 32763376.
Grewal, M.K., Huppertz, T., Vasiljevic, T., FTIR fingerprinting of structural changes of milk proteins induced by heat treatment, deamidation and dephosphorylation. Food Hydrocoll. 80 (2018), 160–167 https://doi.org/10.1016/j.foodhyd.2018.02.010.
Jensen, J., Mao, I.L., Transformation algorithms in analysis of single trait and of multitrait models with equal design matrices and one random factor per trait: A review. J. Anim. Sci. 66 (1988), 2750–2761 https://doi.org/10.2527/jas1988.66112750x.
Kandel, P.B., Vanrobays, M.L., Vanlierde, A., Dehareng, F., Froidmont, E., Gengler, N., Soyeurt, H., Genetic parameters of mid-infrared methane predictions and their relationships with milk production traits in Holstein cattle. J. Dairy Sci. 100 (2017), 5578–5591 https://doi.org/10.3168/jds.2016-11954 28527796.
Legarra, A., Reverter, A., Semi-parametric estimates of population accuracy and bias of predictions of breeding values and future phenotypes using the LR method. Genet. Sel. Evol., 50, 2018, 53 https://doi.org/10.1186/s12711-018-0426-6 30400768.
Meyer, K., Houle, D., Sampling based approximation of confidence intervals for functions of genetic covariance matrices. Proc. Association for the Advancement of Animal Breeding and Genetics, 2013, Association for the Advancement of Animal Breeding and Genetics, 523–526.
Misztal, I., Tsuruta, S., Lourenco, D., Masuda, Y., Aguilar, I., Legarra, A., Vitezica, Z., Manual for BLUPF90 family of programs. http://nce.ads.uga.edu/wiki/lib/exe/fetch.php?media=blupf90_all8.pdf, 2014. (Accessed 26 June 2023)
Paiva, J.T., Mota, R.R., Lopes, P.S., Hammami, H., Vanderick, S., Oliveira, H.R., Veroneze, R., Fonseca e Silva, F., Gengler, N., Random regression test-day models to describe milk production and fatty acid traits in first lactation Walloon Holstein cows. J. Anim. Breed. Genet. 139 (2022), 398–413 https://doi.org/10.1111/jbg.12673 35201644.
Qu, J., Runcie, D., Cheng, H., Mega-scale Bayesian regression methods for genome-wide prediction and association studies with thousands of traits. Genetics, 223, 2023, iyac183 https://doi.org/10.1093/genetics/iyac183 36529897.
Rovere, G., de los Campos, G., Tempelman, R.J., Vazquez, A.I., Miglior, F., Schenkel, F., Cecchinato, A., Bittante, G., Toledo-Alvarado, H., Fleming, A., A landscape of the heritability of Fourier-transform infrared spectral wavelengths of milk samples by parity and lactation stage in Holstein cows. J. Dairy Sci. 102 (2019), 1354–1363 https://doi.org/10.3168/jds.2018-15109 30580946.
Runcie, D.E., Qu, J., Cheng, H., Crawford, L., MegaLMM: Mega-scale linear mixed models for genomic predictions with thousands of traits. Genome Biol., 22, 2021, 213 https://doi.org/10.1186/s13059-021-02416-w 34301310.
Sargolzaei, M., Chesnais, J.P., Schenkel, F.S., A new approach for efficient genotype imputation using information from relatives. BMC Genomics, 15, 2014, 478 10.1186/1471-2164-15-478.
Shadpour, S., Chud, T.C., Hailemariam, D., Plastow, G., Oliveira, H.R., Stothard, P., Lassen, J., Miglior, F., Baes, C.F., Tulpan, D., Schenkel, F.S., Predicting methane emission in Canadian Holstein dairy cattle using milk mid-infrared reflectance spectroscopy and other commonly available predictors via artificial neural networks. J. Dairy Sci. 105 (2022), 8272–8285 https://doi.org/10.3168/jds.2021-21176 36055858.
Silva, F.F., Morota, G., Rosa, G.J.M., Editorial: High-throughput phenotyping in the genomic improvement of livestock. Front. Genet., 12, 2021, 707343 https://doi.org/10.3389/fgene.2021.707343 34220969.
Soyeurt, H., Dardenne, P., Dehareng, F., Lognay, G., Veselko, D., Marlier, M., Bertozzi, C., Mayeres, P., Gengler, N., Estimating fatty acid content in cow milk using mid-infrared spectrometry. J. Dairy Sci. 89 (2006), 3690–3695 https://doi.org/10.3168/jds.S0022-0302(06)72409-2 16899705.
Soyeurt, H., Misztal, I., Gengler, N., Genetic variability of milk components based on mid-infrared spectral data. J. Dairy Sci. 93 (2010), 1722–1728 https://doi.org/10.3168/jds.2009-2614 20338450.
Suravajhala, P., Kogelman, L.J.A., Kadarmideen, H.N., Multi-omic data integration and analysis using systems genomics approaches: methods and applications in animal production, health and welfare. Genet. Sel. Evol., 48, 2016, 38 https://doi.org/10.1186/s12711-016-0217-x 27130220.
Tiezzi, F., Fleming, A., Malchiodi, F., Use of milk infrared spectral data as environmental covariates in genomic prediction models for production traits in canadian holstein. Animals (Basel), 12, 2022, 1189 https://doi.org/10.3390/ani12091189 35565615.
Tiplady, K.M., Lopdell, T.J., Reynolds, E., Sherlock, R.G., Keehan, M., Johnson, T.J., Pryce, J.E., Davis, S.R., Spelman, R.J., Harris, B.L., Garrick, D.J., Littlejohn, M.D., Sequence-based genome-wide association study of individual milk mid-infrared wavenumbers in mixed-breed dairy cattle. Genet. Sel. Evol., 53, 2021, 62 https://doi.org/10.1186/s12711-021-00648-9 34284721.
Toledo-Alvarado, H.O., Tempelman, R.J., Lopez-Cruz, M., VandeHaar, M.J., Santos, J.E.P., Peñagaricano, F., Khanal, P., de los Campos, G., Phenotypic and genetic associations between feed efficiency and FTIR milk-spectra. Proc. 12th World Congress on Genetics Applied to Livestock Production, 2022, 252–255 https://doi.org/10.3920/978-90-8686-940-4_51.
Vanderick, S., Mota, R.R., Wijnrocs, K., Gengler, N., Description of the genetic evaluation systems used in the Walloon Region of Belgium. https://www.elinfo.be/indexEN.html, 2022. (Accessed 17 October 2023)
Vanlierde, A., Dehareng, F., Gengler, N., Froidmont, E., McParland, S., Kreuzer, M., Bell, M., Lund, P., Martin, C., Kuhla, B., Soyeurt, H., Improving robustness and accuracy of predicted daily methane emissions of dairy cows using milk mid-infrared spectra. J. Sci. Food Agric. 101 (2021), 3394–3403 https://doi.org/10.1002/jsfa.10969 33222175.
VanRaden, P.M., Lactation yields and accuracies computed from test day yields and (co)variances by best prediction. J. Dairy Sci. 80 (1997), 3015–3022 https://doi.org/10.3168/jds.S0022-0302(97)76268-4 9406095.
VanRaden, P.M., Efficient methods to compute genomic predictions. J. Dairy Sci. 91 (2008), 4414–4423 https://doi.org/10.3168/jds.2007-0980 18946147.
Zaalberg, R.M., Janss, L., Buitenhuis, A.J., Genome-wide association study on Fourier transform infrared milk spectra for two Danish dairy cattle breeds. BMC Genet., 21, 2020, 9 https://doi.org/10.1186/s12863-020-0810-4 32005101.
Zaalberg, R.M., Shetty, N., Janss, L., Buitenhuis, A.J., Genetic analysis of Fourier transform infrared milk spectra in Danish Holstein and Danish Jersey. J. Dairy Sci. 102 (2019), 503–510 https://doi.org/10.3168/jds.2018-14464 30343907.