[en] Among the dairy sector's current concerns, the assessment of global animal health status is a complex challenge. Its multidimensionality means that global monitoring tools are rarely considered. Instead, specific disease detection is often studied separately and, due to financial and ethical issues, uses small-scale data sets focusing on few biomarkers. Several studies have already been conducted using milk Fourier transform mid-infrared (FT-MIR) spectroscopy to detect mastitis and lameness or to quantify health-related biomarkers in milk or blood. Those studies are relevant but they focus mainly on one biomarker or disease. To solve this issue and the small-scale data set, in this study, we proposed a holistic approach using big data obtained from milk recording, including milk yield, somatic cell count, and 27 FT-MIR-based predictors related to milk composition and animal health status. Using 740,454 records collected from 114,536 first-parity Holstein cows in southern Belgium, we performed repeated unsupervised learning algorithms based on Ward's agglomerative hierarchical clustering method to find potential interesting patterns. A divide-and-conquer approach was used to overcome the limitation of computational resources in clustering a relatively large data set. Five groups of records were identified. Differences observed in the fourth group suggested a relationship to metabolic disorders. The fifth group seemed to be related to mastitis. In a second step, we performed a partial least squares discriminant analysis (PLS-DA) to predict the probability of belonging to those specific groups for the entire data set. The obtained global accuracy was 0.77 and the balanced accuracy (i.e., the mean between sensitivity and specificity) of discriminating the fourth and fifth groups was 0.88 and 0.96, respectively. Then, a validation of the interpretation of those groups was performed using 204 milk and blood reference records. The predicted probability associated with the metabolic disorders issue had significant correlations of 0.54 with blood β-hydroxybutyrate, 0.44 with blood nonesterified fatty acids, -0.32 with blood glucose, -0.23 with milk glucose-6-phosphate, and 0.38 with milk isocitrate. In contrast, the predicted probability of belonging to the mastitis group had correlations of 0.69 with milk lactate dehydrogenase, 0.46 with milk N-acetyl-β-d-glucosaminidase, -0.18 with milk free glucose, and 0.16 with milk glucose-6-phosphate. Consequently, these results suggest that the obtained quantitative traits indirectly reflect some of the main health disorders in dairy farming and could be used to monitor dairy cows on a large scale. By using unsupervised learning on large-scale milk recording data and then validating the pattern using reference laboratory measures, we propose a new approach to quickly assess dairy cow health status.
Disciplines :
Animal production & animal husbandry Food science
Author, co-author :
Franceschini, Sébastien ; Université de Liège - ULiège > Département GxABT > Modélisation et développement
Grelet, C; Walloon Agricultural Research Center (CRA-W), 5030 Gembloux, Belgium
Leblois, J; Walloon Breeders Association Group (Elevéo by Awé groupe), 5590 Ciney, Belgium
Gengler, Nicolas ; Université de Liège - ULiège > TERRA Research Centre > Ingénierie des productions animales et nutrition
Soyeurt, Hélène ; Université de Liège - ULiège > Département GxABT > Modélisation et développement
GplusE consortium
Language :
English
Title :
Can unsupervised learning methods applied to milk recording big data provide new insights into dairy cow health?
Alternative titles :
[fr] Peut-on utiliser des méthodes d'apprentissages non-supervisés sur des données massives issues du contrôle laitier pour détecter des problèmes de santé?
The authors acknowledge the support of the Walloon Government (Service Public de Wallonie, Namur, Belgium) through the ScorWelCow project (Grant agreement D31-1390) and their co-financing of the INTERREG NWE HappyMoo project (Grant agreement NWE 730), which is also acknowledged for its support of this research. The GplusE project has received funding from the European Union's Seventh Framework Program (Brussels, Belgium) for research, technological development, and demonstration, under Grant agreement no. 613689. The views expressed in this publication are the sole responsibility of the authors and do not necessarily reflect the views of the European Commission. Also acknowledged are the use of the computation resources of the University of Liège–Gembloux Agro-Bio Tech (Gembloux, Belgium) provided by the technical platform Calcul et Modélisation Informatique (CAMI) of the TERRA Teaching and Research Centre, partly supported by the National Fund for Scientific Research (F.R.S.-FNRS, Brussels, Belgium) under Grants No. T.0095.19 (PDR “DEEPSELECT”) and J.0174.18 (CDR “PREDICT-2”). The authors have not stated any conflicts of interest.
Atashi, H., Salavati, M., De Koster, J., Crowe, M.A., Opsomer, G., Hostens, M., Genome-wide association for metabolic clusters in early-lactation Holstein dairy cows. J. Dairy Sci. 103 (2020), 6392–6406 https://doi.org/10.3168/jds.2019-17369 32331880.
Bansal, B.K., Hamann, J., Grabowski, N.T., Singh, K.B., Variation in the composition of selected milk fraction samples from healthy and mastitic quarters, and its significance for mastitis diagnosis. J. Dairy Res. 72 (2005), 144–152 https://doi.org/10.1017/S0022029905000798 15909679.
Bastin, C., Théron, L., Lainé, A., Gengler, N., On the role of mid-infrared predicted phenotypes in fertility and health dairy breeding programs. J. Dairy Sci. 99 (2016), 4080–4094 https://doi.org/10.3168/jds.2015-10087 26723123.
Belay, T.K., Dagnachew, B.S., Kowalski, Z.M., Ådnøy, T., An attempt at predicting blood β-hydroxybutyrate from Fourier-transform mid-infrared spectra of milk using multivariate mixed models in Polish dairy cattle. J. Dairy Sci. 100 (2017), 6312–6326 https://doi.org/10.3168/jds.2016-12252 28571989.
Blokhuis, H.J., Jones, R.B., Geers, R., Miele, M., Veissier, I., Measuring and monitoring animal welfare: Transparency in the food product quality chain. Anim. Welf. 12 (2003), 445–455.
Bonfatti, V., Turner, S.-A., Kuhn-Sherlock, B., Luke, T.D.W., Ho, P.N., Phyn, C.V.C., Pryce, J.E., Prediction of blood β-hydroxybutyrate content and occurrence of hyperketonemia in early-lactation, pasture-grazed dairy cows using milk infrared spectra. J. Dairy Sci. 102 (2019), 6466–6476 https://doi.org/10.3168/jds.2018-15988 31079906.
Bonfatti, V., Vicario, D., Lugo, A., Carnier, P., Genetic parameters of measures and population-wide infrared predictions of 92 traits describing the fine composition and technological properties of milk in Italian Simmental cattle. J. Dairy Sci. 100 (2017), 5526–5540 https://doi.org/10.3168/jds.2016-11667 28478002.
Bresolin, T., Dórea, J.R.R., Infrared spectrometry as a high-throughput phenotyping technology to predict complex traits in livestock systems. Front. Genet., 11, 2020, 923 https://doi.org/10.3389/fgene.2020.00923 32973876.
Christophe, O.S., Grelet, C., Bertozzi, C., Veselko, D., Lecomte, C., Höeckels, P., Werner, A., Auer, F.-J., Gengler, N., Dehareng, F., Soyeurt, H., Multiple breeds and countries' predictions of mineral contents in milk from milk mid-infrared spectrometry. Foods, 10, 2021, 2235 https://doi.org/10.3390/foods10092235 34574345.
De Koster, J., Salavati, M., Grelet, C., Crowe, M.A., Matthews, E., O'Flaherty, R., Opsomer, G., Foldager, L., Hostens, M., Prediction of metabolic clusters in early-lactation dairy cows using models based on milk biomarkers. J. Dairy Sci. 102 (2019), 2631–2644 https://doi.org/10.3168/jds.2018-15533 30692010.
De Marchi, M., Toffanin, V., Cassandro, M., Penasa, M., Invited review: Mid-infrared spectroscopy as phenotyping tool for milk traits. J. Dairy Sci. 97 (2014), 1171–1186 https://doi.org/10.3168/jds.2013-6799 24440251.
Ebrahimie, E., Ebrahimi, F., Ebrahimi, M., Tomlinson, S., Petrovski, K.R., A large-scale study of indicators of sub-clinical mastitis in dairy cattle by attribute weighting analysis of milk composition features: Highlighting the predictive power of lactose and electrical conductivity. J. Dairy Res. 85 (2018), 193–200 https://doi.org/10.1017/S0022029918000249 29785910.
Esposito, G., Irons, P.C., Webb, E.C., Chapwanya, A., Interactions between negative energy balance, metabolic diseases, uterine health and immune response in transition dairy cows. Anim. Reprod. Sci. 144 (2014), 60–71 https://doi.org/10.1016/j.anireprosci.2013.11.007 24378117.
Fernando, R.S., Spahr, S.L., Jaster, E.H., Comparison of electrical conductivity of milk with other indirect methods for detection of subclinical mastitis. J. Dairy Sci. 68 (1985), 449–456 https://doi.org/10.3168/jds.S0022-0302(85)80844-4 3989085.
Fiore, E., Blasi, F., Morgante, M., Cossignani, L., Badon, T., Gianesella, M., Contiero, B., Berlanda, M., Changes of milk fatty acid composition in four lipid classes as biomarkers for the diagnosis of bovine ketosis using bioanalytical thin layer chromatography and gas chromatographic techniques (TLC-GC). J. Pharm. Biomed. Anal., 188, 2020, 113372 https://doi.org/10.1016/j.jpba.2020.113372 32502957.
Fleming, A., Schenkel, F.S., Chen, J., Malchiodi, F., Bonfatti, V., Ali, R.A., Mallard, B., Corredig, M., Miglior, F., Prediction of milk fatty acid content with mid-infrared spectroscopy in Canadian dairy cattle using differently distributed model development sets. J. Dairy Sci. 100 (2017), 5073–5081 https://doi.org/10.3168/jds.2016-12102 28434722.
Foldager, L., Gaillard, C., Sorensen, M.T., Larsen, T., Matthews, E., O'Flaherty, R., Carter, F., Crowe, M.A., Grelet, C., Salavati, M., Hostens, M., Ingvartsen, K.L., Krogh, M.A., Predicting physiological imbalance in Holstein dairy cows by three different sets of milk biomarkers. Prev. Vet. Med., 179, 2020, 105006 https://doi.org/10.1016/j.prevetmed.2020.105006 32361640.
Fraser, D., Science, values and animal welfare: Exploring the ‘inextricable connection’. Anim. Welf. 4 (1995), 103–117.
Gandomi, A., Haider, M., Beyond the hype: Big data concepts, methods, and analytics. Int. J. Inf. Manage. 35 (2015), 137–144 https://doi.org/10.1016/j.ijinfomgt.2014.10.007.
Gengler, N., Soyeurt, H., Dehareng, F., Bastin, C., Colinet, F., Hammami, H., Vanrobays, M.-L., Lainé, A., Vanderick, S., Grelet, C., Vanlierde, A., Froidmont, E., Dardenne, P., Capitalizing on fine milk composition for breeding and management of dairy cows1. J. Dairy Sci. 99 (2016), 4071–4079 https://doi.org/10.3168/jds.2015-10140 26778306.
Grelet, C., Bastin, C., Gelé, M., Davière, J.-B., Johan, M., Werner, A., Reding, R., Fernandez Pierna, J.A., Colinet, F.G., Dardenne, P., Gengler, N., Soyeurt, H., Dehareng, F., Development of Fourier transform mid-infrared calibrations to predict acetone, β-hydroxybutyrate, and citrate contents in bovine milk through a European dairy network. J. Dairy Sci. 99 (2016), 4816–4825 https://doi.org/10.3168/jds.2015-10477 27016835.
Grelet, C., Dardenne, P., Soyeurt, H., Fernandez, J.A., Vanlierde, A., Steevens, F., Gengler, N., Dehareng, F., Large-scale phenotyping in dairy sector using milk MIR spectra: Key factors affecting the quality of predictions. Methods 86 (2021), 97–111 https://doi.org/10.1016/j.ymeth.2020.07.012 32763376.
Grelet, C., Fernandez Pierna, J., Soyeurt, H., Dehareng, F., Gengler, N., Dardenne, P., Creation of universal MIR calibration by standardization of milk spectra: example of fatty acids. Book of Abstracts of the 65th Annual Meeting of the European Federation of Animal Science, 2014, Wageningen Academic Publishers, 108.
Grelet, C., Froidmont, E., Foldager, L., Salavati, M., Hostens, M., Ferris, C.P., Ingvartsen, K.L., Crowe, M.A., Sorensen, M.T., Fernandez Pierna, J.A., Vanlierde, A., Gengler, N., Dehareng, F., Potential of milk mid-infrared spectra to predict nitrogen use efficiency of individual dairy cows in early lactation. J. Dairy Sci. 103 (2020), 4435–4445 https://doi.org/10.3168/jds.2019-17910 32147266.
Grelet, C., Vanlierde, A., Dehareng, F., Froidmont, E., Consortium, G., Prediction of energy status of dairy cows using MIR milk spectra. Book of Abstracts of the 68th Annual Meeting of the European Federation of Animal Science, Tallin, 2017, Wageningen Academic Publishers, 403.
Grelet, C., Vanlierde, A., Hostens, M., Foldager, L., Salavati, M., Ingvartsen, K.L., Crowe, M., Sorensen, M.T., Froidmont, E., Ferris, C.P., Marchitelli, C., Becker, F., Larsen, T., Carter, F., Dehareng, F., Potential of milk mid-IR spectra to predict metabolic status of cows through blood components and an innovative clustering approach. Animal 13 (2019), 649–658 https://doi.org/10.1017/S1751731118001751 29987991.
Gross, J., van Dorland, H.A., Bruckmaier, R.M., Schwarz, F.J., Performance and metabolic profile of dairy cows during a lactational and deliberately induced negative energy balance with subsequent realimentation. J. Dairy Sci. 94 (2011), 1820–1830 https://doi.org/10.3168/jds.2010-3707 21426971.
Ingvartsen, K.L., Feeding- and management-related diseases in the transition cow: Physiological adaptations around calving and strategies to reduce feeding-related diseases. Anim. Feed Sci. Technol. 126 (2006), 175–213 https://doi.org/10.1016/j.anifeedsci.2005.08.003.
Krogh, M.A., Hostens, M., Salavati, M., Grelet, C., Sorensen, M.T., Wathes, D.C., Ferris, C.P., Marchitelli, C., Signorelli, F., Napolitano, F., Becker, F., Larsen, T., Matthews, E., Carter, F., Vanlierde, A., Opsomer, G., Gengler, N., Dehareng, F., Crowe, M.A., Ingvartsen, K.L., Foldager, L., Between- and within-herd variation in blood and milk biomarkers in Holstein cows in early lactation. Animal 14 (2020), 1067–1075 https://doi.org/10.1017/S1751731119002659 31694730.
Larsen, T., Fluorometric determination of free and total isocitrate in bovine milk. J. Dairy Sci. 97 (2014), 7498–7504 https://doi.org/10.3168/jds.2014-8018 25262187.
Larsen, T., Moyes, K.M., Are free glucose and glucose-6-phosphate in milk indicators of specific physiological states in the cow?. Animal 9 (2015), 86–93 https://doi.org/10.1017/S1751731114002043 25159717.
Larsen, T., Røntved, C.M., Ingvartsen, K.L., Vels, L., Bjerring, M., Enzyme activity and acute phase proteins in milk utilized as indicators of acute clinical E. coli LPS-induced mastitis. Animal 4 (2010), 1672–1679 https://doi.org/10.1017/S1751731110000947 22445120.
LeBlanc, S., Monitoring metabolic health of dairy cattle in the transition period. J. Reprod. Dev. 56:Suppl. (2010), S29–S35 https://doi.org/10.1262/jrd.1056S29 20629214.
Luke, T.D.W., Rochfort, S., Wales, W.J., Bonfatti, V., Marett, L., Pryce, J.E., Metabolic profiling of early-lactation dairy cows using milk mid-infrared spectra. J. Dairy Sci. 102 (2019), 1747–1760 https://doi.org/10.3168/jds.2018-15103 30594377.
McParland, S., Berry, D.P., The potential of Fourier transform infrared spectroscopy of milk samples to predict energy intake and efficiency in dairy cows. J. Dairy Sci. 99 (2016), 4056–4070 https://doi.org/10.3168/jds.2015-10051 26947296.
NRC, Nutrient Requirements of Dairy Cattle. 7th rev. ed., 2001, The National Academies Press.
Pralle, R.S., Weigel, K.W., White, H.M., Predicting blood β-hydroxybutyrate using milk Fourier transform infrared spectrum, milk composition, and producer-reported variables with multiple linear regression, partial least squares regression, and artificial neural network. J. Dairy Sci. 101 (2018), 4378–4387 https://doi.org/10.3168/jds.2017-14076 29477523.
Smith, S.L., Denholm, S.J., Coffey, M.P., Wall, E., Energy profiling of dairy cows from routine milk mid-infrared analysis. J. Dairy Sci. 102 (2019), 11169–11179 https://doi.org/10.3168/jds.2018-16112 31587910.
Soyeurt, H., Bastin, C., Colinet, F.G., Arnould, V.M.-R., Berry, D.P., Wall, E., Dehareng, F., Nguyen, H.N., Dardenne, P., Schefers, J., Vandenplas, J., Weigel, K., Coffey, M., Théron, L., Detilleux, J., Reding, E., Gengler, N., McParland, S., Mid-infrared prediction of lactoferrin content in bovine milk: Potential indicator of mastitis. Animal 6 (2012), 1830–1838 https://doi.org/10.1017/S1751731112000791 22717388.
Soyeurt, H., Grelet, C., McParland, S., Calmels, M., Coffey, M., Tedde, A., Delhez, P., Dehareng, F., Gengler, N., A comparison of 4 different machine learning algorithms to predict lactoferrin content in bovine milk from mid-infrared spectra. J. Dairy Sci. 103 (2020), 11585–11596 https://doi.org/10.3168/jds.2020-18870 33222859.
Suthar, V.S., Canelas-Raposo, J., Deniz, A., Heuwieser, W., Prevalence of subclinical ketosis and relationships with postpartum diseases in European dairy cows. J. Dairy Sci. 96 (2013), 2925–2938 https://doi.org/10.3168/jds.2012-6035 23497997.
Tedde, A., Grelet, C., Ho, P.N., Pryce, J.E., Hailemariam, D., Wang, Z., Plastow, G., Gengler, N., Brostaux, Y., Froidmont, E., Dehareng, F., Bertozzi, C., Crowe, M.A., Dufrasne, I., Soyeurt, H., GplusE Consortium. Validation of dairy cow bodyweight prediction using traits easily recorded by dairy herd improvement organizations and its potential improvement using feature selection algorithms. Animals (Basel), 11, 2021, 1288 https://doi.org/10.3390/ani11051288.
Vanlierde, A., Dehareng, F., Gengler, N., Froidmont, E., McParland, S., Kreuzer, M., Bell, M., Lund, P., Martin, C., Kuhla, B., Soyeurt, H., Improving robustness and accuracy of predicted daily methane emissions of dairy cows using milk mid-infrared spectra. J. Sci. Food Agric. 101 (2021), 3394–3403 https://doi.org/10.1002/jsfa.10969 33222175.
Vranković, L., Aladrović, J., Octenjak, D., Bijelić, D., Cvetnić, L., Stojević, Z., Milk fatty acid composition as an indicator of energy status in Holstein dairy cows. Arch. Tierzucht 60 (2017), 205–212 https://doi.org/10.5194/aab-60-205-2017.
Wang, C., Chen, M.-H., Schifano, E., Wu, J., Yan, J., Statistical methods and computing for big data. Stat. Interface 9 (2016), 399–414 https://doi.org/10.4310/SII.2016.v9.n4.a1 27695593.
Ward, J.H. Jr., Hierarchical grouping to optimize an objective function. J. Am. Stat. Assoc. 58 (1963), 236–244 https://doi.org/10.1080/01621459.1963.10500845.
Yan, M.-J., Humphreys, J., Holden, N.M., An evaluation of life cycle assessment of European milk production. J. Environ. Manage. 92 (2011), 372–379 https://doi.org/10.1016/j.jenvman.2010.10.025 21055870.