Comparison of 3 different variable selection strategies to improve the predictions of fatty acid profile in bovine milk by mid-infrared spectrometry.pdf
Comparison of 3 different variable selection strategies to improve the predictions of fatty acid profile in bovine milk by mid-infrared spectrometry PP.pdf
milk; mid-infrared; variable selection
Abstract :
[en] Mid-infrared (MIR) spectrometry is used to provide phenotypes related to milk composition. A Foss spectrum contains 1,060 data points. Because chemical analysis is costly, the number of reference values available to build a calibration equation is often lower than the number of spectral variables. Collinearity and overfitting problems arise when such a high-dimensional data set is used. This study assessed the value of applying a variable selection (VS) step before partial least squares (PLS) regression. The data set included 1,236 milk spectra with their corresponding fatty acid (FA) contents. Saturated (SFA), monounsaturated (MUFA), polyunsaturated (PUFA), short chain (SCFA), medium chain (MCFA), and long chain FA (LCFA) were studied. The data set was randomly divided into 3 groups, which were used to create 3 calibration and validation data sets. Three VS methods were compared. The first (R2VS) was based on the share of trait variability explained by each spectral variable. The second (BSVS) was based on the regression coefficient estimated by PLS divided by the standard deviation of the corresponding spectral variable. The third (UVEVS) identified uninformative variables as those with the lowest ratio of the average regression coefficient to its corresponding standard deviation, estimated by leave-one-out cross-validation. For UVEVS and BSVS, the cutoff was determined from the known uninformative region of the MIR milk spectrum. The cutoff for R2VS was determined by testing thresholds ranging from 5 to 40%; the most suitable cutoff was 25%. The worst validation root mean square errors of prediction (RMSEPv) were obtained with the full PLS (i.e., without VS). The maximum differences in RMSEPv (g/dl of milk) between the full PLS and the PLS using selected variables were 0.156 for SFA, 0.139 for MUFA, 0.011 for PUFA, 0.025 for SCFA, 0.164 for MCFA, and 0.188 for LCFA. R2VS gave the best results for all studied traits, followed by UVEVS and then BSVS. In conclusion, VS significantly improved the performance of FA MIR equations.
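As an illustration only, the two ratio-based ideas in the abstract can be sketched in plain Python. This is not the authors' implementation. It assumes that the "share of trait variability explained" is the squared Pearson correlation between one spectral variable and the trait, and that the UVE-style cutoff is the largest absolute mean-to-standard-deviation ratio found in the known uninformative region. The function names `r2_select` and `uve_select` and the toy data are hypothetical, and the PLS fits that would produce the leave-one-out coefficients are not shown.

```python
from statistics import mean, stdev


def r_squared(x, y):
    """Squared Pearson correlation between one spectral variable and the trait."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant column explains nothing
    return sxy ** 2 / (sxx * syy)


def r2_select(spectra, trait, threshold=0.25):
    """Indices of spectral variables whose R2 with the trait is >= threshold.

    spectra: list of samples, each a list of absorbances (one per variable).
    threshold: 0.25 mirrors the 25% cutoff reported in the abstract.
    """
    n_vars = len(spectra[0])
    keep = []
    for j in range(n_vars):
        column = [row[j] for row in spectra]
        if r_squared(column, trait) >= threshold:
            keep.append(j)
    return keep


def uve_select(loo_coefs, uninformative):
    """Indices of variables whose |mean(b)| / sd(b) exceeds the cutoff.

    loo_coefs: one regression-coefficient vector per leave-one-out fit
               (assumed to come from PLS models fitted elsewhere).
    uninformative: indices of variables known to carry no signal; the
                   cutoff rule used here is an assumption of this sketch.
    """
    n_vars = len(loo_coefs[0])
    ratio = []
    for j in range(n_vars):
        col = [coefs[j] for coefs in loo_coefs]
        sd = stdev(col)
        ratio.append(abs(mean(col)) / sd if sd > 0 else 0.0)
    cutoff = max(ratio[j] for j in uninformative)
    return [j for j in range(n_vars) if ratio[j] > cutoff]


# Toy data (hypothetical): 5 samples, 3 spectral variables.
toy_trait = [1, 2, 3, 4, 5]
toy_spectra = [[2, 7, 1], [4, 7, -1], [6, 7, 0], [8, 7, -1], [10, 7, 1]]
toy_loo = [[1.0, 0.05, -0.1], [1.1, -0.05, 0.1], [0.9, 0.0, -0.1]]

selected_r2 = r2_select(toy_spectra, toy_trait)
selected_uve = uve_select(toy_loo, uninformative=[2])
```

In the toy data only the first variable tracks the trait, so both selectors retain index 0 alone. With real spectra, the retained indices would then be used to subset the spectral matrix before fitting the PLS calibration.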
Disciplines :
Agriculture & agronomy; Food science; Animal production & animal husbandry
Author, co-author :
Soyeurt, Hélène ; Université de Liège > Agronomie, Bio-ingénierie et Chimie (AgroBioChem) > Statistique, Inform. et Mathém. appliquée à la bioingénierie
Brostaux, Yves ; Université de Liège > Agronomie, Bio-ingénierie et Chimie (AgroBioChem) > Statistique, Inform. et Mathém. appliquée à la bioingénierie