No full text
Unpublished conference/Abstract (Scientific congresses and symposiums)
Going deeper in tile-based processing for complex GC×GC-TOFMS datasets
Stefanuto, Pierre-Hugues; Cain, Caitlin; Synovec, Robert et al.
2023, 2nd Separation Science Workshop
Editorial reviewed
 

Details



Abstract :
[en] In the quest to make multidimensional chromatography (MDC) a robust method for untargeted screening of small molecules, one of the key remaining challenges is reproducibility. To reach this objective, important analytical aspects, such as column dimensions and separation conditions, need to be investigated. The biggest challenge for MDC is nevertheless data processing, that is, transforming raw data into pertinent information. Evaluating data analysis methods and processing workflows requires a reference data set. In this study, we used whole stool research grade test materials (RGTMs) prepared by NIST for interlaboratory studies to develop a control data set covering sampling, analysis, and processing workflows. The RGTMs cover two diets, vegan and omnivore, and two sample formats, liquid and lyophilized. In this presentation, we will focus on the use of data produced from the RGTMs to evaluate data processing approaches. The robustness of several statistical workflows involving commercial, in-house, and open-source solutions was investigated. First, we investigated user impact on a well-established ANOVA-based workflow. The key was to evaluate the weight of human decisions on the final classification metrics and their impact on the identification of significant features. Our well-established workflow was shown to be unaffected by human decisions during data cleaning, pre-processing, and model building, as no significant output changes appeared in the study. Next, we developed and evaluated a new processing approach combining tile-based image comparison and machine learning-based feature selection. The combination of tile-based alignment and random forest classification increased robustness compared to the ANOVA-based approach: the false positive rate decreased during feature selection, and we were able to process unbalanced data sets.
Disciplines :
Chemistry
Author, co-author :
Stefanuto, Pierre-Hugues; Université de Liège - ULiège > Département de chimie (sciences) > Chimie analytique, organique et biologique
Cain, Caitlin; UW - University of Washington [US-WA]
Synovec, Robert; UW - University of Washington [US-WA]
Focant, Jean-François; Université de Liège - ULiège > Département de chimie (sciences) > Chimie analytique, organique et biologique
Gaida, Meriem; Université de Liège - ULiège > Molecular Systems (MolSys)
Language :
English
Title :
Going deeper in tile-based processing for complex GC×GC-TOFMS datasets
Publication date :
2023
Event name :
2nd Separation Science Workshop
Event date :
28/06/23 - 29/06/23
By request :
Yes
Audience :
International
Peer reviewed :
Editorial reviewed
Available on ORBi :
since 04 July 2023

Statistics


Number of views
32 (2 by ULiège)
Number of downloads
0 (0 by ULiège)
