Going deeper in tile-based processing for complex GC×GC-TOFMS datasets

Stefanuto, Pierre-Hugues; Cain, Caitlin; Synovec, Robert; Focant, Jean-François; Gaida, Meriem

No full text

Unpublished conference/Abstract (Scientific congresses and symposiums)

2023 • 2nd Separation Science Workshop

Editorial reviewed

Permalink
https://hdl.handle.net/2268/304832

Details

Abstract :

[en] In the quest to make multidimensional chromatography (MDC) a robust method for untargeted screening of small molecules, one of the key remaining challenges is reproducibility. To reach this objective, important analytical aspects, such as column dimensions and separation conditions, need to be investigated. The biggest challenge for MDC is nevertheless data processing, that is, transforming raw data into pertinent information. To evaluate data analysis methods and processing workflows, a reference data set is required. In this study, we used whole stool research grade test materials (RGTMs) prepared by NIST for interlaboratory studies to develop a control data set covering sampling, analysis, and processing workflows. The RGTMs cover two diets, vegan and omnivore, and two sample formats, liquid and lyophilized. In this presentation, we will focus on using the data produced from the RGTMs to evaluate data processing approaches. The robustness of several statistical workflows involving commercial, in-house, and open-source solutions was investigated. First, we investigated user impact on a well-established ANOVA-based workflow. The key was to evaluate the weight of human decisions on the final classification metrics and the impact on the identification of significant features. Our well-established workflow was shown to be unaffected by human decisions during data cleaning, pre-processing, and model building, as no significant output changes appeared in the study. Next, we developed and evaluated a new processing approach combining tile-based image comparison and machine learning-based feature selection. The combination of tile-based alignment and random forest classification increased robustness compared to the ANOVA-based approach. Indeed, the false positive rate decreased during feature selection, and we were able to process unbalanced data sets.
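
As a rough illustration of the tile-based idea mentioned in the abstract, the sketch below sums the signal of a 2D chromatogram inside fixed tiles and computes a one-way ANOVA F-ratio per tile between two classes. It is not the workflow presented at the workshop: the function names `tile_sums` and `f_ratio`, the 5 × 5 tile size, and the synthetic "vegan"/"omnivore" chromatograms are all assumptions made for this example, and the random forest step is not shown.

```python
import numpy as np

def tile_sums(chrom, tile_rows, tile_cols):
    """Sum the signal of a 2D chromatogram inside non-overlapping tiles."""
    r = chrom.shape[0] // tile_rows
    c = chrom.shape[1] // tile_cols
    # Drop trailing pixels that do not fill a complete tile.
    trimmed = chrom[: r * tile_rows, : c * tile_cols]
    return trimmed.reshape(r, tile_rows, c, tile_cols).sum(axis=(1, 3))

def f_ratio(class_a, class_b):
    """Per-tile one-way ANOVA F-ratio; inputs are shaped (n_samples, rows, cols)."""
    groups = [class_a, class_b]
    n_total = sum(g.shape[0] for g in groups)
    grand_mean = np.concatenate(groups).mean(axis=0)
    between = sum(
        g.shape[0] * (g.mean(axis=0) - grand_mean) ** 2 for g in groups
    ) / (len(groups) - 1)
    within = sum(
        ((g - g.mean(axis=0)) ** 2).sum(axis=0) for g in groups
    ) / (n_total - len(groups))
    return between / (within + 1e-12)  # small constant avoids division by zero

# Synthetic data (hypothetical): 4 chromatograms per class, 40 x 60 pixels.
rng = np.random.default_rng(0)
vegan = rng.normal(0.0, 1.0, size=(4, 40, 60))
omnivore = rng.normal(0.0, 1.0, size=(4, 40, 60))
for i in range(4):
    shift = i % 2  # small retention shift that stays inside one tile
    omnivore[i, 10 + shift : 14 + shift, 20:25] += 50.0

tiles_vegan = np.stack([tile_sums(x, 5, 5) for x in vegan])
tiles_omnivore = np.stack([tile_sums(x, 5, 5) for x in omnivore])

f_map = f_ratio(tiles_vegan, tiles_omnivore)
top_tile = tuple(int(k) for k in np.unravel_index(np.argmax(f_map), f_map.shape))
print("tile with the highest F-ratio:", top_tile)
```

Because the signal is summed per tile, the one-pixel shift applied to half of the synthetic samples does not change which tile carries the class difference, which is the alignment tolerance that tiling is meant to provide.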

Disciplines :

Chemistry

Author, co-author :

Stefanuto, Pierre-Hugues ; Université de Liège - ULiège > Département de chimie (sciences) > Chimie analytique, organique et biologique

Cain, Caitlin ; UW - University of Washington [US-WA]

Synovec, Robert ; UW - University of Washington [US-WA]

Focant, Jean-François ; Université de Liège - ULiège > Département de chimie (sciences) > Chimie analytique, organique et biologique

Gaida, Meriem ; Université de Liège - ULiège > Molecular Systems (MolSys)

Language :

English

Title :

Going deeper in tile-based processing for complex GC×GC-TOFMS datasets

Publication date :

2023

Event name :

2nd Separation Science Workshop

Event date :

28/06/23 - 29/06/23

By request :

Yes

Audience :

International

Peer review/Selection committee :

Editorial reviewed

Available on ORBi :

since 04 July 2023
