Normalization and correction for batch effects via RUV for RNA-seq data: practical implications for Breast Cancer Research

Debit, Ahmed; Wenric, Stéphane; JOSSE, Claire; Van Steen, Kristel; Bours, Vincent

Poster (Scientific congresses and symposiums)

2017 • European Human Genetics Conference ESHG 2017

Permalink
https://hdl.handle.net/2268/233109

Files

Full Text

electronicPosterESHG_2017.pdf

Author postprint (1.28 MB)

Details

Keywords :

RNA-seq; Normalization; RUV

Abstract :

The whole transcriptome carries information about nonsense, missense, silent, in-frame and frameshift mutations, as observed at the whole-exome level, as well as about splicing and (allelic) gene-expression changes that DNA analysis misses. An important step in analysing gene expression data from RNA-seq is the detection of differential expression (DE). Several methods are available, and the choice among them is sometimes controversial. A reliable DE analysis that limits false-positive DE genes and estimates gene expression levels accurately requires a suitable normalization approach, including correction for confounders. Several normalization methods have been proposed to correct for both within-sample and between-sample biases. RUV (Removing Unwanted Variation) is one of them and has the advantage of correcting for batch effects, including potentially unknown sources of unwanted variation in gene expression. In this study, we compare RUV (RUVg using in silico negative control genes) with more commonly used RNA-seq normalization methods, such as DESeq2, edgeR, and upper-quartile (UQ) normalization, on real-life Illumina paired-end sequencing data from Estrogen-Receptor-Positive (ER+) breast cancer tissues and matched controls. The set of in silico empirical negative control genes for RUVg was defined as the least significantly DE genes from a first DE analysis performed before RUVg correction. Box plots of relative log expression (RLE) across samples and PCA plots show that RUVg performs well and stabilizes read counts across samples, with clear clustering of biological replicates.
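To make the RLE diagnostic mentioned in the abstract concrete, here is a minimal sketch in plain Python. It is not the poster's pipeline (the record does not include code, and RUVg itself is not implemented here). It only illustrates how relative log expression can be computed and how a simplified upper-quartile scaling changes it. The toy count matrix, the function names, the pseudocount of 1, and the choice to rescale by the mean upper quartile are all assumptions made for illustration.

```python
import math
import statistics


def upper_quartile_normalize(counts):
    """Scale each sample by its upper quartile (simplified UQ variant).

    counts: list of gene rows, each row a list of per-sample counts.
    Assumption: quartiles are taken over genes with at least one
    non-zero count, and samples are rescaled to the mean upper quartile.
    A sample whose upper quartile is 0 is not handled in this sketch.
    """
    n_samples = len(counts[0])
    expressed = [row for row in counts if any(c > 0 for c in row)]
    uq = [
        statistics.quantiles(
            [row[j] for row in expressed], n=4, method="inclusive"
        )[2]
        for j in range(n_samples)
    ]
    mean_uq = statistics.mean(uq)
    return [
        [row[j] * mean_uq / uq[j] for j in range(n_samples)]
        for row in counts
    ]


def relative_log_expression(counts, pseudocount=1.0):
    """Per gene: log2 count minus the median log2 count across samples."""
    logs = [[math.log2(c + pseudocount) for c in row] for row in counts]
    return [[v - statistics.median(row) for v in row] for row in logs]


def sample_medians(rle):
    """Median RLE per sample; values near 0 suggest comparable samples."""
    n_samples = len(rle[0])
    return [statistics.median(row[j] for row in rle) for j in range(n_samples)]


# Hypothetical toy data: 5 genes x 3 samples; the second sample was
# sequenced at twice the depth of the other two.
toy_counts = [
    [10, 20, 10],
    [20, 40, 20],
    [30, 60, 30],
    [40, 80, 40],
    [50, 100, 50],
]

raw_medians = sample_medians(relative_log_expression(toy_counts))
uq_medians = sample_medians(
    relative_log_expression(upper_quartile_normalize(toy_counts))
)
print("median RLE, raw counts:", raw_medians)
print("median RLE, after UQ scaling:", uq_medians)
```

In an RLE box plot, each sample's distribution should be centred on zero with a similar spread. In the toy data the deeper-sequenced sample sits above zero before scaling and returns to zero afterwards. Real batch effects are rarely a pure depth difference, which is the situation methods such as RUVg are meant to address.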

Research Center/Unit :

GIGA‐R - Giga‐Research - ULiège

Disciplines :

Engineering, computing & technology: Multidisciplinary, general & others

Author, co-author :

Debit, Ahmed ; Université de Liège - ULiège > Cancer-Human Genetics

Wenric, Stéphane ; Université de Liège - ULiège > Cancer-Human Genetics

JOSSE, Claire ; Centre Hospitalier Universitaire de Liège - CHU > Département de médecine interne > Service d'oncologie médicale

Van Steen, Kristel ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Bioinformatique

Bours, Vincent ; Université de Liège - ULiège > Département des sciences biomédicales et précliniques > Génétique humaine

Language :

English

Publication date :

May 2017

Event name :

European Human Genetics Conference ESHG 2017

Event organizer :

European Society of Human Genetics

Event place :

Copenhagen, Denmark

Event date :

from 27-05-2017 to 30-05-2017

Audience :

International

Funders :

Région wallonne

Available on ORBi :

since 21 February 2019
