Article (Scientific journals)
Toward genomic prediction from whole-genome sequence data: impact of sequencing design on genotype imputation and accuracy of predictions.
Druet, Tom; Macleod, I. M.; Hayes, B. J.
2014In Heredity, 112 (1), p. 39-47
Peer Reviewed verified by ORBi
 

Files


Full Text
druet_heredity2013.pdf
Author preprint (1.34 MB)
Request a copy

All documents in ORBi are protected by a user license.

Send to



Details



Abstract :
[en] Genomic prediction from whole-genome sequence data is attractive, as the accuracy of genomic prediction is no longer bounded by extent of linkage disequilibrium between DNA markers and causal mutations affecting the trait, given the causal mutations are in the data set. A cost-effective strategy could be to sequence a small proportion of the population, and impute sequence data to the rest of the reference population. Here, we describe strategies for selecting individuals for sequencing, based on either pedigree relationships or haplotype diversity. Performance of these strategies (number of variants detected and accuracy of imputation) were evaluated in sequence data simulated through a real Belgian Blue cattle pedigree. A strategy (AHAP), which selected a subset of individuals for sequencing that maximized the number of unique haplotypes (from single-nucleotide polymorphism panel data) sequenced gave good performance across a range of variant minor allele frequencies. We then investigated the optimum number of individuals to sequence by fold coverage given a maximum total sequencing effort. At 600 total fold coverage (x 600), the optimum strategy was to sequence 75 individuals at eightfold coverage. Finally, we investigated the accuracy of genomic predictions that could be achieved. The advantage of using imputed sequence data compared with dense SNP array genotypes was highly dependent on the allele frequency spectrum of the causative mutations affecting the trait. When this followed a neutral distribution, the advantage of the imputed sequence data was small; however, when the causal mutations all had low minor allele frequencies, using the sequence data improved the accuracy of genomic prediction by up to 30%.Heredity advance online publication, 3 April 2013; doi:10.1038/hdy.2013.13.
Disciplines :
Genetics & genetic processes
Agriculture & agronomy
Author, co-author :
Druet, Tom ;  Université de Liège - ULiège > Département de productions animales > GIGA-R : Génomique animale
Macleod, I. M.
Hayes, B. J.
Language :
English
Title :
Toward genomic prediction from whole-genome sequence data: impact of sequencing design on genotype imputation and accuracy of predictions.
Publication date :
January 2014
Journal title :
Heredity
ISSN :
0018-067X
eISSN :
1365-2540
Publisher :
Nature Publishing Group, London, United Kingdom
Volume :
112
Issue :
1
Pages :
39-47
Peer reviewed :
Peer Reviewed verified by ORBi
Available on ORBi :
since 04 April 2013

Statistics


Number of views
71 (17 by ULiège)
Number of downloads
6 (6 by ULiège)

Scopus citations®
 
151
Scopus citations®
without self-citations
131
OpenCitations
 
143

Bibliography


Similar publications



Contact ORBi