[en] Inbreeding results from the mating of related individuals and has negative consequences because it brings together deleterious variants in one individual. Genomic estimates of the inbreeding coefficients are preferred to pedigree-based estimators as they measure the realized inbreeding levels and they are more robust to pedigree errors. Several methods identifying homozygous-by-descent (HBD) segments with hidden Markov models (HMM) have been recently developed and are particularly valuable when the information is degraded or heterogeneous (e.g., low-fold sequencing, low marker density, heterogeneous genotype quality or variable marker spacing). We previously developed a multiple HBD class HMM where HBD segments are classified in different groups based on their length (e.g., recent versus old HBD segments) but we recently observed that for high inbreeding levels with many HBD segments, the estimated contributions might be biased towards more recent classes (i.e., associated with large HBD segments) although the overall estimated level of inbreeding remained unbiased. We herein propose a new model in which the HBD classification is modelled in successive nested levels with decreasing expected HBD segment lengths, the underlying exponential rates being directly related to the number of generations to the common ancestor. The non-HBD classes are now modelled as a mixture of HBD segments from later generations and shorter non-HBD segments (i.e., both with higher rates). The new model has improved statistical properties and performs better on simulated data compared to our previous version. We also show that the parameters of the model are easier to interpret and that the model is more robust to the choice of the number of classes. Overall, the new model results in an improved partitioning of inbreeding in different HBD classes and should be preferred.
Disciplines :
Genetics & genetic processes
Author, co-author :
Druet, Tom ; Université de Liège - ULiège > GIGA > GIGA Medical Genomics - Unit of Animal Genomics
Gautier, Mathieu ; INRAE, UMR CBGP (INRAE-IRD-Cirad-Montpellier SupAgro), Montferrier-sur-Lez, France
Language :
English
Title :
A hidden Markov model to estimate homozygous-by-descent probabilities associated with nested layers of ancestors.
Publication date :
2022
Journal title :
Theoretical Population Biology
ISSN :
0040-5809
eISSN :
1096-0325
Publisher :
Elsevier BV, United States
Volume :
145
Pages :
38-51
Peer reviewed :
Peer Reviewed verified by ORBi
Tags :
CÉCI : Consortium des Équipements de Calcul Intensif
scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.
Bibliography
Abney, M., Ober, C., McPeek, M.S., Quantitative-trait homozygosity and association mapping and empirical genomewide significance in large, complex pedigrees: fasting serum-insulin level in the Hutterites. Am. J. Hum. Genet. 70:4 (2002), 920–934.
Alemu, S.W., Kadri, N.K., Harland, C., Faux, P., Charlier, C., Caballero, A., Druet, T., An evaluation of inbreeding measures using a whole-genome sequenced cattle pedigree. Heredity 126 (2021), 410–423.
Bertrand, A.R., Kadri, N.K., Flori, L., Gautier, M., Druet, T., RZooRoH: An r package to characterize individual genomic autozygosity and identify homozygous-by-descent segments. Methods Ecol. Evol. 10:6 (2019), 860–866.
Broman, K.W., Weber, J.L., Long homozygous chromosomal segments in reference families from the centre d'Etude du polymorphisme humain. Am. J. Hum. Genet. 65:6 (1999), 1493–1500.
Ceballos, F.C., Joshi, P.K., Clark, D.W., Ramsay, M., Wilson, J.F., Runs of homozygosity: windows into population history and trait architecture. Nature Rev. Genet., 19(4), 2018, 220.
Crow, J.F., Kimura, M., et al. An Introduction to Population Genetics Theory. 1970, Harper & Row, Publishers, New York, Evanston and London.
Druet, T., Gautier, M., A model-based approach to characterize individual inbreeding at both global and local genomic scales. Mol. Ecol. 26:20 (2017), 5820–5841.
Druet, T., Oleński, K., Flori, L., Bertrand, A.R., Olech, W., Tokarska, M., Kaminski, S., Gautier, M., Genomic footprints of recovery in the European bison. J. Heredity 111:2 (2020), 194–203.
Kirin, M., McQuillan, R., Franklin, C.S., Campbell, H., McKeigue, P.M., Wilson, J.F., Genomic runs of homozygosity record population history and consanguinity. PLoS One, 5, 2010, e13996.
Leutenegger, A.-L., Labalme, A., Génin, E., Toutain, A., Steichen, E., Clerget-Darpoux, F., Edery, P., Using genomic inbreeding coefficient estimates for homozygosity mapping of rare recessive traits: application to taybi-linder syndrome. Am. J. Hum. Genet. 79:1 (2006), 62–66.
Leutenegger, A.L., Prum, B., Genin, E., Verny, C., Lemainque, A., Clerget-Darpoux, F., Thompson, E.A., Estimation of the inbreeding coefficient through use of genomic data. Am. J. Hum. Genet. 73:3 (2003), 516–523.
Magi, A., Tattini, L., Palombo, F., Benelli, M., Gialluisi, A., Giusti, B., Abbate, R., Seri, M., Gensini, G.F., Romeo, G., et al. H 3 m 2: detection of runs of homozygosity from whole-exome sequencing data. Bioinformatics 30:20 (2014), 2852–2859.
McQuillan, R., Leutenegger, A.-L., Abdel-Rahman, R., Franklin, C.S., Pericic, M., et al. Runs of homozygosity in European populations. Am. J. Hum. Genet. 83 (2008), 359–372.
Narasimhan, V., Danecek, P., Scally, A., Xue, Y., Tyler-Smith, C., Durbin, R., BCFtools/RoH: A hidden Markov model approach for detecting autozygosity from next-generation sequencing data. Bioinformatics 32 (2016), 1749–1751.
Palamara, P.F., ARGON: fast, whole-genome simulation of the discrete time wright-fisher process. Bioinformatics 32:19 (2016), 3032–3034.
Pemberton, T.J., Absher, D., Feldman, M.W., Myers, R.M., Rosenberg, N.A., Li, J.Z., Genomic patterns of homozygosity in worldwide human populations. Am. J. Hum. Genet. 91 (2012), 275–292.
Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M.A., Bender, D., Maller, J., Sklar, P., de Bakker, P.I., Daly, M.J., Sham, P.C., PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81:3 (2007), 559–575.
R Core Team, S., R: A Language and Environment for Statistical Computing. 2013, R Foundation for Statistical Computing, Vienna, Austria ISBN: 3-900051-07-0.
Rabiner, L.R., 1989. A tutorial on hidden Markov models and selected applications in speech recognition. In: PROCEEDINGS of the IEEE. pp. 257–286.
Renaud, G., Hanghøj, K., Korneliussen, T.S., Willerslev, E., Orlando, L., Joint estimates of heterozygosity and runs of homozygosity for modern and ancient samples. Genetics 212:3 (2019), 587–614.
Ringbauer, H., Novembre, J., Steinrücken, M., Parental relatedness through time revealed by runs of homozygosity in ancient DNA. Nature Commun. 12:1 (2021), 1–11.
Solé, M., Gori, A.-S., Faux, P., Bertrand, A., Farnir, F., Gautier, M., Druet, T., Age-based partitioning of individual genomic inbreeding levels in belgian blue cattle. Genet. Sel. Evol., 49(92), 2017.
Szpiech, Z.A., Blant, A., Pemberton, T.J., GARLIC: Genomic autozygosity regions likelihood-based inference and classification. Bioinformatics 33:13 (2017), 2059–2062.
Vieira, F.G., Albrechtsen, A., Nielsen, R., Estimating IBD tracts from low coverage NGS data. Bioinformatics 32 (2016), 2096–2102.
Wang, S., Haynes, C., Barany, F., Ott, J., Genome-wide autozygosity mapping in human populations. Genet. Epidemiol. 33:2 (2009), 172–180.
Weir, B.S., Anderson, A.D., Hepler, A.B., Genetic relatedness analysis: modern data and new challenges. Nature Rev. Genet. 7:10 (2006), 771–780.
Zucchini, W., MacDonald, I., Hidden Markov Models for Time Series Monographs on Statistics and Applied Probability, vol. 110, 2009, CRC Press.
This website uses cookies to improve user experience. Read more
Save & Close
Accept all
Decline all
Show detailsHide details
Cookie declaration
About cookies
Strictly necessary
Performance
Strictly necessary cookies allow core website functionality such as user login and account management. The website cannot be used properly without strictly necessary cookies.
This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.
Performance cookies are used to see how visitors use the website, eg. analytics cookies. Those cookies cannot be used to directly identify a certain visitor.
Used to store the attribution information, the referrer initially used to visit the website
Cookies are small text files that are placed on your computer by websites that you visit. Websites use cookies to help users navigate efficiently and perform certain functions. Cookies that are required for the website to operate properly are allowed to be set without your permission. All other cookies need to be approved before they can be set in the browser.
You can change your consent to cookie usage at any time on our Privacy Policy page.