PMEPA1 protein, human; Membrane Proteins; Humans; Bayes Theorem; Genotype; Phenotype; Rare Diseases/genetics; Genome-Wide Association Study/methods; Genome-Wide Association Study; Rare Diseases; Biochemistry, Genetics and Molecular Biology (all); General Biochemistry, Genetics and Molecular Biology; General Medicine
Abstract :
[en] The genetic etiologies of more than half of rare diseases remain unknown. Standardized genome sequencing and phenotyping of large patient cohorts provide an opportunity for discovering the unknown etiologies, but this depends on efficient and powerful analytical methods. We built a compact database, the 'Rareservoir', containing the rare variant genotypes and phenotypes of 77,539 participants sequenced by the 100,000 Genomes Project. We then used the Bayesian genetic association method BeviMed to infer associations between genes and each of 269 rare disease classes assigned by clinicians to the participants. We identified 241 known and 19 previously unidentified associations. We validated associations with ERG, PMEPA1 and GPR156 by searching for pedigrees in other cohorts and using bioinformatic and experimental approaches. We provide evidence that (1) loss-of-function variants in the Erythroblast Transformation Specific (ETS)-family transcription factor encoding gene ERG lead to primary lymphoedema, (2) truncating variants in the last exon of transforming growth factor-β regulator PMEPA1 result in Loeys-Dietz syndrome and (3) loss-of-function variants in GPR156 give rise to recessive congenital hearing impairment. The Rareservoir provides a lightweight, flexible and portable system for synthesizing the genetic and phenotypic data required to study rare disease cohorts with tens of thousands of participants.
Disciplines :
Genetics & genetic processes
Author, co-author :
Greene, Daniel; Department of Medicine, University of Cambridge, Cambridge, UK ; Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Genomics England Research Consortium
Pirri, Daniela; National Heart and Lung Institute, Imperial College London, London, UK
Frudd, Karen; National Heart and Lung Institute, Imperial College London, London, UK ; University College London Institute of Ophthalmology, University College London, London, UK
Sackey, Ege; Molecular and Clinical Sciences Institute, St. George's University of London, London, UK
Al-Owain, Mohammed; Department of Medical Genomics, Centre for Genomic Medicine, King Faisal Specialist Hospital & Research Centre, Riyadh, Saudi Arabia
Giese, Arnaud P J ; Department of Otorhinolaryngology Head and Neck Surgery, School of Medicine, University of Maryland, Baltimore, MD, USA
Ramzan, Khushnooda; Department of Clinical Genomics, Centre for Genomic Medicine, King Faisal Specialist Hospital & Research Centre, Riyadh, Saudi Arabia
Riaz, Sehar; Department of Otorhinolaryngology Head and Neck Surgery, School of Medicine, University of Maryland, Baltimore, MD, USA ; Department of Biochemistry and Molecular Biology, School of Medicine, University of Maryland, Baltimore, MD, USA
Yamanaka, Itaru ; Department of Bioscience and Genetics, National Cerebral and Cardiovascular Center, Osaka, Japan
Boeckx, Nele; Center for Medical Genetics, Antwerp University Hospital/University of Antwerp, Antwerp, Belgium
Thys, Chantal; Department of Cardiovascular Sciences, Center for Molecular and Vascular Biology, KU Leuven, Leuven, Belgium
Gelb, Bruce D ; Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA ; Department of Pediatrics, Icahn School of Medicine at Mount Sinai, New York, NY, USA ; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Brennan, Paul; Northern Genetics Service, Newcastle upon Tyne Hospitals National Health Service Trust International Centre for Life, Newcastle upon Tyne, UK
Hartill, Verity; Department of Clinical Genetics, Chapel Allerton Hospital, Leeds Teaching Hospitals National Health Service Trust, Leeds, UK ; Leeds Institute of Medical Research, University of Leeds, Leeds, UK
Harvengt, Julie ; Centre Hospitalier Universitaire de Liège - CHU > > Service de génétique
Kosho, Tomoki ; Department of Medical Genetics, Shinshu University School of Medicine, Nagano, Japan ; Center for Medical Genetics, Shinshu University Hospital, Nagano, Japan
Mansour, Sahar; Molecular and Clinical Sciences Institute, St. George's University of London, London, UK ; South West Thames Regional Genetics Service, St. George's University Hospitals National Health Service Foundation Trust, London, UK
Masuno, Mitsuo; Department of Medical Genetics, Kawasaki Medical School Hospital, Okayama, Japan
Ohata, Takako; Okinawa Chubu Hospital, Okinawa, Japan
Stewart, Helen; Oxford University Hospitals National Health Service Foundation Trust, Oxford, UK
Taibah, Khalid; Ear Nose and Throat Medical Centre, Riyadh, Saudi Arabia
Turner, Claire L S; Peninsula Clinical Genetics Service, Royal Devon & Exeter Hospital, Exeter, UK
Imtiaz, Faiqa; Department of Clinical Genomics, Centre for Genomic Medicine, King Faisal Specialist Hospital & Research Centre, Riyadh, Saudi Arabia
Riazuddin, Saima ; Department of Otorhinolaryngology Head and Neck Surgery, School of Medicine, University of Maryland, Baltimore, MD, USA ; Department of Biochemistry and Molecular Biology, School of Medicine, University of Maryland, Baltimore, MD, USA
Morisaki, Takayuki; Department of Bioscience and Genetics, National Cerebral and Cardiovascular Center, Osaka, Japan ; Division of Molecular Pathology and Department of Internal Medicine, Institute of Medical Science, The University of Tokyo, Tokyo, Japan
Ostergaard, Pia ; Molecular and Clinical Sciences Institute, St. George's University of London, London, UK
Loeys, Bart L ; Center for Medical Genetics, Antwerp University Hospital/University of Antwerp, Antwerp, Belgium ; Department of Human Genetics, Radboud University Medical Center, Nijmegen, the Netherlands
Morisaki, Hiroko; Department of Bioscience and Genetics, National Cerebral and Cardiovascular Center, Osaka, Japan ; Department of Medical Genetics, Sakakibara Heart Institute, Tokyo, Japan
Ahmed, Zubair M ; Department of Otorhinolaryngology Head and Neck Surgery, School of Medicine, University of Maryland, Baltimore, MD, USA ; Department of Biochemistry and Molecular Biology, School of Medicine, University of Maryland, Baltimore, MD, USA
Birdsey, Graeme M ; National Heart and Lung Institute, Imperial College London, London, UK
Freson, Kathleen; Department of Cardiovascular Sciences, Center for Molecular and Vascular Biology, KU Leuven, Leuven, Belgium
Mumford, Andrew; School of Cellular and Molecular Medicine, University of Bristol, Bristol, UK ; South West National Health Service Genomic Medicine Service Alliance, Bristol, UK
Turro, Ernest ; Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA. ernest.turro@mssm.edu ; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA. ernest.turro@mssm.edu ; Department of Haematology, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK. ernest.turro@mssm.edu ; Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA. ernest.turro@mssm.edu
This research was made possible through access to the data and findings generated by the 100,000 Genomes Project. The 100,000 Genomes Project is managed by Genomics England Limited (a wholly owned company of the Department of Health and Social Care). The 100,000 Genomes Project is funded by the National Institute for Health Research and National Health Service (NHS) England. The Wellcome Trust, Cancer Research UK and the Medical Research Council have also funded research infrastructure. The 100,000 Genomes Project uses data provided by patients and collected by the National Health Service as part of their care and support. GS was performed by Illumina at Illumina Laboratory Services and was overseen by Genomics England. We thank all NHS clinicians who have contributed clinical phenotype data to the 100,000 Genomes Rare Diseases Programme and all staff at Genomics England who have contributed to the sequencing, maintenance of the research environment and assembly of the standard bioinformatic files that were required for our analyses. We thank the participants of the rare diseases program who made this research possible. We are grateful to V. Keeley for providing access to paternal DNA (ERG), F. Elmslie for inviting a patient to the clinic (ERG) and T. Jaworek for technical assistance (GPR156). D.G. was supported by the Cambridge British Heart Foundation (BHF) Centre of Research Excellence (RE/18/1/34212) and the Wellcome Collaborative (219506/Z/19/Z). V.H. was supported by an Medical Research Council (MRC)/National Institute for Health and Care Research Clinical Academic Research Partnership (MR/V037617/1). G.M.B. and K. Frudd were funded by BHF (PG/17/33/32990). G.M.B. and D.P. were funded by BHF (PG/20/16/35047). E.S. was supported by the Swiss Federal National Fund for Scientific Research (CRSII5_177191/1). S.M. and P.O. were supported by the MRC (MR/P011543/1) and BHF (RG/17/7/33217). K. Freson was supported by Katholieke Universiteit (KU) Leuven Special Research Fund (BOF) (C14/19/096) and Research Foundation – Flanders (G072921N). Work at the University of Maryland, Baltimore was supported by the National Institute on Deafness and Other Communication Disorders/National Institutes of Health (R01DC016295 to Z.M.A.). M.A.-O., F.I. and K.R. were supported by the King Salman Center for Disability Research (85722). E.T. was supported by the Mindich Child Health and Development Institute, the Charles Bronfman Institute for Personalized Medicine and the Lowy Foundation USA.This research was made possible through access to the data and findings generated by the 100,000 Genomes Project. The 100,000 Genomes Project is managed by Genomics England Limited (a wholly owned company of the Department of Health and Social Care). The 100,000 Genomes Project is funded by the National Institute for Health Research and National Health Service (NHS) England. The Wellcome Trust, Cancer Research UK and the Medical Research Council have also funded research infrastructure. The 100,000 Genomes Project uses data provided by patients and collected by the National Health Service as part of their care and support. GS was performed by Illumina at Illumina Laboratory Services and was overseen by Genomics England. We thank all NHS clinicians who have contributed clinical phenotype data to the 100,000 Genomes Rare Diseases Programme and all staff at Genomics England who have contributed to the sequencing, maintenance of the research environment and assembly of the standard bioinformatic files that were required for our analyses. We thank the participants of the rare diseases program who made this research possible. We are grateful to V. Keeley for providing access to paternal DNA (ERG), F. Elmslie for inviting a patient to the clinic (ERG) and T. Jaworek for technical assistance (GPR156). D.G. was supported by the Cambridge British Heart Foundation (BHF) Centre of Research Excellence (RE/18/1/34212) and the Wellcome Collaborative (219506/Z/19/Z). V.H. was supported by an Medical Research Council (MRC)/National Institute for Health and Care Research Clinical Academic Research Partnership (MR/V037617/1). G.M.B. and K. Frudd were funded by BHF (PG/17/33/32990). G.M.B. and D.P. were funded by BHF (PG/20/16/35047). E.S. was supported by the Swiss Federal National Fund for Scientific Research (CRSII5_177191/1). S.M. and P.O. were supported by the MRC (MR/P011543/1) and BHF (RG/17/7/33217). K. Freson was supported by Katholieke Universiteit (KU) Leuven Special Research Fund (BOF) (C14/19/096) and Research Foundation – Flanders (G072921N). Work at the University of Maryland, Baltimore was supported by the National Institute on Deafness and Other Communication Disorders/National Institutes of Health (R01DC016295 to Z.M.A.). M.A.-O., F.I. and K.R. were supported by the King Salman Center for Disability Research (85722). E.T. was supported by the Mindich Child Health and Development Institute, the Charles Bronfman Institute for Personalized Medicine and the Lowy Foundation USA.
scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.
Bibliography
Boycott, K. M. et al. International cooperation to enable the diagnosis of all rare genetic diseases. Am. J. Hum. Genet 100, 695–705 (2017). DOI: 10.1016/j.ajhg.2017.04.003
Ferreira, C. R. The burden of rare diseases. Am. J. Med Genet A 179, 885–892 (2019). DOI: 10.1002/ajmg.a.61124
Turro, E. et al. Whole-genome sequencing of patients with rare diseases in a national health system. Nature 583, 96–102 (2020). DOI: 10.1038/s41586-020-2434-2
Wang, Q. et al. Rare variant contribution to human disease in 281,104 UK Biobank exomes. Nature 597, 527–532 (2021). DOI: 10.1038/s41586-021-03855-y
Kaplanis, J. et al. Evidence for 28 genetic disorders discovered by combining healthcare and research data. Nature 586, 757–762 (2020). DOI: 10.1038/s41586-020-2832-5
Greene, D., Richardson, S. & Turro, E. Phenotype similarity regression for identifying the genetic determinants of rare diseases. Am. J. Hum. Genet 98, 490–499 (2016). DOI: 10.1016/j.ajhg.2016.01.008
Hail Team. Hail 0.2. https://github.com/hail-is/hail (2022).
Lopez, J. et al. HGVA: the Human Genome Variation Archive. Nucleic Acids Res. 45, W189–W194 (2017). DOI: 10.1093/nar/gkx445
Greene, D., Richardson, S. & Turro, E. A fast association test for identifying pathogenic variants involved in rare diseases. Am. J. Hum. Genet 101, 104–114 (2017). DOI: 10.1016/j.ajhg.2017.05.015
Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020). DOI: 10.1038/s41586-020-2308-7
Rentzsch, P., Schubach, M., Shendure, J. & Kircher, M. CADD-Splice-improving genome-wide variant effect prediction using deep learning-derived splice scores. Genome Med. 13, 31 (2021). DOI: 10.1186/s13073-021-00835-9
Eilbeck, K. et al. The Sequence Ontology: a tool for the unification of genome annotations. Genome Biol. 6, R44 (2005). DOI: 10.1186/gb-2005-6-5-r44
Howe, K. L. et al. Ensembl 2021. Nucleic Acids Res. 49, D884–D891 (2021). DOI: 10.1093/nar/gkaa942
Martin, A. R. et al. PanelApp crowdsources expert knowledge to establish consensus diagnostic gene panels. Nat. Genet. 51, 1560–1565 (2019). DOI: 10.1038/s41588-019-0528-2
Korber, L., Schneider, H., Fleischer, N. & Maier-Wohlfart, S. No evidence for preferential X-chromosome inactivation as the main cause of divergent phenotypes in sisters with X-linked hypohidrotic ectodermal dysplasia. Orphanet J. Rare Dis. 16, 98 (2021). DOI: 10.1186/s13023-021-01735-2
Kasahara, Y. et al. Hyper-IgM syndrome with putative dominant negative mutation in activation-induced cytidine deaminase. J. Allergy Clin. Immunol. 112, 755–760 (2003). DOI: 10.1016/S0091-6749(03)01860-8
Martin-Almedina, S., Mortimer, P. S. & Ostergaard, P. Development and physiological functions of the lymphatic system: insights from human genetic studies of primary lymphedema. Physiol. Rev. 101, 1809–1871 (2021). DOI: 10.1152/physrev.00006.2020
Gordon, K. et al. Update and audit of the St George’s classification algorithm of primary lymphatic anomalies: a clinical and molecular approach to diagnosis. J. Med Genet. 57, 653–659 (2020). DOI: 10.1136/jmedgenet-2019-106084
Kalna, V. et al. The transcription factor ERG regulates super-enhancers associated with an endothelial-specific gene expression program. Circ. Res. 124, 1337–1349 (2019). DOI: 10.1161/CIRCRESAHA.118.313788
Shah, A. V., Birdsey, G. M. & Randi, A. M. Regulation of endothelial homeostasis, vascular development and angiogenesis by the transcription factor ERG. Vasc. Pharm. 86, 3–13 (2016). DOI: 10.1016/j.vph.2016.05.003
Hoesel, B. et al. Sequence-function correlations and dynamics of ERG isoforms. ERG8 is the black sheep of the family. Biochim. Biophys. Acta 1863, 205–218 (2016). DOI: 10.1016/j.bbamcr.2015.10.023
Westbury, S. K. et al. Human phenotype ontology annotation and cluster analysis to unravel genetic defects in 707 cases with unexplained bleeding and platelet disorders. Genome Med. 7, 36 (2015). DOI: 10.1186/s13073-015-0151-5
Greene, D., Richardson, S. & Turro, E. ontologyX: a suite of R packages for working with ontological data. Bioinformatics 33, 1104–1106 (2017). DOI: 10.1093/bioinformatics/btw763
Resnik, P. et al. Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural language. J. Artif. Intell. Res. 11, 95–130 (1999). DOI: 10.1613/jair.514
Ciurica, S. et al. Arterial tortuosity. Hypertension 73, 951–960 (2019). DOI: 10.1161/HYPERTENSIONAHA.118.11647
Loeys, B. L. et al. A syndrome of altered cardiovascular, craniofacial, neurocognitive and skeletal development caused by mutations in TGFBR1 or TGFBR2. Nat. Genet. 37, 275–281 (2005). DOI: 10.1038/ng1511
Szklarczyk, D. et al. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 47, D607–D613 (2019). DOI: 10.1093/nar/gky1131
Watanabe, Y. et al. TMEPAI, a transmembrane TGF-beta-inducible protein, sequesters Smad proteins from active participation in TGF-beta signaling. Mol. Cell. 37, 123–134 (2010). DOI: 10.1016/j.molcel.2009.10.028
Creamer, T. J., Bramel, E. E. & MacFarlane, E. G. Insights on the pathogenesis of aneurysm through the study of hereditary aortopathies. Genes (Basel) 12, 183 (2021). DOI: 10.3390/genes12020183
Thermann, R. et al. Binary specification of nonsense codons by splicing and cytoplasmic translation. EMBO J. 17, 3484–3494 (1998). DOI: 10.1093/emboj/17.12.3484
Sobreira, N., Schiettecatte, F., Valle, D. & Hamosh, A. GeneMatcher: a matching tool for connecting investigators with an interest in the same gene. Hum. Mutat. 36, 928–930 (2015). DOI: 10.1002/humu.22844
Ellaithy, A., Gonzalez-Maeso, J., Logothetis, D. A. & Levitz, J. Structural and biophysical mechanisms of class C G protein-coupled receptor function. Trends Biochem. Sci. 45, 1049–1064 (2020). DOI: 10.1016/j.tibs.2020.07.008
Kindt, K. S. et al. EMX2-GPR156-Gai reverses hair cell orientation in mechanosensory epithelia. Nat. Commun. 12, 2861 (2021). DOI: 10.1038/s41467-021-22997-1
Scheffer, D. I., Shen, J., Corey, D. P. & Chen, Z. Y. Gene expression by mouse inner ear hair cells during development. J. Neurosci. 35, 6366–6380 (2015). DOI: 10.1523/JNEUROSCI.5126-14.2015
Miyoshi, T. et al. Human deafness-associated variants alter the dynamics of key molecules in hair cell stereocilia F-actin cores. Hum. Genet 141, 363–382 (2022). DOI: 10.1007/s00439-021-02304-0
Smedley, D. et al. 100,000 Genomes pilot on rare-disease diagnosis in health care - preliminary report. N. Engl. J. Med. 385, 1868–1880 (2021). DOI: 10.1056/NEJMoa2035790
Thalhammer, A., Trinidad, J. C., Burlingame, A. L. & Schoepfer, R. Densin-180: revised membrane topology, domain structure and phosphorylation status. J. Neurochem. 109, 297–302 (2009). DOI: 10.1111/j.1471-4159.2009.05951.x
Chong, C. H. et al. Lrrc7 mutant mice model developmental emotional dysregulation that can be alleviated by mGluR5 allosteric modulation. Transl. Psychiatry 9, 244 (2019). DOI: 10.1038/s41398-019-0580-9
Berthouze, M., Venkataramanan, V., Li, Y. & Shenoy, S. K. The deubiquitinases USP33 and USP20 coordinate beta2 adrenergic receptor recycling and resensitization. EMBO J. 28, 1684–1696 (2009). DOI: 10.1038/emboj.2009.128
Birdsey, G. M. et al. The endothelial transcription factor ERG promotes vascular stability and growth through Wnt/Beta-catenin signaling. Dev. Cell 32, 82–96 (2015). DOI: 10.1016/j.devcel.2014.11.016
Motiejunaite, J., Amar, L. & Vidal-Petiot, E. Adrenergic receptors and cardiovascular effects of catecholamines. Ann. Endocrinol. (Paris) 82, 193–197 (2021). DOI: 10.1016/j.ando.2020.03.012
Munoz-Lasso, D. C., Roma-Mateo, C., Pallardo, F. V. & Gonzalez-Cabo, P. Much more than a scaffold: cytoskeletal proteins in neurological disorders. Cells 9, 358 (2020). DOI: 10.3390/cells9020358
Zuchero, J. B. et al. CNS myelin wrapping is driven by actin disassembly. Dev. Cell 34, 152–167 (2015). DOI: 10.1016/j.devcel.2015.06.011
DeWard, A. D., Eisenmann, K. M., Matheson, S. F. & Alberts, A. S. The role of formins in human disease. Biochim. Biophys. Acta 1803, 226–233 (2010). DOI: 10.1016/j.bbamcr.2009.11.006
Ninoyu, Y. et al. The integrity of cochlear hair cells is established and maintained through the localization of Dia1 at apical junctional complexes and stereocilia. Cell Death Dis. 11, 536 (2020). DOI: 10.1038/s41419-020-02743-z
Geppert, M. et al. The role of Rab3A in neurotransmitter release. Nature 369, 493–497 (1994). DOI: 10.1038/369493a0
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009). DOI: 10.1093/bioinformatics/btp352
Morales, J. et al. A joint NCBI and EMBL-EBI transcript set for clinical genomics and research. Nature 604, 310–315 (2022). DOI: 10.1038/s41586-022-04558-8
Genomics England Research Consortium. Variant QC for 100,000 Genomes Project merged VCF files. https://re-docs.genomicsengland.co.uk/site_qc/ (2022).
Similar publications
Sorry the service is unavailable at the moment. Please try again later.
This website uses cookies to improve user experience. Read more
Save & Close
Accept all
Decline all
Show detailsHide details
Cookie declaration
About cookies
Strictly necessary
Performance
Strictly necessary cookies allow core website functionality such as user login and account management. The website cannot be used properly without strictly necessary cookies.
This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.
Performance cookies are used to see how visitors use the website, eg. analytics cookies. Those cookies cannot be used to directly identify a certain visitor.
Used to store the attribution information, the referrer initially used to visit the website
Cookies are small text files that are placed on your computer by websites that you visit. Websites use cookies to help users navigate efficiently and perform certain functions. Cookies that are required for the website to operate properly are allowed to be set without your permission. All other cookies need to be approved before they can be set in the browser.
You can change your consent to cookie usage at any time on our Privacy Policy page.