Algorithms; HTLV-I Infections/genetics/virology; Human T-lymphotropic virus 1; Humans; Virus Integration
Abstract :
[en] MOTIVATION: The relative abundance of retroviral insertions in a host genome is important in understanding the persistence and pathogenesis of both natural retroviral infections and retroviral gene therapy vectors. It could be estimated from a sample of cells if only the host genomic sites of retroviral insertions could be directly counted. When host genomic DNA is randomly broken via sonication and then amplified, amplicons of varying lengths are produced. The number of unique lengths of amplicons of an insertion site tends to increase according to its abundance, providing a basis for estimating relative abundance. However, as abundance increases amplicons of the same length arise by chance leading to a non-linear relation between the number of unique lengths and relative abundance. The difficulty in calibrating this relation is compounded by sample-specific variations in the relative frequencies of clones of each length. RESULTS: A likelihood function is proposed for the discrete lengths observed in each of a collection of insertion sites and is maximized with a hybrid expectation-maximization algorithm. Patient data illustrate the method and simulations show that relative abundance can be estimated with little bias, but that variation in highly abundant sites can be large. In replicated patient samples, variation exceeds what the model implies-requiring adjustment as in Efron (2004) or using jackknife standard errors. Consequently, it is advantageous to collect replicate samples to strengthen inferences about relative abundance.
scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.
Bibliography
Aird, D. et al. (2011) Analyzing and minimizing PCR amplification bias in illumina sequencing libraries. Genome Biol., 12, R18.
Baker, S. (1994) The multinomial-poisson transformation. Statistician, 43, 495-504.
Brady, T. et al. (2011) A method to sequence and quantify DNA integration for monitoring outcome in gene therapy. Nucleic Acids Res, 39, e72.
Cavazzana-Calvo, M. et al. (2010) Transfusion independence and hmga2 activation after gene therapy of human [bgr]-thalassaemia. Nature, 467, 318-322.
Chao, A. (1987) Estimating the population size for capture-recapture data with unequal catchability. Biometrics, 43, 783-791.
Chao, A. and Lee, S. (1992) Estimating the number of classes via sample coverage. J. Am. Stat. Assoc., 87, 210-217.
Chao, A. and Shen, T. (2003) Nonparametric estimation of Shannons index of diversity when there are unseen species in sample. Environ. Ecol. Stat., 10, 429-443.
De Boor, C. (2001) A Practical Guide to Splines, vol. 27. Springer, New York.
Deichmann, A. et al. (2007) Vector integration is nonrandom and clustered and influences the fate of lymphopoiesis in scid-x1 gene therapy. J. Clin. Investig., 117, 2232.
Dempster, A. et al. (1977) Maximum likelihood from incomplete data via the em algorithm. J. R. Stat. Soc. Ser. B, 39, 1-38.
Efron, B. (2004) Large-scale simultaneous hypothesis testing. J. Am. Stat. Assoc., 99, 96-104.
Feller, W. (1945) On the normal approximation to the binomial distribution. Ann. Math. Stat., 16, 319-329.
Finzi, D. et al. (1997) Identification of a reservoir for HIV-1 in patients on highly active antiretroviral therapy. Science, 278, 1300.
Gabriel, R. et al. (2009). Comprehensive genomic access to vector integration in clinical gene therapy. Nat. Med., 15, 1436.
Gillet, N. A. et al. (2011) The host genomic environment of the provirus determines the abundance of HTLV-1-infected T-cell clones. Blood, 117, 3113-3122.
Hacein-Bey-Abina, S. et al. (2003)Aserious adverse event after successful gene therapy for x-linked severe combined immunodeficiency. N. Engl. J. Med., 348, 256.
Hacein-Bey-Abina, S. et al. (2008) Insertional oncogenesis in 4 patients after retrovirusmediated gene therapy of scid-x1. J. Clin. Investig., 118, 3142.
Hacein-Bey-Abina, S. et al. (2010) Efficacy of gene therapy for x-linked severe combined immunodeficiency. N. Engl. J. Med., 363, 364.
Han, Y. et al. (2007) Experimental approaches to the study of HIV-1 latency. Nat. Rev. Microbiol., 5, 106.
Meekings, K. N. et al. (2008) HTLV-1 integration into transcriptionally active genomic regions is associated with proviral expression and with ham/tsp. PLoS Pathogens, 4, e1000027.
Miller, R. (1974) The jackknife-a review. Biometrika, 61, 1.
Mitchell, R. S. et al. (2004) Retroviral DNA integration: ASLV, HIV, and MLV show distinct target site preferences. PLoS Biol., 2, e234.
Schmidt, M. et al. (2003) Clonality analysis after retroviral-mediated gene transfer to CD34+ cells from the cord blood of ADA-deficient SCID neonates. Nat. Med., 9, 468.
Schroder, A. R. W. et al. (2002) HIV-1 integration in the human genome favors active genes and local hotspots. Cell, 110, 529.
Wang, G. P. et al. (2007) HIV integration site selection: analysis by massively parallel pyrosequencing reveals association with epigenetic modifications. Genome Res., 17, 1194.
Wang, G. P. et al. (2008) DNA bar coding and pyrosequencing to analyze adverse events in therapeutic gene transfer. Nucleic Acids Res., 36, e49.
Wang, G. P. et al. (2010) Dynamics of gene-modified progenitor cells analyzed by tracking retroviral integration sites in a human scid-x1 gene therapy trial. Blood, 115, 4356-4366.
Wu, X. et al. (2003) Transcription start regions in the human genome are favored targets for MLV integration. Science, 300, 1751.
This website uses cookies to improve user experience. Read more
Save & Close
Accept all
Decline all
Show detailsHide details
Cookie declaration
About cookies
Strictly necessary
Performance
Strictly necessary cookies allow core website functionality such as user login and account management. The website cannot be used properly without strictly necessary cookies.
This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.
Performance cookies are used to see how visitors use the website, eg. analytics cookies. Those cookies cannot be used to directly identify a certain visitor.
Used to store the attribution information, the referrer initially used to visit the website
Cookies are small text files that are placed on your computer by websites that you visit. Websites use cookies to help users navigate efficiently and perform certain functions. Cookies that are required for the website to operate properly are allowed to be set without your permission. All other cookies need to be approved before they can be set in the browser.
You can change your consent to cookie usage at any time on our Privacy Policy page.