CRISPR-based antimicrobials; Drug design; In silico docking; RNA secondary structure; RNA tertiary structure; RNA–protein interaction; Structural biology; Anti-Bacterial Agents; RNA, Bacterial; Ribonucleases; Bacteria/genetics; Humans; Ribonucleases/genetics; Ribonucleases/metabolism; CRISPR-Cas Systems; Bacteria; Immunology; Ecology, Evolution, Behavior and Systematics; Modeling and Simulation; Biochemistry, Genetics and Molecular Biology (all); Agricultural and Biological Sciences (all); Applied Mathematics; General Agricultural and Biological Sciences; General Biochemistry, Genetics and Molecular Biology
Abstract :
[en] RNA-protein interactions are crucial for diverse biological processes. In prokaryotes, RNA-protein interactions enable adaptive immunity through CRISPR-Cas systems. These defence systems utilize CRISPR RNA (crRNA) templates acquired from past infections to destroy foreign genetic elements through crRNA-mediated nuclease activities of Cas proteins. Thanks to the programmability and specificity of CRISPR-Cas systems, CRISPR-based antimicrobials have the potential to be repurposed as new types of antibiotics. Unlike traditional antibiotics, these CRISPR-based antimicrobials can be designed to target specific bacteria and minimize detrimental effects on the human microbiome during antibacterial therapy. In this study, we explore the potential of CRISPR-based antimicrobials by optimizing the RNA-protein interactions of crRNAs and Cas13 proteins. CRISPR-Cas13 systems are unique as they degrade specific foreign RNAs using the crRNA template, which leads to non-specific RNase activities and cell cycle arrest. We show that a high proportion of the Cas13 systems have no colocalized CRISPR arrays, and the lack of direct association between crRNAs and Cas proteins may result in suboptimal RNA-protein interactions in the current tools. Here, we investigate the RNA-protein interactions of the Cas13-based systems by curating the validation dataset of Cas13 protein and CRISPR repeat pairs that are experimentally validated to interact, and the candidate dataset of CRISPR repeats that reside on the same genome as the currently known Cas13 proteins. To find optimal CRISPR-Cas13 interactions, we first validate the 3-D structure prediction of crRNAs based on their experimental structures. Next, we test a number of RNA-protein interaction programs to optimize the in silico docking of crRNAs with the Cas13 proteins. From this optimized pipeline, we find a number of candidate crRNAs that have comparable or better in silico docking with the Cas13 proteins of the current tools. This study fully automatizes the in silico optimization of RNA-protein interactions as an efficient preliminary step for designing effective CRISPR-Cas13-based antimicrobials.
Disciplines :
Mathematics Biotechnology Engineering, computing & technology: Multidisciplinary, general & others
Author, co-author :
Park, Ho-Min ; Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon, South Korea ; Department of Electronics and Information Systems, Ghent University, Ghent, Belgium
Park, Yunseol ; Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon, South Korea
Berani, Urta; Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon, South Korea
Bang, Eunkyu; Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon, South Korea
Vankerschaver, Joris ; Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon, South Korea ; Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
Van Messem, Arnout ; Université de Liège - ULiège > Département de mathématique > Statistique appliquée aux sciences
De Neve, Wesley ; Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon, South Korea ; Department of Electronics and Information Systems, Ghent University, Ghent, Belgium
Shim, Hyunjin ; Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon, South Korea. hyunjin.shim@ghent.ac.kr
Language :
English
Title :
In silico optimization of RNA-protein interactions for CRISPR-Cas13-based antimicrobials.
Crick F. Central dogma of molecular biology. Nature. 1970;227:561–3.
Sanchez de Groot N, Armaos A, Graña-Montes R, Alriquet M, Calloni G, Vabulas RM, et al. RNA structure drives interaction with proteins. Nat Commun. 2019;10:3246.
Bowman JC, Petrov AS, Frenkel-Pinter M, Penev PI, Williams LD. Root of the Tree: The significance, evolution, and origins of the ribosome. Chem Rev. 2020;120:4848–78.
Makarova KS, Grishin NV, Shabalina SA, Wolf YI, Koonin EV. A putative RNA-interference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action. Biol Direct. 2006;1:7.
Mojica FJM, Díez-Villaseñor C, García-Martínez J, Soria E. Intervening sequences of regularly spaced prokaryotic repeats derive from foreign genetic elements. J Mol Evol. 2005;60:174–82.
Cong L, Ran FA, Cox D, Lin S, Barretto R, Habib N, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339:819–23.
Jinek M, Chylinski K, Fonfara I, Hauer M, Doudna JA, Charpentier E. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science. 2012;337:816–21.
Mali P, Yang L, Esvelt KM, Aach J, Guell M, DiCarlo JE, et al. RNA-guided human genome engineering via Cas9. Science. 2013;339:823–6.
Koonin EV, Makarova KS, Wolf YI. Evolutionary genomics of defense systems in archaea and bacteria. Annu Rev Microbiol. 2017;71:233–61.
Makarova KS, Wolf YI, Alkhnbashi OS, Costa F, Shah SA, Saunders SJ, et al. An updated evolutionary classification of CRISPR–Cas systems. Nat Rev Microbiol. 2015;13:722–36.
Makarova KS, Wolf YI, Iranzo J, Shmakov SA, Alkhnbashi OS, Brouns SJJ, et al. Evolutionary classification of CRISPR–Cas systems: a burst of class 2 and derived variants. Nat Rev Microbiol. 2019;18:67–83.
Pursey E, Sünderhauf D, Gaze WH, Westra ER, van Houte S. CRISPR-Cas antimicrobials: challenges and future prospects. PLoS Pathog. 2018;14:e1006990.
Duan C, Cao H, Zhang L-H, Xu Z. Harnessing the CRISPR-Cas systems to combat antimicrobial resistance. Front Microbiol. 2021;12:716064.
Bikard D, Barrangou R. Using CRISPR-Cas systems as antimicrobials. Curr Opin Microbiol. 2017;37:155–60.
Aslam B, Rasool M, Idris A, Muzammil S, Alvi RF, Khurshid M, et al. CRISPR-Cas system: a potential alternative tool to cope antibiotic resistance. Antimicrob Resist Infect Control. 2020;9:131.
Murray CJL, Ikuta KS, Sharara F, Swetschinski L, Aguilar GR, Gray A, et al. Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis. Lancet. 2019;2022:629–55. 10.1016/s0140-6736(21)02724-0. DOI: 10.1016/s0140-6736(21)02724-0
Gordillo Altamirano FL, Barr JJ. Phage therapy in the Postantibiotic Era. Clin Microbiol Rev. 2019. 10.1128/CMR.00066-18. DOI: 10.1128/CMR.00066-18
Park H-M, Park Y, Vankerschaver J, Van Messem A, De Neve W, Shim H. Rethinking protein drug design with highly accurate structure prediction of anti-CRISPR proteins. Pharmaceuticals. 2022;15:310. 10.3390/ph15030310. DOI: 10.3390/ph15030310
Abdelkader K, Gerstmans H, Saafan A, Dishisha T, Briers Y. The preclinical and clinical progress of bacteriophages and their lytic enzymes: the parts are easier than the whole. Viruses. 2019. 10.3390/v11020096. DOI: 10.3390/v11020096
Shim H, Shivram H, Lei S, Doudna JA, Banfield JF. Diverse ATPase proteins in mobilomes constitute a large potential sink for prokaryotic host ATP. Front Microbiol. 2021;12: 691847.
Bikard D, Euler CW, Jiang W, Nussenzweig PM, Goldberg GW, Duportet X, et al. Exploiting CRISPR-Cas nucleases to produce sequence-specific antimicrobials. Nat Biotechnol. 2014;32:1146–50.
Shim H. Investigating the genomic background of CRISPR-Cas genomes for CRISPR-based antimicrobials. Evol Bioinform Online. 2022;18:11769343221103888.
Bernheim A, Bikard D, Touchon M, Rocha EPC. A matter of background: DNA repair pathways as a possible cause for the sparse distribution of CRISPR-Cas systems in bacteria. Philos Trans R Soc Lond B Biol Sci. 2019;374:20180088.
Shmakov S, Abudayyeh OO, Makarova KS, Wolf YI, Gootenberg JS, Semenova E, et al. Discovery and functional characterization of diverse class 2 CRISPR-Cas systems. Mol Cell. 2015;60:385–97.
Abudayyeh OO, Gootenberg JS, Konermann S, Joung J, Slaymaker IM, Cox DBT, et al. C2c2 is a single-component programmable RNA-guided RNA-targeting CRISPR effector. Science. 2016;353:aaf5573.
Liu L, Li X, Wang J, Wang M, Chen P, Yin M, et al. Two distant catalytic sites are responsible for C2c2 RNase activities. Cell. 2017;168:121-134.e12.
Kiga K, Tan X-E, Ibarra-Chávez R, Watanabe S, Aiba Y, Sato’o Y, et al. Development of CRISPR-Cas13a-based antimicrobials capable of sequence-specific killing of target bacteria. Nat Commun. 2020;11:2934.
Liu L, Li X, Ma J, Li Z, You L, Wang J, et al. The molecular architecture for RNA-guided RNA cleavage by Cas13a. Cell. 2017;170:714-726.e10.
Riesenberg S, Helmbrecht N, Kanis P, Maricic T, Pääbo S. Improved gRNA secondary structures allow editing of target sites resistant to CRISPR-Cas9 cleavage. Nat Commun. 2022;13:1–8.
Pourcel C, Touchon M, Villeriot N, Vernadet J-P, Couvin D, Toffano-Nioche C, et al. CRISPRCasdb a successor of CRISPRdb containing CRISPR arrays and cas genes from complete genome sequences, and tools to download and query lists of repeats and spacers. Nucleic Acids Res. 2020;48:D535–44.
Rybarczyk A, Szostak N, Antczak M, Zok T, Popenda M, Adamiak R, et al. New in silico approach to assessing RNA secondary structures with non-canonical base pairs. BMC Bioinform. 2015;16:276.
Antczak M, Popenda M, Zok T, Sarzynska J, Ratajczak T, Tomczyk K, et al. New functionality of RNAComposer: application to shape the axis of miR160 precursor structure. Acta Biochim Pol. 2016;63:737–44.
Watkins AM, Rangan R, Das R. FARFAR2: improved De Novo Rosetta prediction of complex global RNA folds. Structure. 2020;28:963-976.e6.
Cech P, Svozil D, Hoksza D. SETTER: web server for RNA structure comparison. Nucleic Acids Res. 2012;40:W42–8.
DeLano WL. Pymol: An open-source molecular graphics tool. CCP4 Newsl Protein Crystallogr. 2002.
Honorato RV, Koukos PI, Jiménez-García B, Tsaregorodtsev A, Verlato M, Giachetti A, et al. Structural biology in the clouds: the WeNMR-EOSC ecosystem. Front Mol Biosci. 2021;8: 729513.
van Zundert GCP, Rodrigues JPGL, Trellet M, Schmitz C, Kastritis PL, Karaca E, et al. The HADDOCK2.2 Web server: user-friendly integrative modeling of biomolecular complexes. J Mol Biol. 2016. 10.1016/j.jmb.2015.09.014. DOI: 10.1016/j.jmb.2015.09.014
Yan Y, Tao H, He J, Huang S-Y. The HDOCK server for integrated protein-protein docking. Nat Protoc. 2020;15:1829–52.
Rodríguez-Lumbreras LÁ, Jiménez-García B, Fernández-Recio J. pyDockDNA: a new approach for protein-DNA docking. Barcelona Supercomputing Center. 2017. Available: https://upcommons.upc.edu/handle/2117/107868
Kappel K, Das R. Sampling native-like structures of RNA–protein complexes through Rosetta folding and docking. Structure. 2019;27:140-151.e5.
Torchala M, Moal IH, Chaleil RAG, Fernandez-Recio J, Bates PA. SwarmDock: a server for flexible protein–protein docking. Bioinformatics. 2013;29:807–9.
Varadi M, Kosol S, Lebrun P, Valentini E, Blackledge M, Dunker AK, et al. pE-DB: a database of structural ensembles of intrinsically disordered and of unfolded proteins. Nucleic Acids Res. 2014;42:D326–35.
Peterson LX, Roy A, Christoffer C, Terashi G, Kihara D. Modeling disordered protein interactions from biophysical principles. PLoS Comput Biol. 2017;13: e1005485.
Lensink MF, Wodak SJ. Docking, scoring, and affinity prediction in CAPRI. Proteins. 2013;81:2082–95.
van Dijk M, van Dijk ADJ, Hsu V, Boelens R, Bonvin AMJJ. Information-driven protein-DNA docking using HADDOCK: it is a matter of flexibility. Nucleic Acids Res. 2006;34:3317–25.
Basu S, Wallner B. DockQ: a quality measure for protein–protein docking models. PLoS ONE. 2016;11: e0161879.
Bock H-H. Clustering Methods: A History of k-Means Algorithms. In: Brito P, Cucumel G, Bertrand P, de Carvalho F, editors. Selected Contributions in Data Analysis and Classification. Berlin, Heidelberg: Springer Berlin Heidelberg; 2007. pp. 161–172.
Knott GJ, East-Seletsky A, Cofsky JC, Holton JM, Charles E, O’Connell MR, et al. Guide-bound structures of an RNA-targeting A-cleaving CRISPR-Cas13a enzyme. Nat Struct Mol Biol. 2017;24:825–33.
Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–9.
Shim H. Feature learning of virus genome evolution with the nucleotide skip-gram neural network. Evol Bioinform Online. 2019;15:1176934318821072.
Zuallaert J, Godin F, Kim M, Soete A, Saeys Y, De Neve W. SpliceRover: interpretable convolutional neural networks for improved splice site prediction. Bioinformatics. 2018;34:4180–8.
Lensink MF, Brysbaert G, Mauri T, Nadzirin N, Velankar S, Chaleil RAG, et al. Prediction of protein assemblies, the next frontier: The CASP14-CAPRI experiment. Proteins. 2021;89:1800–23.
Crawley AB, Henriksen JR, Barrangou R. CRISPRdisco: an automated pipeline for the discovery and analysis of CRISPR-Cas systems. CRISPR J. 2018;1:171–81.
Deng L, Yang W, Liu H. PredPRBA: prediction of protein–RNA binding affinity using gradient boosted regression trees. Front Genet. 2019;2(10):637.