Bacterial genomes contain thousands of biosynthetic gene clusters (BGCs) responsible for the production of structurally diverse natural products with applications in medicine, agriculture, and biotechnology. Expression of these BGCs is tightly regulated by transcription factors (TFs) responding to environmental cues, yet predicting which TFs regulate specific BGCs remains challenging. In particular, TF binding sites (TFBSs) within BGCs often diverge from canonical motifs, limiting the effectiveness of standard motif-scanning approaches and hindering systematic exploration of BGC regulation. Here, we present COMMBAT (COnditions for Microbial Metabolite Biosynthesis Activated Transcription), a framework for large-scale prediction of TF-BGC regulatory interactions across bacterial genomes. COMMBAT integrates motif matching with genomic context and gene function information to predict functional TFBSs. The COMMBAT web platform (https://www.commbat.uliege.be) enables users to (i) identify BGCs potentially regulated by a given TF, and (ii) predict candidate TFs that control a specific BGC. With over 4000 TF position weight matrices from four public repositories and more than 400 000 BGCs from MIBiG and antiSMASH DB, COMMBAT provides a scalable resource to predict regulatory inputs and to guide and prioritize culture conditions and genetic engineering strategies for natural product discovery.
Disciplines :
Microbiology
Author, co-author :
Ribeiro Monteiro, Silvia ; Université de Liège - ULiège > Département des sciences de la vie > Centre d'Ingénierie des Protéines (CIP)
Rigolet, Augustin ; Université de Liège - ULiège > Département GxABT > Microbial technologies ; Molecular Biotechnology, Institute of Biology, Leiden University, Sylviusweg 72, 2333 BE Leiden, The Netherlands
Kerdel, Yasmine ; Université de Liège - ULiège > Département des sciences de la vie > Centre d'Ingénierie des Protéines (CIP)
Henry, Matthias ; InBioS-Center for Protein Engineering, University of Liège, Institut de Chimie, Liège B-4000, Belgium
Augustijn, Hannah E ; Molecular Biotechnology, Institute of Biology, Leiden University, Sylviusweg 72, 2333 BE Leiden, The Netherlands ; Bioinformatics Group, Wageningen University & Research, Droevendaalsesteeg 1, 6708 PB Wageningen, The Netherlands
Medema, Marnix H ; Bioinformatics Group, Wageningen University & Research, Droevendaalsesteeg 1, 6708 PB Wageningen, The Netherlands
van Wezel, Gilles P ; Molecular Biotechnology, Institute of Biology, Leiden University, Sylviusweg 72, 2333 BE Leiden, The Netherlands