Genome-wide Transcription Factor DNA Binding Sites and Gene Regulatory Networks in Clostridium thermocellum

Metadata Updated: January 22, 2025

Clostridium thermocellum is a thermophilic bacterium recognized for its natural ability to deconstruct cellulosic biomass effectively. Although there is a large body of work on the genetic engineering and physiology of this bacterium, little is known about transcriptional regulation in this organism or in thermophilic bacteria in general. This study is the first report of a high-throughput application of DNA-affinity purification sequencing (DAP-seq) to transcription factors (TFs) from a thermophile. We applied DAP-seq to >90 TFs in C. thermocellum and detected genome-wide binding sites for 11 of them. We then compiled and aligned the DNA binding sequences from these TFs to deduce the primary DNA-binding sequence motif for each TF. These binding motifs were further validated with electrophoretic mobility shift assays (EMSA) and used to identify each TF's regulatory targets in C. thermocellum. Our results led to the discovery of novel, uncharacterized TFs as well as homologues of previously studied TFs, including RexA-, LexA- and LacI-type TFs. We then used these data to reconstruct gene regulatory networks for each of the 11 TFs, which together form a global network in which some of the TFs are interconnected. Because gene regulation governs and constrains how bacteria behave, our findings shed light on the roles of TFs as delineated by their regulons, and potentially provide a means to enable rational, advanced genetic engineering of C. thermocellum and other organisms towards a desired phenotype.
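
The step of compiling aligned binding sequences into a motif can be illustrated with a minimal Python sketch. This is not the pipeline used in the study, which the description above does not specify; it only shows the general idea of counting bases per column of pre-aligned sites and reading off a consensus. The function names and the toy sequences are hypothetical and are not taken from the dataset.

```python
from collections import Counter


def position_frequency_matrix(aligned_sites):
    """Count A/C/G/T in each column of equal-length, pre-aligned sites."""
    if not aligned_sites:
        raise ValueError("need at least one site")
    width = len(aligned_sites[0])
    if any(len(site) != width for site in aligned_sites):
        raise ValueError("all sites must have the same length")
    matrix = []
    for col in range(width):
        counts = Counter(site[col].upper() for site in aligned_sites)
        matrix.append({base: counts.get(base, 0) for base in "ACGT"})
    return matrix


def consensus(matrix):
    """Most frequent base per column; ties go to the first base in 'ACGT'."""
    return "".join(max("ACGT", key=lambda b: col[b]) for col in matrix)


# Toy, made-up sites for illustration only (not real C. thermocellum data).
toy_sites = ["TTGTGA", "TTGTTA", "TTGAGA", "ATGTGA"]
pfm = position_frequency_matrix(toy_sites)
print(consensus(pfm))
```

In practice, motif discovery from DAP-seq peaks is usually done with dedicated tools that also handle unaligned sequences and background base composition; the sketch assumes the alignment has already been made.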

Access & Use Information

Public: This dataset is intended for public access and use.
License: Creative Commons Attribution

Dates

Metadata Created Date January 11, 2025
Metadata Updated Date January 22, 2025

Metadata Source

Harvested from OpenEI data.json

Additional Metadata

Resource Type Dataset
Publisher National Renewable Energy Laboratory
Maintainer
Identifier https://data.openei.org/submissions/8222
Data First Published 2021-04-19T20:19:35Z
Data Last Modified 2025-01-21T22:31:15Z
Public Access Level public
Bureau Code 019:20
Metadata Context https://openei.org/data.json
Metadata Catalog ID https://openei.org/data.json
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Data Quality True
Harvest Object Id ace71c9c-0d34-4cf6-b6ed-9a370d713a9a
Harvest Source Id 7cbf9085-0290-4e9f-bec1-91653baeddfd
Harvest Source Title OpenEI data.json
Homepage URL https://data.nrel.gov/submissions/161
License https://creativecommons.org/licenses/by/4.0/
Program Code 019:005
Projectnumber DE-AC05-00OR22725
Projecttitle Center for Bioenergy Innovation (CBI)
Source Datajson Identifier True
Source Hash 3d1984cb28838ff86f951f90e14caa75b0d7b61398e546f43fabef697fe9f063
Source Schema Version 1.1
