Non-coding DNA: Difference between revisions

Content deleted Content added
tweaked
Tags: Mobile edit Mobile app edit iOS app edit
Line 37:
[[Cis-regulatory element|Regulatory elements]] are sites that control the [[Transcription (genetics)|transcription]] of a nearby gene. They are almost always sequences where [[transcription factor]]s bind to DNA and these transcription factors can either activate transcription (activators) or repress transcription (repressors). Regulatory elements were discovered in the 1960s and their general characteristics were worked out in the 1970s by studying specific transcription factors in bacteria and [[bacteriophage]].{{citation needed|date=June 2022}}
 
Promoters and regulatory sequences represent an abundant class of noncoding DNA but they mostly consist of a collection of relatively short sequences so they don'tdo not take up a very large fraction of the genome. The exact amount of regulatory DNA in mammalian genome is unclear because it is difficult to distinguish between spurious transcription factor binding sites and those that are functional. The binding characteristics of typical [[DNA-binding protein]]s were characterized in the 1970s and the biochemical properties of transcription factors predict that in cells with large genomes the majority of binding sites will be fortuitous and not biologiacally functional.{{citation needed|date=June 2022}}
 
Many regulatory sequences occur near promoters, usually upstream of the transcription start site of the gene. Some occur within a gene and a few are located downstream of the transcription termination site. In eukaryotes, there are some regulatory sequences that are located at a considerable distance from the promoter region. These distant regulatory sequences are often called [[Enhancer (genetics)|enhancers]] but there is no rigorous definition of enhancer that distinguishes it from other transcription factor binding sites.<ref>{{cite journal | vauthors = Compe E, Egly JM | title = The Long Road to Understanding RNAPII Transcription Initiation and Related Syndromes | journal = Annual Review of Biochemistry | volume = 90 | pages = 193–219 | date = 2021 | doi = 10.1146/annurev-biochem-090220-112253| pmid = 34153211 | s2cid = 235595550 }}</ref><ref>{{cite journal | vauthors = Visel A, Rubin EM, Pennacchio LA | title = Genomic views of distant-acting enhancers | journal = Nature | volume = 461 | issue = 7261 | pages = 199–205 | date = September 2009 | pmid = 19741700 | pmc = 2923221 | doi = 10.1038/nature08451 | author-link3 = Len A. Pennacchio | bibcode = 2009Natur.461..199V }}</ref>
Line 81:
{{Main|Pseudogene}}
 
Pseudogenes are mostly former genes that have become non-functional due to mutation but the term also refers to inactive DNA sequences that are derived from RNAs produced by functional genes ([[Pseudogene|processed pseudogenes]]). Pseudogenes are only a small fraction of noncoding DNA in prokaryotic genomes because they are eliminated by negative selection. In some eukaryotes, however, pseudogenes can accumulate because selection isn'tis not powerful enough to eliminate them (see [[Nearly neutral theory of molecular evolution]]).
 
The human genome contains about 15,000 pseudogenes derived from protein-coding genes and an unknown number derived from noncoding genes.<ref>{{ cite web | url = https://useast.ensembl.org/Homo_sapiens/Info/Annotation | title = Ensemble Human reference genome GRCh38.p13}}</ref> They may cover a substantial fraction of the genome (~5%) since many of them contain former intron sequences.