Coding region: Difference between revisions

Content deleted Content added
Monkbot (talk | contribs)
m Regulation: Task 17: replace to-be-deprecated: |name-list-format= (1× replaced; usage: 1 of 5);
Monkbot (talk | contribs)
m Task 18 (cosmetic): eval 23 templates: del empty params (3×); del |url-status= (1×); cvt lang vals (1×);
Line 23:
In [[DNA]], the coding region is flanked by the [[Promoter (genetics)|promoter sequence]] on the 5' end of the [[template strand]] and the termination sequence on the 3' end. During [[Transcription (biology)|transcription]], the [[RNA Polymerase|RNA Polymerase (RNAP)]] binds to the promoter sequence and moves along the template strand to the coding region. RNAP then adds RNA [[nucleotide]]s complementary to the coding region in order to form the [[mRNA]], substituting [[uracil]] in place of [[thymine]].<ref name=":2">Overview of transcription. (n.d.). Retrieved from <nowiki>https://www.khanacademy.org/science/biology/gene-expression-central-dogma/transcription-of-dna-into-rna/a/overview-of-transcription</nowiki>.</ref> This continues until the RNAP reaches the termination sequence.<ref name=":2" />
 
After transcription and maturation, the [[mature mRNA]] formed encompasses multiple parts important for its eventual translation into [[protein]]. The coding region in an mRNA is flanked by the [[Five prime untranslated region|5' untranslated region]] (5'-UTR) and [[Three prime untranslated region|3' untranslated region]] (3'-UTR),<ref name=":12"/> the [[Five-prime cap|5' cap]], and [[Poly a tail|Poly-A tail]]. During [[Translation (biology)|translation]], the [[ribosome]] facilitates the attachment of the [[Transfer RNA|tRNAs]] to the coding region, 3 nucleotides at a time ([[codons]]).<ref>{{Cite web|url=https://www.nature.com/scitable/topicpage/translation-dna-to-mrna-to-protein-393/|title=Translation: DNA to mRNA to Protein|last=Clancy|first=Suzanne|date=2008|website=Scitable: By Nature Education|url-status=live|archive-url=|archive-date=|access-date=}}</ref> The tRNAs transfer their associated [[amino acid]]s to the growing [[polypeptide]] chain, eventually forming the protein defined in the initial DNA coding region.
[[File:Mature_mRNA.png|thumb|413x413px|The coding region (teal) is flanked by untranslated regions, the 5' cap, and the poly(A) tail which together form the '''mature mRNA'''.<ref>{{Citation|last=Plociam|title=English: The structure of a mature eukaryotic mRNA. A fully processed mRNA includes the 5' cap, 5' UTR, coding region, 3' UTR, and poly(A) tail.|date=2005-08-08|url=https://commons.wikimedia.org/wiki/File:Mature_mRNA.png|access-date=2019-11-19}}</ref>]]
 
Line 54:
While identification of [[open reading frames]] within a DNA sequence is straightforward, identifying coding sequences is not, because the cell translates only a subset of all open reading frames to proteins.<ref>{{cite journal | vauthors = Furuno M, Kasukawa T, Saito R, Adachi J, Suzuki H, Baldarelli R, Hayashizaki Y, Okazaki Y | display-authors = 6 | title = CDS annotation in full-length cDNA sequence | journal = Genome Research | volume = 13 | issue = 6B | pages = 1478–87 | date = June 2003 | pmid = 12819146 | pmc = 403693 | doi = 10.1101/gr.1060303 | publisher = Cold Spring Harbor Laboratory Press }}</ref> Currently CDS prediction uses sampling and sequencing of mRNA from cells, although there is still the problem of determining which parts of a given mRNA are actually translated to protein. CDS prediction is a subset of [[gene prediction]], the latter also including prediction of DNA sequences that code not only for protein but also for other functional elements such as RNA genes and regulatory sequences.
 
In both [[prokaryote]]s and [[eukaryote]]s, [[Overlapping gene|gene overlapping]] occurs relatively often in both DNA and RNA viruses as an evolutionary advantage to reduce genome size while retaining the ability to produce various proteins from the available coding regions.<ref>{{cite journal | vauthors = Rogozin IB, Spiridonov AN, Sorokin AV, Wolf YI, Jordan IK, Tatusov RL, Koonin EV | title = Purifying and directional selection in overlapping prokaryotic genes | language = Englishen | journal = Trends in Genetics | volume = 18 | issue = 5 | pages = 228–32 | date = May 2002 | pmid = 12047938 | doi = 10.1016/S0168-9525(02)02649-5 | url = https://www.cell.com/trends/genetics/abstract/S0168-9525(02)02649-5 }}</ref><ref>{{cite journal | vauthors = Chirico N, Vianelli A, Belshaw R | title = Why genes overlap in viruses | journal = Proceedings. Biological Sciences | volume = 277 | issue = 1701 | pages = 3809–17 | date = December 2010 | pmid = 20610432 | pmc = 2992710 | doi = 10.1098/rspb.2010.1052 }}</ref> For both DNA and RNA, [[Sequence alignment#Pairwise alignment|pairwise alignments]] can detect overlapping coding regions, including short [[open reading frame]]s in viruses, but would require a known coding strand to compare the potential overlapping coding strand with.<ref>{{cite journal | vauthors = Firth AE, Brown CM | title = Detecting overlapping coding sequences with pairwise alignments | journal = Bioinformatics | volume = 21 | issue = 3 | pages = 282–92 | date = February 2005 | pmid = 15347574 | doi = 10.1093/bioinformatics/bti007 | url = https://academic.oup.com/bioinformatics/article/21/3/282/237775 | doi-access = free }}</ref> An alternative method using single genome sequences would not require multiple genome sequences to execute comparisons but would require at least 50 nucleotides overlapping in order to be sensitive.<ref>{{cite journal | vauthors = Schlub TE, Buchmann JP, Holmes EC | title = A Simple Method to Detect Candidate Overlapping Genes in Viruses Using Single Genome Sequences | journal = Molecular Biology and Evolution | volume = 35 | issue = 10 | pages = 2572–2581 | date = October 2018 | pmid = 30099499 | pmc = 6188560 | doi = 10.1093/molbev/msy155 | editor-first = Harmit | editor-last = Malik }}</ref>
 
== See also ==