Examine individual changes

This page allows you to examine the variables generated by the Edit Filter for an individual change.

Variables generated for this change

Variable	Value
Name of the user account (`user_name`)	'209.55.111.195'
Page ID (`page_id`)	44284
Page namespace (`page_namespace`)	0
Page title without namespace (`page_title`)	'Noncoding DNA'
Full page title (`page_prefixedtitle`)	'Noncoding DNA'
Action (`action`)	'edit'
Edit summary/reason (`summary`)	''
Whether or not the edit is marked as minor (no longer in use) (`minor_edit`)	false
Old page wikitext, before the edit (`old_wikitext`)	'In [[genomics]] and related disciplines, '''noncoding DNA''' sequences are components of an organism's [[DNA]] that do not [[genetic code\|encode]] [[protein]] sequences. Some noncoding DNA is [[Transcription (genetics)\|transcribed]] into functional [[noncoding RNA]] molecules (e.g. [[transfer RNA]], [[ribosomal RNA]], and [[RNA interference\|regulatory RNAs]]), while others are not transcribed or give rise to RNA transcripts of unknown function. The amount of noncoding DNA varies greatly among species. For example, about 50% of the [[human genome]] is noncoding DNA, while only about 2% of a typical [[bacterial genome size\|bacterial genome]] is noncoding DNA. Initially, a large proportion of noncoding DNA had no known biological function and was therefore sometimes referred to as "'''junk DNA'''", particularly in the lay press. Some sequences may have no biological function for the organism, such as [[endogenous retrovirus]]es. However, many types of noncoding DNA sequences do have important biological functions, including the [[Transcription (genetics)\|transcription]]al and [[translation (biology)\|translational]] [[regulatory sequence\|regulation]] of protein-coding sequences. Other noncoding sequences have likely, but as-yet undetermined, functions. (This is inferred from high levels of [[homology (biology)\|homology]] and [[Conserved sequence\|conservation]] seen in sequences that do not encode proteins but, nonetheless, appear to be under heavy [[selective pressure]].) The Encyclopedia of DNA Elements ([[ENCODE]]) project<ref name=Nature489p57>{{cite journal \|journal=Nature \| title=An integrated encyclopedia of DNA elements in the human genome \| year = 2012 \| volume = 489 \| pages = 57–74 \| doi = 10.1038/nature11247 \| author = The ENCODE Project Consortium \|issue=7414 }}</ref> reported in September 2012 that over 80% of DNA in the [[human genome]] "serves some purpose, biochemically speaking".<ref name=pennisi>{{Cite journal \| last1 = Pennisi \| first1 = E. \| title = Genomics. ENCODE project writes eulogy for junk DNA. \| journal = Science \| volume = 337 \| issue = 6099 \| pages = 1159, 1161 \| month = Sep \| year = 2012 \| doi = 10.1126/science.337.6099.1159 \| PMID = 22955811 }}</ref> == Fraction of noncoding genomic DNA == The amount of total genomic DNA varies widely between organisms, and the proportion of coding and noncoding DNA within these genomes varies greatly as well. More than 98% of the [[human genome]] does not encode protein sequences, including most sequences within [[intron]]s and most [[intergenic region\|intergenic DNA]].<ref name="Elgar & Vavouri">{{cite journal \|author=Elgar G, Vavouri T \|title=Tuning in to the signals: noncoding sequence conservation in vertebrate genomes \|journal=Trends Genet. \|volume=24 \|issue=7 \|pages=344–52 \|year=2008 \|month=July \|pmid=18514361 \|doi=10.1016/j.tig.2008.04.005 \|url=}}</ref> While overall [[genome size]], and by extension the amount of noncoding DNA, are correlated to organism complexity, there are many exceptions. For example, the genome of the unicellular ''[[Polychaos dubium]]'' (formerly known as ''Amoeba dubia'') has been reported to contain more than 200 times the amount of DNA in humans.<ref name=Gregory>{{cite journal \|author=Gregory TR, Hebert PD \|title=The modulation of DNA content: proximate causes and ultimate consequences \|journal=Genome Res. \|volume=9 \|issue=4 \|pages=317–24 \|year=1999 \|month=April \|pmid=10207154 \|doi= 10.1101/gr.9.4.317\|url=http://www.genome.org/cgi/pmidlookup?view=long&pmid=10207154}}</ref> The [[pufferfish]] ''[[Takifugu]] rubripes'' genome is only about one eighth the size of the human genome, yet seems to have a comparable number of genes; approximately 90% of the ''Takifugu'' genome is noncoding DNA<ref name="Elgar & Vavouri"/> and most of the genome size difference appears to lie in the noncoding DNA. The extensive variation in nuclear genome size among eukaryotic species is known as the [[C-value enigma]] or C-value paradox.<ref name=Wahls>{{cite journal \| doi=10.1016/0092-8674(90)90719-U \| author=Wahls, W.P., ''et al.''\| title=Hypervariable minisatellite DNA is a hotspot for homologous recombination in human cells \| journal=Cell \| year=1990 \| pages=95–103 \| volume=60\| issue=1 \| pmid=2295091}}</ref> About 80 percent of the [[nucleotide base]]s in the human genome may be transcribed,<ref name ="DNAStudy">{{cite journal \| author=Pennisi, Elizabeth \|title =DNA Study Forces Rethink of What It Means to Be a Gene\|journal = Science\| volume = 316 \| issue = 5831 \| year = 2007 \| pages =1556–7 \|doi =10.1126/science.316.5831.1556 \|pmid =17569836}}</ref> but transcription does not necessarily imply function.<ref name ="RNApol2study">{{cite journal \| author=Struhl, Kevin \|title =Transcriptional noise and the fidelity of initiation by RNA polymerase II\|journal = Nature Structural & Molecular Biology\| volume = 14 \| issue = 2\| year = 2007 \| pages =103–105 \|doi =10.1038/nsmb0207-103 \|pmid =17277804}}</ref> ==Types of noncoding DNA sequences== {{main\|conserved non-coding sequence}} ===Noncoding functional RNA=== [[Noncoding RNA]]s are functional [[RNA]] molecules that are not translated into protein. Examples of noncoding RNA include [[ribosomal RNA]], [[transfer RNA]], [[Piwi-interacting RNA]] and [[microRNA]]. MicroRNAs are predicted to control the translational activity of approximately 30% of all protein-coding genes in [[mammals]] and may be vital components in the progression or treatment of various diseases including [[cancer]], [[cardiovascular disease]], and the [[immune system]] response to [[infection]].<ref name=pmid19030926>{{cite journal \|author=Li M, Marin-Muller C, Bharadwaj U, Chow KH, Yao Q, Chen C \|title=MicroRNAs: Control and Loss of Control in Human Physiology and Disease \|journal=World J Surg \|volume=33 \|issue=4 \|pages=667–84 \|year=2009 \|month=April \|pmid=19030926 \|pmc=2933043 \|doi=10.1007/s00268-008-9836-x \|url=}}</ref> ===''Cis''- and ''Trans''-regulatory elements=== [[Cis-regulatory element]]s are sequences that control the [[Transcription (genetics)\|transcription]] of a nearby gene. Cis-elements may be located in [[5']] or [[3']] [[untranslated region]]s or within [[intron]]s. [[Trans-regulatory element]]s control the [[Transcription (genetics)\|transcription]] of a distant gene. [[promoter (biology)\|Promoter]]s facilitate the transcription of a particular gene and are typically [[Upstream and downstream (DNA)\|upstream]] of the coding region. [[Enhancer (genetics)\|Enhancer]] sequences may also exert very distant effects on the transcription levels of genes.<ref>{{cite journal \|author=Visel A, Rubin EM, [[Len A. Pennacchio\|Pennacchio LA]] \|title=Genomic Views of Distant-Acting Enhancers \|journal=Nature \|volume=461 \|issue=7261 \|pages=199–205 \|year=2009 \|month=September \|pmid=19741700 \|pmc=2923221 \|doi=10.1038/nature08451 \|url=\|bibcode = 2009Natur.461..199V }}</ref> ===Introns=== [[Intron]]s are non-coding sections of a gene, transcribed into the [[precursor mRNA]] sequence, but ultimately removed by [[RNA splicing]] during the processing to mature [[messenger RNA]]. Many introns appear to be [[mobile genetic element]]s.<ref name=intron/> Studies of [[group I intron]]s from ''[[Tetrahymena]]'' [[protozoans]] indicate that some introns appear to be selfish genetic elements, neutral to the host because they remove themselves from flanking [[exon]]s during [[RNA processing]] and do not produce an expression bias between [[allele]]s with and without the intron.<ref name=intron>{{cite journal \|author=Nielsen H, Johansen SD \|title=Group I introns: Moving in new directions \|journal=RNA Biol \|volume=6 \|issue=4 \|pages=375–83 \|year=2009 \|pmid=19667762 \|doi=10.4161/rna.6.4.9334 \|url=http://www.landesbioscience.com/journals/rna/abstract.php?id=9334}}</ref> Some introns appear to have significant biological function, possibly through [[ribozyme]] functionality that may regulate [[tRNA]] and [[rRNA]] activity as well as protein-coding gene expression, evident in hosts that have become dependent on such introns over long periods of time; for example, the ''trnL-intron'' is found in all [[green plant]]s and appears to have been [[Vertical gene transfer\|vertically inherited]] for several billions of years, including more than a billion years within [[chloroplast]]s and an additional 2–3 billion years prior in the [[cyanobacteria]]l ancestors of chloroplasts.<ref name=intron/> ===Pseudogenes=== [[Pseudogene]]s are DNA sequences, related to known [[gene]]s, that have lost their protein-coding ability or are otherwise no longer [[Gene expression\|expressed]] in the cell. Pseudogenes arise from retrotransposition or genomic duplication of functional genes, and become "genomic fossils" that are nonfunctional due to [[mutation]]s that prevent the [[Transcription (genetics)\|transcription]] of the gene, such as within the gene promoter region, or fatally alter the [[translation (biology)\|translation]] of the gene, such as premature [[stop codon]]s or [[Translational frameshift\|frameshift]]s.<ref name=pmid17568002>{{cite journal \|author=Zheng D, Frankish A, Baertsch R, ''et al.'' \|title=Pseudogenes in the ENCODE regions: Consensus annotation, analysis of transcription, and evolution \|journal=Genome Res. \|volume=17 \|issue=6 \|pages=839–51 \|year=2007 \|month=June \|pmid=17568002 \|pmc=1891343 \|doi=10.1101/gr.5586307 \|url=}}</ref> Pseudogenes resulting from the retrotransposition of an RNA intermediate are known as processed pseudogenes; pseudogenes that arise from the genomic remains of [[gene duplication\|duplicated genes]] or residues of inactivated genes are nonprocessed pseudogenes.<ref name=pmid17568002/> While [[Dollo's Law]] suggests that the loss of function in pseudogenes is likely permanent, silenced genes may actually retain function for several million years and can be "reactivated" into protein-coding sequences<ref>{{cite journal \|author=Marshall CR, Raff EC, Raff RA \|title=Dollo's law and the death and resurrection of genes \|journal=Proc. Natl. Acad. Sci. U.S.A. \|volume=91 \|issue=25 \|pages=12283–7 \|year=1994 \|month=December \|pmid=7991619 \|pmc=45421 \|doi= 10.1073/pnas.91.25.12283\|bibcode = 1994PNAS...9112283M }}</ref> and a substantial number of pseudogenes are actively transcribed.<ref name=pmid17568002/><ref>{{Cite journal \| last1 = Tutar \| first1 = Y. \| title = Pseudogenes. \| journal = Comp Funct Genomics \| volume = 2012 \| issue = \| pages = 424526 \| month = \| year = 2012 \| doi = 10.1155/2012/424526 \| PMID = 22611337 \| pmc=3352212}}</ref> Because pseudogenes are presumed to change without evolutionary constraint, they can serve as a useful model of the type and frequencies of various spontaneous [[genetic mutation]]s.<ref name="Petrov & Hartl">{{cite journal \|author=Petrov DA, Hartl DL \|title=Pseudogene evolution and natural selection for a compact genome \|journal=J. Hered. \|volume=91 \|issue=3 \|pages=221–7 \|year=2000 \|pmid=10833048 \|doi= 10.1093/jhered/91.3.221\|url=http://jhered.oxfordjournals.org/cgi/pmidlookup?view=long&pmid=10833048}}</ref> ===Repeat sequences, transposons and viral elements=== [[Transposon]]s and [[retrotransposon]]s are [[mobile genetic elements]]. Retrotransposon [[Repeated sequence (DNA)\|repeated sequences]], which include [[Retrotransposon#LINEs\|long interspersed nuclear elements]] (LINEs) and [[Retrotransposon#SINEs\|short interspersed nuclear elements]] (SINEs), account for a large proportion of the genomic sequences in many species. [[Alu sequence]]s, classified as a short interspersed nuclear element, are the most abundant mobile elements in the human genome. Some examples have been found of SINEs exerting transcriptional control of some protein-encoding genes.<ref>{{cite journal \|author=Ponicsan SL, Kugel JF, Goodrich JA \|title=Genomic gems: SINE RNAs regulate mRNA production \|journal=Curr Opin Genet Dev \|volume= 20\|issue= 2\|pages= 149–55\|year=2010 \|month=February \|pmid=20176473 \|pmc=2859989 \|doi=10.1016/j.gde.2010.01.004 \|url=}}</ref><ref>{{cite journal \|author=Häsler J, Samuelsson T, Strub K \|title=Useful 'junk': Alu RNAs in the human transcriptome \|journal=Cell. Mol. Life Sci. \|volume=64 \|issue=14 \|pages=1793–800 \|year=2007 \|month=July \|pmid=17514354 \|doi=10.1007/s00018-007-7084-0 \|url=}}</ref><ref>{{cite journal \|author=Walters RD, Kugel JF, Goodrich JA \|title=InvAluable junk: the cellular impact and function of Alu and B2 RNAs \|journal=IUBMB Life \|volume=61 \|issue=8 \|pages=831–7 \|year=2009 \|month=Aug \|pmid=19621349 \|doi=10.1002/iub.227 \|url=}}</ref> [[Endogenous retrovirus]] sequences are the product of [[reverse transcription]] of [[retrovirus]] genomes into the genomes of [[germ cell]]s. Mutation within these retro-transcribed sequences can inactivate the viral genome.<ref>{{Cite journal \| last1 = Nelson \| first1 = PN. \| last2 = Hooley \| first2 = P. \| last3 = Roden \| first3 = D. \| last4 = Davari Ejtehadi \| first4 = H. \| last5 = Rylance \| first5 = P. \| last6 = Warren \| first6 = P. \| last7 = Martin \| first7 = J. \| last8 = Murray \| first8 = PG. \| title = Human endogenous retroviruses: transposable elements with potential? \| journal = Clin Exp Immunol \| volume = 138 \| issue = 1 \| pages = 1-9 \| month = Oct \| year = 2004 \| doi = 10.1111/j.1365-2249.2004.02592.x \| PMID = 15373898 }}</ref> Over 8% of the human genome is made up of (mostly decayed) endogenous retrovirus sequences, as part of the over 42% fraction that is recognizably derived of retrotransposons, while another 3% can be identified to be the remains of [[Transposon#Class II: DNA transposons\|DNA transposon]]s. Much of the remaining half of the genome that is currently without an explained origin is expected to have found its origin in transposable elements that were active so long ago (> 200 million years) that random mutations have rendered them unrecognizable.<ref name=humangenome>{{cite journal \| author=International Human Genome Sequencing Consortium \| month=February \| year=2001 \| title=Initial sequencing and analysis of the human genome\| journal=Nature \| volume=409 \| pages=879–888 \| pmid= 11237011 \| issue=6822 \| doi=10.1038/35057062}}</ref> Genome size variation in at least two kinds of plants is mostly the result of retrotransposon sequences.<ref>{{Cite journal \| last1 = Piegu \| first1 = B. \| last2 = Guyot \| first2 = R. \| last3 = Picault \| first3 = N. \| last4 = Roulin \| first4 = A. \| last5 = Sanyal \| first5 = A. \| last6 = Saniyal \| first6 = A. \| last7 = Kim \| first7 = H. \| last8 = Collura \| first8 = K. \| last9 = Brar \| first9 = DS. \| title = Doubling genome size without polyploidization: dynamics of retrotransposition-driven genomic expansions in Oryza australiensis, a wild relative of rice. \| journal = Genome Res \| volume = 16 \| issue = 10 \| pages = 1262-9 \| month = Oct \| year = 2006 \| doi = 10.1101/gr.5290206 \| PMID = 16963705 }} </ref><ref>{{Cite journal \| last1 = Hawkins \| first1 = JS. \| last2 = Kim \| first2 = H. \| last3 = Nason \| first3 = JD. \| last4 = Wing \| first4 = RA. \| last5 = Wendel \| first5 = JF. \| title = Differential lineage-specific amplification of transposable elements is responsible for genome size variation in Gossypium. \| journal = Genome Res \| volume = 16 \| issue = 10 \| pages = 1252-61 \| month = Oct \| year = 2006 \| doi = 10.1101/gr.5282906 \| PMID = 16954538 }}</ref> ===Telomeres=== [[Telomere]]s are regions of repetitive DNA at the end of a [[chromosome]], which provide protection from chromosomal deterioration during [[DNA replication]]. ==Functions of noncoding DNA== Many noncoding DNA sequences have important biological functions as indicated by [[comparative genomics]] studies that report some regions of noncoding DNA that are highly [[Conserved sequence\|conserved]], sometimes on time-scales representing hundreds of millions of years, implying that these noncoding regions are under strong [[evolution]]ary pressure and [[Natural selection\|positive selection]].<ref name=Ludwig>{{cite journal \|author=Ludwig MZ \|title=Functional evolution of noncoding DNA \|journal=Curr. Opin. Genet. Dev. \|volume=12 \|issue=6 \|pages=634–9 \|year=2002 \|month=December \|pmid=12433575 \|doi= 10.1016/S0959-437X(02)00355-6\|url=http://linkinghub.elsevier.com/retrieve/pii/S0959437X02003556}}</ref> For example, in the genomes of [[human]]s and [[mice]], which diverged from a [[common ancestor]] 65–75 million years ago, protein-coding DNA sequences account for only about 20% of conserved DNA, with the remaining 80% of conserved DNA represented in noncoding regions.<ref name="Cobb et al"/> [[Linkage mapping]] often identifies chromosomal regions associated with a disease with no evidence of functional coding variants of genes within the region, suggesting that disease-causing genetic variants lie in the noncoding DNA.<ref name="Cobb et al">{{cite journal \|author=Cobb J, Büsst C, Petrou S, Harrap S, Ellis J \|title=Searching for functional genetic variants in non-coding DNA \|journal=Clin. Exp. Pharmacol. Physiol. \|volume=35 \|issue=4 \|pages=372–5 \|year=2008 \|month=April \|pmid=18307723 \|doi=10.1111/j.1440-1681.2008.04880.x \|url=}}</ref> Some specific sequences of noncoding DNA may be features essential to chromosome structure, [[centromere]] function and [[homology (biology)\|homolog]] recognition in [[meiosis]].<ref>{{cite journal \|author=Subirana JA, Messeguer X \|title=The most frequent short sequences in non-coding DNA \|journal=Nucleic Acids Res. \|volume=38 \|issue=4 \|pages=1172–81 \|year=2010 \|month=March \|pmid=19966278 \|pmc=2831315 \|doi=10.1093/nar/gkp1094 \|url=}}</ref> According to a comparative study of over 300 [[prokaryotic]] and over 30 [[eukaryotic]] genomes,<ref>{{cite journal \| author = S. E. Ahnert, [[Thomas Fink\|T. M. A. Fink]] and A. Zinovyev \| title=How much non-coding DNA do eukaryotes require? \| url=http://www.tcm.phy.cam.ac.uk/~tmf20/PUBLICATIONS/jtb_07.pdf \|format=PDF\| journal=J Theor. Biol. \| year=2008 \| pages=587–592 \| issue = 4 \| pmid = 18384817 \| volume=252 \| doi = 10.1016/j.jtbi.2008.02.005}}</ref> eukaryotes appear to require a minimum amount of non-coding DNA. This minimum amount can be predicted using a growth model for regulatory genetic networks, implying that it is required for regulatory purposes. In humans the predicted minimum is about 5% of the total genome. === Protection of the genome === {{main\|Mutation}} Noncoding DNA separate genes from each other with long gaps, so mutation in one gene or part of a chromosome, for example deletion or insertion, does not have the "[[frameshift mutation]]" on the whole chromosome. When genome complexity is relatively high, like in the case of human genome, not only different genes, but also inside one gene there are gaps of [[introns]] to protect the entire coding segment to minimise the changes caused by mutation. === Genetic switches === Some noncoding DNA sequences are genetic "switches" that regulate when and where genes are expressed.<ref>{{cite journal\| doi=10.1038/scientificamerican0508-60\| author=Carroll, Sean B., et al.\| month=May \| year=2008 \| title=Regulating Evolution \| journal=Scientific American \| pages=60–67 \| volume=298 \| issue=5\| pmid=18444326 }}</ref> === Regulation of gene expression === {{Main\|Regulation of gene expression}} Some noncoding DNA sequences determine the expression levels of various genes.<ref name=Callaway>{{cite journal \| author=Callaway, Ewen \| title=Junk DNA gets credit for making us who we are \| journal=New Scientist \|year=2010 \| month=March \| url=http://www.newscientist.com/article/dn18680-junk-dna-gets-credit-for-making-us-who-we-are.html?full=true&print=true}}</ref> === Transcription factors === {{Main\|Transcription factor}} Some noncoding DNA sequences determine where transcription factors attach.<ref name=Callaway/> A transcription factor is a protein that binds to specific non-coding DNA sequences, thereby controlling the flow (or transcription) of genetic information from DNA to mRNA. Transcription factors act at very different locations on the genomes of different people. === Operators === {{Main\|Operator (biology) }} An operator is a segment of DNA to which a [[repressor]] binds. A repressor is a DNA-binding protein that regulates the expression of one or more genes by binding to the operator and blocking the attachment of RNA polymerase to the promoter, thus preventing transcription of the genes. This blocking of expression is called repression. === Enhancers === {{Main\|Enhancer (genetics) }} An enhancer is a short region of DNA that can be bound with proteins ([[trans-acting factors]]), much like a set of transcription factors, to enhance transcription levels of genes in a gene cluster. === Promoters === {{Main\|Promoter (biology) }} A promoter is a region of DNA that facilitates transcription of a particular gene. Promoters are typically located near the genes they regulate. === Insulators === {{Main\|Insulator (genetics) }} A genetic insulator is a boundary element that plays two distinct roles in gene expression, either as an enhancer-blocking code, or rarely as a barrier against condensed chromatin. An insulator in a DNA sequence is comparable to a linguistic [[word divider]] such as a comma (,) in a sentence, because the insulator indicates where an enhanced or repressed sequence ends. ==Noncoding DNA and evolution== Shared sequences of apparently non-functional DNA are a major line of [[evidence of common descent]].<ref name=TO-FAQ>[http://www.talkorigins.org/faqs/molgen/ "Plagiarized Errors and Molecular Genetics"], [[talkorigins]], by Edward E. Max, M.D., Ph.D.</ref> Pseudogene sequences appear to accumulate mutations more rapidly than coding sequences due to a loss of selective pressure.<ref name="Petrov & Hartl"/> This allows for the creation of mutant alleles that incorporate new functions that may be favored by natural selection; thus, pseudogenes can serve as raw material for [[evolution]] and can be considered "protogenes".<ref>{{cite journal \|author=Balakirev ES, Ayala FJ \|title=Pseudogenes: are they "junk" or functional DNA? \|journal=Annu. Rev. Genet. \|volume=37 \|issue= \|pages=123–51 \|year=2003 \|pmid=14616058 \|doi=10.1146/annurev.genet.37.040103.103949 \|url=}}</ref> ==Junk DNA== "Junk DNA" is a term that was introduced in 1972 by [[Susumu Ohno]],<ref name = Ohno>{{cite book\| author=S. Ohno \| year=1972\| last=[http://www.junkdna.com/ohno.html So much "junk" DNA in our genome]\| first=In Evolution of Genetic Systems\| editor=H. H. Smith\| pages=366–370\| publisher=Gordon and Breach, New York}}</ref> who noted that the [[mutational load]] from deleterious mutations placed an upper limit on the number of functional loci that could be expected given a typical mutation rate. Ohno predicted that mammal genomes could not have more than 30,000 loci under selection before the "cost" from the mutational load would cause an inescapable decline in fitness, and eventually extinction. This prediction remains robust, with the human genome containing approximately 20,000 genes. Junk DNA remains a label for the portions of a [[genome]] sequence for which no discernible [[Function (biology)\|function]] had been identified. According to a 1980 review in ''[[Nature (journal)\|Nature]]'' by [[Leslie Orgel]] and [[Francis Crick]], junk DNA has "little specificity and conveys little or no selective advantage to the organism".<ref>{{cite journal \|author=Orgel LE, Crick FH \|title=Selfish DNA: the ultimate parasite \|journal=Nature \|volume=284 \|issue=5757 \|pages=604–7 \|year=1980 \|month=April \|pmid=7366731 \|doi=10.1038/284604a0 \|url=http://www.nature.com/nature/journal/v284/n5757/abs/284604a0.html\|bibcode = 1980Natur.284..604O }}</ref> The term is used mainly in [[popular science]] and in a [[colloquialism\|colloquial]] way in scientific publications. "Scientific American" claims that its connotations may have slowed research into the biological functions of noncoding DNA.<ref name=SA>{{cite journal \|doi=10.1038/scientificamerican0307-104 \|author=Khajavinia A, Makalowski W \|title=What is "junk" DNA, and what is it worth? \|journal=[[Scientific American]] \|volume=296 \|issue=5 \|pages=104 \|year=2007 \|month=May \|pmid=17503549 \|url=http://www.scientificamerican.com/article.cfm?id=what-is-junk-dna-and-what \|quote=The term "junk DNA" repelled mainstream researchers from studying noncoding genetic material for many years}}</ref> Several lines of evidence indicate that some "junk DNA" sequences are likely to have unidentified functional activity, and other sequences may have had functions in the past.<ref>{{Cite journal\| doi = 10.1038/443521a\| title = Genetics: Junk DNA as an evolutionary force\| pmid = 17024082\| first2 = C \| year = 2006\| last2 = Vieira \| author = Biémont, Christian \| journal = Nature \| volume = 443\| issue = 7111 \| pages = 521–4 \| bibcode=2006Natur.443..521B}}</ref> Recently junk DNA sequences were artificially expressed resulting in the synthesis of functional proteins. In 2012, the [[ENCODE]] project, a research program supported by the [[National Human Genome Research Institute]], reported that 76% of the human genome's noncoding DNA sequences were [[Transcription (genetics)\|transcribed]] and that nearly half of the genome was in some way accessible to genetic regulatory proteins such as [[transcription factor]]s. This, however, does not necessarily mean that all of these segments have true biochemical function.<ref name=pennisi/> Still, a significant amount of the sequence of the genomes of eukaryotic organisms currently appears to fall under no existing classification other than "junk". For example, one experiment removed 0.1% of the mouse genome with no detectable effect on the [[phenotype]].<ref name=Nobrega>{{cite journal \| author=M.A. Nobrega, Y. Zhu, I. Plajzer-Frick, V. Afzal and E.M. Rubin \| year=2004 \| title=Megabase deletions of gene deserts result in viable mice\| journal=[[Nature (journal)\|Nature]] \| volume=431 \| pages=988–993 \| doi=10.1038/nature03022 \| issue=7011 \| pmid=15496924\|bibcode = 2004Natur.431..988N }}</ref> This result suggests that the removed DNA was largely nonfunctional. In addition, these sequences are enriched for the heterochromatic histone modification H3K9me3.<ref name="Rosenfeld_2009">{{Cite journal \| last1 = Rosenfeld \| first1 = JA. \| last2 = Wang \| first2 = Z. \| last3 = Schones \| first3 = DE. \| last4 = Zhao \| first4 = K. \| last5 = DeSalle \| first5 = R. \| last6 = Zhang \| first6 = MQ. \| title = Determination of enriched histone modifications in non-genic portions of the human genome. \| journal = BMC Genomics \| volume = 10 \| issue = \| pages = 143 \| month = \| year = 2009 \| doi = 10.1186/1471-2164-10-143 \| PMID = 19335899 \| pmc=2667539}}</ref> ==Noncoding DNA and Long range correlations== A statistical distinction between coding and noncoding DNA sequences has been found. It has been observed that nucleotides in non-coding DNA sequences display long range power law correlations while coding sequences do not.<ref>{{cite journal\|last=C.-K. Peng\|first=S. V. Buldyrev, A. L. Goldberger, [[Shlomo Havlin{{!}}S. Havlin]], F. Sciortino, M. Simons, H. E. Stanley\|title=Long-range correlations in nucleotide sequences\|journal=Nature\|year=1992\|volume=356\|pages=168–70\|url=http://havlin.biu.ac.il/Publications.php?keyword=Long-range+correlations+in+nucleotide+sequences&year=&match=all\|doi=10.1038/356168a0\|pmid=1301010\|last2=Buldyrev\|first2=SV\|last3=Goldberger\|first3=AL\|last4=Havlin\|first4=S\|last5=Sciortino\|first5=F\|last6=Simons\|first6=M\|last7=Stanley\|first7=HE\|issue=6365\|bibcode = 1992Natur.356..168P }}</ref><ref>{{cite journal\|last=W. Li and\|first=K. Kaneko\|title=Long-Range Correlation and Partial <math> 1/f^{alpha }</math> Spectrum in a Non-Coding DNA Sequence\|journal=Europhys. Lett\|year=1992\|volume=17\|pages=655–660\|url=http://chaos.c.u-tokyo.ac.jp/papers/bio0/wli.pdf\|bibcode = 1992EL.....17..655L \|doi = 10.1209/0295-5075/17/7/014 }}</ref><ref>{{cite journal\|last=S. V. Buldyrev\|first=A. L. Goldberger, [[Shlomo Havlin{{!}}S. Havlin]], R. N. Mantegna, M. Matsa, C.-K. Peng, M. Simons, and H. E. Stanley\|title=Long-range correlations properties of coding and noncoding DNA sequences: GenBank analysis\|journal=Phys. Rev. E\|year=1995\|volume=51\|pages=5084\|url=http://havlin.biu.ac.il/Publications.php?keyword=Long-range+correlations+properties+of+coding+and+noncoding+DNA+sequences:+GenBank+analysis&year=&match=all\|doi=10.1103/PhysRevE.51.5084\|last2=Goldberger\|first2=A.\|last3=Havlin\|first3=S.\|last4=Mantegna\|first4=R.\|last5=Matsa\|first5=M.\|last6=Peng\|first6=C.-K.\|last7=Simons\|first7=M.\|last8=Stanley\|first8=H.\|issue=5\|bibcode = 1995PhRvE..51.5084B }}</ref> ==See also== [[Conserved non-coding sequence]] [[Eukaryotic chromosome fine structure]] [[Gene-centered view of evolution]] [[Gene regulatory network]] [[Intergenic region]] [[Intragenomic conflict]] [[Phylogenetic footprinting]] [[Transcriptome]] ==References== {{reflist\|2}} {{cite book\| author = Bennett, M.D. and I.J. Leitch \| year = 2005 \| chapter = Genome size evolution in plants \| title = [[The Evolution of the Genome]] \| editor = T.R. Gregory (ed.) \| publisher = Elsevier \| ___location = San Diego \| pages = 89–162 }} {{cite book\| author = Gregory, T.R \| year = 2005 \| chapter = Genome size evolution in animals \| title = [[The Evolution of the Genome]] \| editor = T.R. Gregory (ed.) \| publisher = Elsevier \| ___location = San Diego\| isbn = 0-12-301463-8 }} {{cite journal \|author=Shabalina SA, Spiridonov NA \|title=The mammalian transcriptome and the function of non-coding DNA sequences \|journal=Genome Biol. \|volume=5 \|issue=4 \|pages=105 \|year=2004 \|pmid=15059247 \|pmc=395773 \|doi=10.1186/gb-2004-5-4-105 \|url=}} {{cite journal \|author=Castillo-Davis CI \|title=The evolution of noncoding DNA: how much junk, how much func? \|journal=Trends Genet. \|volume=21 \|issue=10 \|pages=533–6 \|year=2005 \|month=October \|pmid=16098630 \|doi=10.1016/j.tig.2005.08.001 \|url=}} ==External links== * [http://www.genomesize.com/ Animal Genome Size Database] * [http://www.rbgkew.org.uk/cval/homepage.html Plant DNA C-values Database] * [http://www.zbi.ee/fungal-genomesize/index.php Fungal Genome Size Database] * [http://www.nature.com/encode/#/threads ENCODE: The human encyclopaedia] [[Category:DNA]] [[Category:Genetics]] [[Category:Gene expression]] [[ar:فضلة الدنا]] [[ca:ADN no codificant]] [[cs:Nekódující DNA]] [[de:Nichtkodierende Desoxyribonukleinsäure]] [[fr:ADN non codant]] [[it:DNA non codificante]] [[pt:DNA não-codificante]] [[sk:Nekódujúca DNA]] [[sr:Nekodirajuća DNK]] [[fi:Tilke-DNA]] [[sv:Icke-kodande DNA]] [[tr:Kodlamayan DNA]] [[uk:Некодуюча ДНК]] [[zh:非编码DNA]]'
New page wikitext, after the edit (`new_wikitext`)	'In [[genomics]] and related disciplines, '''noncoding DNA''' sequences are components of an organism's [[DNA]] jbnvihdfbifdjhfihlbdihbsiudfbiuzcdiudsfaisucbch bsdcasuidcbzczxhjc xjcbshidfasiucbzxhj bxcbhjaiusbfzjkcx bzxkc zxkjcbsadkjas bczc skjdc hvj xvhidsvbsdgkbdsgvkjlthat do not [[genetic code\|encode]] [[protein]] sequences. Some noncoding DNA is [[Transcription (genetics)\|transcribed]] into functional [[noncoding RNA]] molecules (e.g. [[transfer RNA]], [[ribosomal RNA]], and [[RNA interference\|regulatory RNAs]]), while others are not transcribed or give rise to RNA transcripts of unknown function. The amount of noncoding DNA varies greatly among species. For example, about 50% of the [[human genome]] is noncoding DNA, while only about 2% of a typical [[bacterial genome size\|bacterial genome]] is noncoding DNA. Initially, a large proportion of noncoding DNA had no known biological function and was therefore sometimes referred to as "'''junk DNA'''", particularly in the lay press. Some sequences may have no biological function for the organism, such as [[endogenous retrovirus]]es. However, many types of noncoding DNA sequences do have important biological functions, including the [[Transcription (genetics)\|transcription]]al and [[translation (biology)\|translational]] [[regulatory sequence\|regulation]] of protein-coding sequences. Other noncoding sequences have likely, but as-yet undetermined, functions. (This is inferred from high levels of [[homology (biology)\|homology]] and [[Conserved sequence\|conservation]] seen in sequences that do not encode proteins but, nonetheless, appear to be under heavy [[selective pressure]].) The Encyclopedia of DNA Elements ([[ENCODE]]) project<ref name=Nature489p57>{{cite journal \|journal=Nature \| title=An integrated encyclopedia of DNA elements in the human genome \| year = 2012 \| volume = 489 \| pages = 57–74 \| doi = 10.1038/nature11247 \| author = The ENCODE Project Consortium \|issue=7414 }}</ref> reported in September 2012 that over 80% of DNA in the [[human genome]] "serves some purpose, biochemically speaking".<ref name=pennisi>{{Cite journal \| last1 = Pennisi \| first1 = E. \| title = Genomics. ENCODE project writes eulogy for junk DNA. \| journal = Science \| volume = 337 \| issue = 6099 \| pages = 1159, 1161 \| month = Sep \| year = 2012 \| doi = 10.1126/science.337.6099.1159 \| PMID = 22955811 }}</ref> == Fraction of noncoding genomic DNA == The amount of total genomic DNA varies widely between organisms, and the proportion of coding and noncoding DNA within these genomes varies greatly as well. More than 98% of the [[human genome]] does not encode protein sequences, including most sequences within [[intron]]s and most [[intergenic region\|intergenic DNA]].<ref name="Elgar & Vavouri">{{cite journal \|author=Elgar G, Vavouri T \|title=Tuning in to the signals: noncoding sequence conservation in vertebrate genomes \|journal=Trends Genet. \|volume=24 \|issue=7 \|pages=344–52 \|year=2008 \|month=July \|pmid=18514361 \|doi=10.1016/j.tig.2008.04.005 \|url=}}</ref> While overall [[genome size]], and by extension the amount of noncoding DNA, are correlated to organism complexity, there are many exceptions. For example, the genome of the unicellular ''[[Polychaos dubium]]'' (formerly known as ''Amoeba dubia'') has been reported to contain more than 200 times the amount of DNA in humans.<ref name=Gregory>{{cite journal \|author=Gregory TR, Hebert PD \|title=The modulation of DNA content: proximate causes and ultimate consequences \|journal=Genome Res. \|volume=9 \|issue=4 \|pages=317–24 \|year=1999 \|month=April \|pmid=10207154 \|doi= 10.1101/gr.9.4.317\|url=http://www.genome.org/cgi/pmidlookup?view=long&pmid=10207154}}</ref> The [[pufferfish]] ''[[Takifugu]] rubripes'' genome is only about one eighth the size of the human genome, yet seems to have a comparable number of genes; approximately 90% of the ''Takifugu'' genome is noncoding DNA<ref name="Elgar & Vavouri"/> and most of the genome size difference appears to lie in the noncoding DNA. The extensive variation in nuclear genome size among eukaryotic species is known as the [[C-value enigma]] or C-value paradox.<ref name=Wahls>{{cite journal \| doi=10.1016/0092-8674(90)90719-U \| author=Wahls, W.P., ''et al.''\| title=Hypervariable minisatellite DNA is a hotspot for homologous recombination in human cells \| journal=Cell \| year=1990 \| pages=95–103 \| volume=60\| issue=1 \| pmid=2295091}}</ref> About 80 percent of the [[nucleotide base]]s in the human genome may be transcribed,<ref name ="DNAStudy">{{cite journal \| author=Pennisi, Elizabeth \|title =DNA Study Forces Rethink of What It Means to Be a Gene\|journal = Science\| volume = 316 \| issue = 5831 \| year = 2007 \| pages =1556–7 \|doi =10.1126/science.316.5831.1556 \|pmid =17569836}}</ref> but transcription does not necessarily imply function.<ref name ="RNApol2study">{{cite journal \| author=Struhl, Kevin \|title =Transcriptional noise and the fidelity of initiation by RNA polymerase II\|journal = Nature Structural & Molecular Biology\| volume = 14 \| issue = 2\| year = 2007 \| pages =103–105 \|doi =10.1038/nsmb0207-103 \|pmid =17277804}}</ref> ==Types of noncoding DNA sequences== {{main\|conserved non-coding sequence}} ===Noncoding functional RNA=== [[Noncoding RNA]]s are functional [[RNA]] molecules that are not translated into protein. Examples of noncoding RNA include [[ribosomal RNA]], [[transfer RNA]], [[Piwi-interacting RNA]] and [[microRNA]]. MicroRNAs are predicted to control the translational activity of approximately 30% of all protein-coding genes in [[mammals]] and may be vital components in the progression or treatment of various diseases including [[cancer]], [[cardiovascular disease]], and the [[immune system]] response to [[infection]].<ref name=pmid19030926>{{cite journal \|author=Li M, Marin-Muller C, Bharadwaj U, Chow KH, Yao Q, Chen C \|title=MicroRNAs: Control and Loss of Control in Human Physiology and Disease \|journal=World J Surg \|volume=33 \|issue=4 \|pages=667–84 \|year=2009 \|month=April \|pmid=19030926 \|pmc=2933043 \|doi=10.1007/s00268-008-9836-x \|url=}}</ref> ===''Cis''- and ''Trans''-regulatory elements=== [[Cis-regulatory element]]s are sequences that control the [[Transcription (genetics)\|transcription]] of a nearby gene. Cis-elements may be located in [[5']] or [[3']] [[untranslated region]]s or within [[intron]]s. [[Trans-regulatory element]]s control the [[Transcription (genetics)\|transcription]] of a distant gene. [[promoter (biology)\|Promoter]]s facilitate the transcription of a particular gene and are typically [[Upstream and downstream (DNA)\|upstream]] of the coding region. [[Enhancer (genetics)\|Enhancer]] sequences may also exert very distant effects on the transcription levels of genes.<ref>{{cite journal \|author=Visel A, Rubin EM, [[Len A. Pennacchio\|Pennacchio LA]] \|title=Genomic Views of Distant-Acting Enhancers \|journal=Nature \|volume=461 \|issue=7261 \|pages=199–205 \|year=2009 \|month=September \|pmid=19741700 \|pmc=2923221 \|doi=10.1038/nature08451 \|url=\|bibcode = 2009Natur.461..199V }}</ref> ===Introns=== [[Intron]]s are non-coding sections of a gene, transcribed into the [[precursor mRNA]] sequence, but ultimately removed by [[RNA splicing]] during the processing to mature [[messenger RNA]]. Many introns appear to be [[mobile genetic element]]s.<ref name=intron/> Studies of [[group I intron]]s from ''[[Tetrahymena]]'' [[protozoans]] indicate that some introns appear to be selfish genetic elements, neutral to the host because they remove themselves from flanking [[exon]]s during [[RNA processing]] and do not produce an expression bias between [[allele]]s with and without the intron.<ref name=intron>{{cite journal \|author=Nielsen H, Johansen SD \|title=Group I introns: Moving in new directions \|journal=RNA Biol \|volume=6 \|issue=4 \|pages=375–83 \|year=2009 \|pmid=19667762 \|doi=10.4161/rna.6.4.9334 \|url=http://www.landesbioscience.com/journals/rna/abstract.php?id=9334}}</ref> Some introns appear to have significant biological function, possibly through [[ribozyme]] functionality that may regulate [[tRNA]] and [[rRNA]] activity as well as protein-coding gene expression, evident in hosts that have become dependent on such introns over long periods of time; for example, the ''trnL-intron'' is found in all [[green plant]]s and appears to have been [[Vertical gene transfer\|vertically inherited]] for several billions of years, including more than a billion years within [[chloroplast]]s and an additional 2–3 billion years prior in the [[cyanobacteria]]l ancestors of chloroplasts.<ref name=intron/> ===Pseudogenes=== [[Pseudogene]]s are DNA sequences, related to known [[gene]]s, that have lost their protein-coding ability or are otherwise no longer [[Gene expression\|expressed]] in the cell. Pseudogenes arise from retrotransposition or genomic duplication of functional genes, and become "genomic fossils" that are nonfunctional due to [[mutation]]s that prevent the [[Transcription (genetics)\|transcription]] of the gene, such as within the gene promoter region, or fatally alter the [[translation (biology)\|translation]] of the gene, such as premature [[stop codon]]s or [[Translational frameshift\|frameshift]]s.<ref name=pmid17568002>{{cite journal \|author=Zheng D, Frankish A, Baertsch R, ''et al.'' \|title=Pseudogenes in the ENCODE regions: Consensus annotation, analysis of transcription, and evolution \|journal=Genome Res. \|volume=17 \|issue=6 \|pages=839–51 \|year=2007 \|month=June \|pmid=17568002 \|pmc=1891343 \|doi=10.1101/gr.5586307 \|url=}}</ref> Pseudogenes resulting from the retrotransposition of an RNA intermediate are known as processed pseudogenes; pseudogenes that arise from the genomic remains of [[gene duplication\|duplicated genes]] or residues of inactivated genes are nonprocessed pseudogenes.<ref name=pmid17568002/> While [[Dollo's Law]] suggests that the loss of function in pseudogenes is likely permanent, silenced genes may actually retain function for several million years and can be "reactivated" into protein-coding sequences<ref>{{cite journal \|author=Marshall CR, Raff EC, Raff RA \|title=Dollo's law and the death and resurrection of genes \|journal=Proc. Natl. Acad. Sci. U.S.A. \|volume=91 \|issue=25 \|pages=12283–7 \|year=1994 \|month=December \|pmid=7991619 \|pmc=45421 \|doi= 10.1073/pnas.91.25.12283\|bibcode = 1994PNAS...9112283M }}</ref> and a substantial number of pseudogenes are actively transcribed.<ref name=pmid17568002/><ref>{{Cite journal \| last1 = Tutar \| first1 = Y. \| title = Pseudogenes. \| journal = Comp Funct Genomics \| volume = 2012 \| issue = \| pages = 424526 \| month = \| year = 2012 \| doi = 10.1155/2012/424526 \| PMID = 22611337 \| pmc=3352212}}</ref> Because pseudogenes are presumed to change without evolutionary constraint, they can serve as a useful model of the type and frequencies of various spontaneous [[genetic mutation]]s.<ref name="Petrov & Hartl">{{cite journal \|author=Petrov DA, Hartl DL \|title=Pseudogene evolution and natural selection for a compact genome \|journal=J. Hered. \|volume=91 \|issue=3 \|pages=221–7 \|year=2000 \|pmid=10833048 \|doi= 10.1093/jhered/91.3.221\|url=http://jhered.oxfordjournals.org/cgi/pmidlookup?view=long&pmid=10833048}}</ref> ===Repeat sequences, transposons and viral elements=== [[Transposon]]s and [[retrotransposon]]s are [[mobile genetic elements]]. Retrotransposon [[Repeated sequence (DNA)\|repeated sequences]], which include [[Retrotransposon#LINEs\|long interspersed nuclear elements]] (LINEs) and [[Retrotransposon#SINEs\|short interspersed nuclear elements]] (SINEs), account for a large proportion of the genomic sequences in many species. [[Alu sequence]]s, classified as a short interspersed nuclear element, are the most abundant mobile elements in the human genome. Some examples have been found of SINEs exerting transcriptional control of some protein-encoding genes.<ref>{{cite journal \|author=Ponicsan SL, Kugel JF, Goodrich JA \|title=Genomic gems: SINE RNAs regulate mRNA production \|journal=Curr Opin Genet Dev \|volume= 20\|issue= 2\|pages= 149–55\|year=2010 \|month=February \|pmid=20176473 \|pmc=2859989 \|doi=10.1016/j.gde.2010.01.004 \|url=}}</ref><ref>{{cite journal \|author=Häsler J, Samuelsson T, Strub K \|title=Useful 'junk': Alu RNAs in the human transcriptome \|journal=Cell. Mol. Life Sci. \|volume=64 \|issue=14 \|pages=1793–800 \|year=2007 \|month=July \|pmid=17514354 \|doi=10.1007/s00018-007-7084-0 \|url=}}</ref><ref>{{cite journal \|author=Walters RD, Kugel JF, Goodrich JA \|title=InvAluable junk: the cellular impact and function of Alu and B2 RNAs \|journal=IUBMB Life \|volume=61 \|issue=8 \|pages=831–7 \|year=2009 \|month=Aug \|pmid=19621349 \|doi=10.1002/iub.227 \|url=}}</ref> [[Endogenous retrovirus]] sequences are the product of [[reverse transcription]] of [[retrovirus]] genomes into the genomes of [[germ cell]]s. Mutation within these retro-transcribed sequences can inactivate the viral genome.<ref>{{Cite journal \| last1 = Nelson \| first1 = PN. \| last2 = Hooley \| first2 = P. \| last3 = Roden \| first3 = D. \| last4 = Davari Ejtehadi \| first4 = H. \| last5 = Rylance \| first5 = P. \| last6 = Warren \| first6 = P. \| last7 = Martin \| first7 = J. \| last8 = Murray \| first8 = PG. \| title = Human endogenous retroviruses: transposable elements with potential? \| journal = Clin Exp Immunol \| volume = 138 \| issue = 1 \| pages = 1-9 \| month = Oct \| year = 2004 \| doi = 10.1111/j.1365-2249.2004.02592.x \| PMID = 15373898 }}</ref> Over 8% of the human genome is made up of (mostly decayed) endogenous retrovirus sequences, as part of the over 42% fraction that is recognizably derived of retrotransposons, while another 3% can be identified to be the remains of [[Transposon#Class II: DNA transposons\|DNA transposon]]s. Much of the remaining half of the genome that is currently without an explained origin is expected to have found its origin in transposable elements that were active so long ago (> 200 million years) that random mutations have rendered them unrecognizable.<ref name=humangenome>{{cite journal \| author=International Human Genome Sequencing Consortium \| month=February \| year=2001 \| title=Initial sequencing and analysis of the human genome\| journal=Nature \| volume=409 \| pages=879–888 \| pmid= 11237011 \| issue=6822 \| doi=10.1038/35057062}}</ref> Genome size variation in at least two kinds of plants is mostly the result of retrotransposon sequences.<ref>{{Cite journal \| last1 = Piegu \| first1 = B. \| last2 = Guyot \| first2 = R. \| last3 = Picault \| first3 = N. \| last4 = Roulin \| first4 = A. \| last5 = Sanyal \| first5 = A. \| last6 = Saniyal \| first6 = A. \| last7 = Kim \| first7 = H. \| last8 = Collura \| first8 = K. \| last9 = Brar \| first9 = DS. \| title = Doubling genome size without polyploidization: dynamics of retrotransposition-driven genomic expansions in Oryza australiensis, a wild relative of rice. \| journal = Genome Res \| volume = 16 \| issue = 10 \| pages = 1262-9 \| month = Oct \| year = 2006 \| doi = 10.1101/gr.5290206 \| PMID = 16963705 }} </ref><ref>{{Cite journal \| last1 = Hawkins \| first1 = JS. \| last2 = Kim \| first2 = H. \| last3 = Nason \| first3 = JD. \| last4 = Wing \| first4 = RA. \| last5 = Wendel \| first5 = JF. \| title = Differential lineage-specific amplification of transposable elements is responsible for genome size variation in Gossypium. \| journal = Genome Res \| volume = 16 \| issue = 10 \| pages = 1252-61 \| month = Oct \| year = 2006 \| doi = 10.1101/gr.5282906 \| PMID = 16954538 }}</ref> ===Telomeres=== [[Telomere]]s are regions of repetitive DNA at the end of a [[chromosome]], which provide protection from chromosomal deterioration during [[DNA replication]]. ==Functions of noncoding DNA== Many noncoding DNA sequences have important biological functions as indicated by [[comparative genomics]] studies that report some regions of noncoding DNA that are highly [[Conserved sequence\|conserved]], sometimes on time-scales representing hundreds of millions of years, implying that these noncoding regions are under strong [[evolution]]ary pressure and [[Natural selection\|positive selection]].<ref name=Ludwig>{{cite journal \|author=Ludwig MZ \|title=Functional evolution of noncoding DNA \|journal=Curr. Opin. Genet. Dev. \|volume=12 \|issue=6 \|pages=634–9 \|year=2002 \|month=December \|pmid=12433575 \|doi= 10.1016/S0959-437X(02)00355-6\|url=http://linkinghub.elsevier.com/retrieve/pii/S0959437X02003556}}</ref> For example, in the genomes of [[human]]s and [[mice]], which diverged from a [[common ancestor]] 65–75 million years ago, protein-coding DNA sequences account for only about 20% of conserved DNA, with the remaining 80% of conserved DNA represented in noncoding regions.<ref name="Cobb et al"/> [[Linkage mapping]] often identifies chromosomal regions associated with a disease with no evidence of functional coding variants of genes within the region, suggesting that disease-causing genetic variants lie in the noncoding DNA.<ref name="Cobb et al">{{cite journal \|author=Cobb J, Büsst C, Petrou S, Harrap S, Ellis J \|title=Searching for functional genetic variants in non-coding DNA \|journal=Clin. Exp. Pharmacol. Physiol. \|volume=35 \|issue=4 \|pages=372–5 \|year=2008 \|month=April \|pmid=18307723 \|doi=10.1111/j.1440-1681.2008.04880.x \|url=}}</ref> Some specific sequences of noncoding DNA may be features essential to chromosome structure, [[centromere]] function and [[homology (biology)\|homolog]] recognition in [[meiosis]].<ref>{{cite journal \|author=Subirana JA, Messeguer X \|title=The most frequent short sequences in non-coding DNA \|journal=Nucleic Acids Res. \|volume=38 \|issue=4 \|pages=1172–81 \|year=2010 \|month=March \|pmid=19966278 \|pmc=2831315 \|doi=10.1093/nar/gkp1094 \|url=}}</ref> According to a comparative study of over 300 [[prokaryotic]] and over 30 [[eukaryotic]] genomes,<ref>{{cite journal \| author = S. E. Ahnert, [[Thomas Fink\|T. M. A. Fink]] and A. Zinovyev \| title=How much non-coding DNA do eukaryotes require? \| url=http://www.tcm.phy.cam.ac.uk/~tmf20/PUBLICATIONS/jtb_07.pdf \|format=PDF\| journal=J Theor. Biol. \| year=2008 \| pages=587–592 \| issue = 4 \| pmid = 18384817 \| volume=252 \| doi = 10.1016/j.jtbi.2008.02.005}}</ref> eukaryotes appear to require a minimum amount of non-coding DNA. This minimum amount can be predicted using a growth model for regulatory genetic networks, implying that it is required for regulatory purposes. In humans the predicted minimum is about 5% of the total genome. === Protection of the genome === {{main\|Mutation}} Noncoding DNA separate genes from each other with long gaps, so mutation in one gene or part of a chromosome, for example deletion or insertion, does not have the "[[frameshift mutation]]" on the whole chromosome. When genome complexity is relatively high, like in the case of human genome, not only different genes, but also inside one gene there are gaps of [[introns]] to protect the entire coding segment to minimise the changes caused by mutation. === Genetic switches === Some noncoding DNA sequences are genetic "switches" that regulate when and where genes are expressed.<ref>{{cite journal\| doi=10.1038/scientificamerican0508-60\| author=Carroll, Sean B., et al.\| month=May \| year=2008 \| title=Regulating Evolution \| journal=Scientific American \| pages=60–67 \| volume=298 \| issue=5\| pmid=18444326 }}</ref> === Regulation of gene expression === {{Main\|Regulation of gene expression}} Some noncoding DNA sequences determine the expression levels of various genes.<ref name=Callaway>{{cite journal \| author=Callaway, Ewen \| title=Junk DNA gets credit for making us who we are \| journal=New Scientist \|year=2010 \| month=March \| url=http://www.newscientist.com/article/dn18680-junk-dna-gets-credit-for-making-us-who-we-are.html?full=true&print=true}}</ref> === Transcription factors === {{Main\|Transcription factor}} Some noncoding DNA sequences determine where transcription factors attach.<ref name=Callaway/> A transcription factor is a protein that binds to specific non-coding DNA sequences, thereby controlling the flow (or transcription) of genetic information from DNA to mRNA. Transcription factors act at very different locations on the genomes of different people. === Operators === {{Main\|Operator (biology) }} An operator is a segment of DNA to which a [[repressor]] binds. A repressor is a DNA-binding protein that regulates the expression of one or more genes by binding to the operator and blocking the attachment of RNA polymerase to the promoter, thus preventing transcription of the genes. This blocking of expression is called repression. === Enhancers === {{Main\|Enhancer (genetics) }} An enhancer is a short region of DNA that can be bound with proteins ([[trans-acting factors]]), much like a set of transcription factors, to enhance transcription levels of genes in a gene cluster. === Promoters === {{Main\|Promoter (biology) }} A promoter is a region of DNA that facilitates transcription of a particular gene. Promoters are typically located near the genes they regulate. === Insulators === {{Main\|Insulator (genetics) }} A genetic insulator is a boundary element that plays two distinct roles in gene expression, either as an enhancer-blocking code, or rarely as a barrier against condensed chromatin. An insulator in a DNA sequence is comparable to a linguistic [[word divider]] such as a comma (,) in a sentence, because the insulator indicates where an enhanced or repressed sequence ends. ==Noncoding DNA and evolution== Shared sequences of apparently non-functional DNA are a major line of [[evidence of common descent]].<ref name=TO-FAQ>[http://www.talkorigins.org/faqs/molgen/ "Plagiarized Errors and Molecular Genetics"], [[talkorigins]], by Edward E. Max, M.D., Ph.D.</ref> Pseudogene sequences appear to accumulate mutations more rapidly than coding sequences due to a loss of selective pressure.<ref name="Petrov & Hartl"/> This allows for the creation of mutant alleles that incorporate new functions that may be favored by natural selection; thus, pseudogenes can serve as raw material for [[evolution]] and can be considered "protogenes".<ref>{{cite journal \|author=Balakirev ES, Ayala FJ \|title=Pseudogenes: are they "junk" or functional DNA? \|journal=Annu. Rev. Genet. \|volume=37 \|issue= \|pages=123–51 \|year=2003 \|pmid=14616058 \|doi=10.1146/annurev.genet.37.040103.103949 \|url=}}</ref> ==Junk DNA== "Junk DNA" is a term that was introduced in 1972 by [[Susumu Ohno]],<ref name = Ohno>{{cite book\| author=S. Ohno \| year=1972\| last=[http://www.junkdna.com/ohno.html So much "junk" DNA in our genome]\| first=In Evolution of Genetic Systems\| editor=H. H. Smith\| pages=366–370\| publisher=Gordon and Breach, New York}}</ref> who noted that the [[mutational load]] from deleterious mutations placed an upper limit on the number of functional loci that could be expected given a typical mutation rate. Ohno predicted that mammal genomes could not have more than 30,000 loci under selection before the "cost" from the mutational load would cause an inescapable decline in fitness, and eventually extinction. This prediction remains robust, with the human genome containing approximately 20,000 genes. Junk DNA remains a label for the portions of a [[genome]] sequence for which no discernible [[Function (biology)\|function]] had been identified. According to a 1980 review in ''[[Nature (journal)\|Nature]]'' by [[Leslie Orgel]] and [[Francis Crick]], junk DNA has "little specificity and conveys little or no selective advantage to the organism".<ref>{{cite journal \|author=Orgel LE, Crick FH \|title=Selfish DNA: the ultimate parasite \|journal=Nature \|volume=284 \|issue=5757 \|pages=604–7 \|year=1980 \|month=April \|pmid=7366731 \|doi=10.1038/284604a0 \|url=http://www.nature.com/nature/journal/v284/n5757/abs/284604a0.html\|bibcode = 1980Natur.284..604O }}</ref> The term is used mainly in [[popular science]] and in a [[colloquialism\|colloquial]] way in scientific publications. "Scientific American" claims that its connotations may have slowed research into the biological functions of noncoding DNA.<ref name=SA>{{cite journal \|doi=10.1038/scientificamerican0307-104 \|author=Khajavinia A, Makalowski W \|title=What is "junk" DNA, and what is it worth? \|journal=[[Scientific American]] \|volume=296 \|issue=5 \|pages=104 \|year=2007 \|month=May \|pmid=17503549 \|url=http://www.scientificamerican.com/article.cfm?id=what-is-junk-dna-and-what \|quote=The term "junk DNA" repelled mainstream researchers from studying noncoding genetic material for many years}}</ref> Several lines of evidence indicate that some "junk DNA" sequences are likely to have unidentified functional activity, and other sequences may have had functions in the past.<ref>{{Cite journal\| doi = 10.1038/443521a\| title = Genetics: Junk DNA as an evolutionary force\| pmid = 17024082\| first2 = C \| year = 2006\| last2 = Vieira \| author = Biémont, Christian \| journal = Nature \| volume = 443\| issue = 7111 \| pages = 521–4 \| bibcode=2006Natur.443..521B}}</ref> Recently junk DNA sequences were artificially expressed resulting in the synthesis of functional proteins. In 2012, the [[ENCODE]] project, a research program supported by the [[National Human Genome Research Institute]], reported that 76% of the human genome's noncoding DNA sequences were [[Transcription (genetics)\|transcribed]] and that nearly half of the genome was in some way accessible to genetic regulatory proteins such as [[transcription factor]]s. This, however, does not necessarily mean that all of these segments have true biochemical function.<ref name=pennisi/> Still, a significant amount of the sequence of the genomes of eukaryotic organisms currently appears to fall under no existing classification other than "junk". For example, one experiment removed 0.1% of the mouse genome with no detectable effect on the [[phenotype]].<ref name=Nobrega>{{cite journal \| author=M.A. Nobrega, Y. Zhu, I. Plajzer-Frick, V. Afzal and E.M. Rubin \| year=2004 \| title=Megabase deletions of gene deserts result in viable mice\| journal=[[Nature (journal)\|Nature]] \| volume=431 \| pages=988–993 \| doi=10.1038/nature03022 \| issue=7011 \| pmid=15496924\|bibcode = 2004Natur.431..988N }}</ref> This result suggests that the removed DNA was largely nonfunctional. In addition, these sequences are enriched for the heterochromatic histone modification H3K9me3.<ref name="Rosenfeld_2009">{{Cite journal \| last1 = Rosenfeld \| first1 = JA. \| last2 = Wang \| first2 = Z. \| last3 = Schones \| first3 = DE. \| last4 = Zhao \| first4 = K. \| last5 = DeSalle \| first5 = R. \| last6 = Zhang \| first6 = MQ. \| title = Determination of enriched histone modifications in non-genic portions of the human genome. \| journal = BMC Genomics \| volume = 10 \| issue = \| pages = 143 \| month = \| year = 2009 \| doi = 10.1186/1471-2164-10-143 \| PMID = 19335899 \| pmc=2667539}}</ref> ==Noncoding DNA and Long range correlations== A statistical distinction between coding and noncoding DNA sequences has been found. It has been observed that nucleotides in non-coding DNA sequences display long range power law correlations while coding sequences do not.<ref>{{cite journal\|last=C.-K. Peng\|first=S. V. Buldyrev, A. L. Goldberger, [[Shlomo Havlin{{!}}S. Havlin]], F. Sciortino, M. Simons, H. E. Stanley\|title=Long-range correlations in nucleotide sequences\|journal=Nature\|year=1992\|volume=356\|pages=168–70\|url=http://havlin.biu.ac.il/Publications.php?keyword=Long-range+correlations+in+nucleotide+sequences&year=&match=all\|doi=10.1038/356168a0\|pmid=1301010\|last2=Buldyrev\|first2=SV\|last3=Goldberger\|first3=AL\|last4=Havlin\|first4=S\|last5=Sciortino\|first5=F\|last6=Simons\|first6=M\|last7=Stanley\|first7=HE\|issue=6365\|bibcode = 1992Natur.356..168P }}</ref><ref>{{cite journal\|last=W. Li and\|first=K. Kaneko\|title=Long-Range Correlation and Partial <math> 1/f^{alpha }</math> Spectrum in a Non-Coding DNA Sequence\|journal=Europhys. Lett\|year=1992\|volume=17\|pages=655–660\|url=http://chaos.c.u-tokyo.ac.jp/papers/bio0/wli.pdf\|bibcode = 1992EL.....17..655L \|doi = 10.1209/0295-5075/17/7/014 }}</ref><ref>{{cite journal\|last=S. V. Buldyrev\|first=A. L. Goldberger, [[Shlomo Havlin{{!}}S. Havlin]], R. N. Mantegna, M. Matsa, C.-K. Peng, M. Simons, and H. E. Stanley\|title=Long-range correlations properties of coding and noncoding DNA sequences: GenBank analysis\|journal=Phys. Rev. E\|year=1995\|volume=51\|pages=5084\|url=http://havlin.biu.ac.il/Publications.php?keyword=Long-range+correlations+properties+of+coding+and+noncoding+DNA+sequences:+GenBank+analysis&year=&match=all\|doi=10.1103/PhysRevE.51.5084\|last2=Goldberger\|first2=A.\|last3=Havlin\|first3=S.\|last4=Mantegna\|first4=R.\|last5=Matsa\|first5=M.\|last6=Peng\|first6=C.-K.\|last7=Simons\|first7=M.\|last8=Stanley\|first8=H.\|issue=5\|bibcode = 1995PhRvE..51.5084B }}</ref> ==See also== [[Conserved non-coding sequence]] [[Eukaryotic chromosome fine structure]] [[Gene-centered view of evolution]] [[Gene regulatory network]] [[Intergenic region]] [[Intragenomic conflict]] [[Phylogenetic footprinting]] [[Transcriptome]] ==References== {{reflist\|2}} {{cite book\| author = Bennett, M.D. and I.J. Leitch \| year = 2005 \| chapter = Genome size evolution in plants \| title = [[The Evolution of the Genome]] \| editor = T.R. Gregory (ed.) \| publisher = Elsevier \| ___location = San Diego \| pages = 89–162 }} {{cite book\| author = Gregory, T.R \| year = 2005 \| chapter = Genome size evolution in animals \| title = [[The Evolution of the Genome]] \| editor = T.R. Gregory (ed.) \| publisher = Elsevier \| ___location = San Diego\| isbn = 0-12-301463-8 }} {{cite journal \|author=Shabalina SA, Spiridonov NA \|title=The mammalian transcriptome and the function of non-coding DNA sequences \|journal=Genome Biol. \|volume=5 \|issue=4 \|pages=105 \|year=2004 \|pmid=15059247 \|pmc=395773 \|doi=10.1186/gb-2004-5-4-105 \|url=}} {{cite journal \|author=Castillo-Davis CI \|title=The evolution of noncoding DNA: how much junk, how much func? \|journal=Trends Genet. \|volume=21 \|issue=10 \|pages=533–6 \|year=2005 \|month=October \|pmid=16098630 \|doi=10.1016/j.tig.2005.08.001 \|url=}} ==External links== * [http://www.genomesize.com/ Animal Genome Size Database] * [http://www.rbgkew.org.uk/cval/homepage.html Plant DNA C-values Database] * [http://www.zbi.ee/fungal-genomesize/index.php Fungal Genome Size Database] * [http://www.nature.com/encode/#/threads ENCODE: The human encyclopaedia] [[Category:DNA]] [[Category:Genetics]] [[Category:Gene expression]] [[ar:فضلة الدنا]] [[ca:ADN no codificant]] [[cs:Nekódující DNA]] [[de:Nichtkodierende Desoxyribonukleinsäure]] [[fr:ADN non codant]] [[it:DNA non codificante]] [[pt:DNA não-codificante]] [[sk:Nekódujúca DNA]] [[sr:Nekodirajuća DNK]] [[fi:Tilke-DNA]] [[sv:Icke-kodande DNA]] [[tr:Kodlamayan DNA]] [[uk:Некодуюча ДНК]] [[zh:非编码DNA]]'
Whether or not the change was made through a Tor exit node (`tor_exit_node`)	0
Unix timestamp of change (`timestamp`)	1361373805