List of Internet phenomena and Processor design: Difference between pages

(Difference between pages)
Content deleted Content added
 
 
Line 1:
To a large extent, the [[design]] of a '''CPU''', or [[central processing unit]], is the design of its [[control unit]]. The modern (ie, 1965 to 1985) way to design control logic is to write a [[microprogram]].
An '''Internet phenomenon''' occurs when something relatively unknown becomes increasingly popular, but usually for a short duration of time. It is nearly impossible to accurately measure the depth of a phenomenon's popularity, and different groups of the [[Internet]] may participate more than others. The Internet's lack of physical boundaries leads to a much faster and wider spread of information and ideas, especially when the subject is based around humor or curiosity. Some people point to this sort of Internet phenomena as good examples of [[meme]]s, or [[neta]]. In [[William Gibson]]'s [[novel]] ''[[Pattern Recognition (novel)|Pattern Recognition]]'' an interesting kind of Internet phenomenon—"the footage"—plays an important role.
 
CPU design was originally an [[ad-hoc]] process. Just getting a CPU to work was a substantial governmental and technical event.
Internet phenomena include:
<!--
*****
Thanks for contributing. We think this list works best in *ALPHABETICAL ORDER*, and we'd appreciate it if new contributions or edits were inserted in the proper spot. Thanks!
*****
-->
 
Key design innovations include [[CPU cache|cache]], [[virtual memory]], [[instruction pipelining]], [[superscalar]], [[CISC]], [[RISC]], [[virtual machine]], [[emulators]], [[microprogram]], and [[Stack (computing)|Stack]].
==Videos==
* [[Andy Milonakis]] - A man with the appearance/voice of a 14 year old boy via a growth hormone defect, who rose to fame after releasing home [[webcam]] recordings of freestyle raps, silly and funny videos, and short films (such as the "[[Crispy New Freestyle]]" and "[[The Superbowl Is Gay]]") on the Internet. He even has his own MTV show now.
* [[Bad Day]] - a man takes out his rage on his computer.
* [[William Henry Gates III|Bill Gates]] gets a pie in the face. [http://www.flamingmailbox.com/maccomedy/movies/creamedgates.html Link to video here.]
* Bell Ringer on [[ESPN]] - A ESPN football game showing a man ringing a bell really fast that looks like he's [[masturbating]] due to the camera angle and his facial expressions. [http://www.spikedhumor.com/Article.aspx?id=789]
* [[Brian Collins]], whose entertainingly bad performance as a sports anchor coined the phrase "[[boom goes the dynamite]]" and led to an appearance on [[CBS]]. [http://media.ebaumsworld.com/index.php?e=sportsnews.wmv (original video)]
* [[Broom Dance]] - A video of a kid in his underwear dancing with a broom [http://www.angelfire.com/trek/broomdance/]
* [[Bubb Rubb]] - a man who rose to fame thanks to a humorous local [[TV]] broadcast where he was interviewed on the topic of [[whistle tip]]s
* "[[Call On Me]]" by [[Eric Prydz]] has had a popular video featuring a male in a female [[aerobics]] class, along with many spoofs. [http://www.partycampus.com/callonme1.php] The most famous spoof features a female in a male Naval aerobics class. [http://www.collegehumor.com/?movie_id=91110]
* [[Chin2]] - supposedly Korean kids dancing topless in front of a mirror
* [[Zladko Vladcik|Elektronik Supersonik]] - An allegedly Eastern European pop song and music video, featured on the [[Molvania]] website. A hilarious parody of Eastern European pop culture.
* [[Ellen Feiss]] - a teenage girl featured in an [[Apple Computer]] advertisement, whose slurred speech and disoriented eyes provoked speculation that she was under the influence of illicit drugs
* [[Exploding whale]] - an old news story thought myth gets a second following with the postage of a news footage video.
* [[Fan film]]s - especially for ''[[Star Wars]]''.[http://www.theforce.net] Fanfilms range from simple backyard antics to professional looking films such as "Duality" [http://www.crewoftwo.com/]
* [[Is This the Way to Armadillo]] - spoof music video of the song "[[Is This the Way to Amarillo]]" made by UK troops stationed in Iraq. [http://www.milkandcookies.com/links/30449/ Link to video here.]
* [[John Daker]] - a singer who makes up for his lack of ability and his unusual voice with bizarre facial expressions.
* [[More Cowbell]] - Saturday Night Live skit about cowbells and The Blue Oyster Cult. [http://gorillamask.net/morecowbell.shtml Link to Video here.]
* [[Numa Numa]] - an overly enthusiastic kid (Gary Brolsma) singing along to a Romanian-language dance song ("Dragostea Din Tei" by O-ZONE)
* [[Prophet Yahweh]] - video[http://www.filecabi.net/v.php?file=ufoondemand.wmv] of the prophet supposedly summoning [[UFO]]s, was widely circulated on the internet in June of 2005.
* [[Rock Paper Saddam]] [http://www.rockpapersaddam.com/]
* [[Rubber Johnny]] - a short film by music video director [[Chris Cunningham]], accompanied by music from [[Aphex Twin]], which was thought by some to depict an actual mutant teenager in a wheelchair when excerpts from it began appearing on the internet [http://www.rubberjohnny.tv Link to Video here.]
* [[Ghyslain Raza|The Star Wars Kid]] - a video digitally edited numerous times of a [[Québec]]ois teenager (Ghyslain Raza) pretending to be [[Darth Maul]]
* [[Stealth Disco]] - Videos of people rocking out behind unknowing victims.
* [[STFU (video)|STFU]] - A girl who does a bad job of singing, and then when she stops she is whacked from behind by a colleague who wants her to [[STFU]]. [http://www.stfu.se/stfuse.swf Link to video here.]
* [[Video Game Pianist]] - Piano player who plays both old and new video games' themes. Formerly called "[[The Blindfolded Pianist]]"
* [[William Hung]] - A Chinese college student at [[UC Berkeley]] who auditioned for the [[2004]] series of [[American Idol]] and had already been laughed at on TV rose to even more fame through the Internet with his audition getting more widespread exposure as well as remix
 
== History of general purpose CPUs ==
==Animation-based==
* [[All your base are belong to us]] - A flash cartoon making fun of a bad Japanese translation in an old videogame
* [[Animutation]]s - simple animations usually containing foriegn music and pop-culture references. The fad first gained widespread popularity with "Hyakugojyuuichi".
* [[Badger Badger Badger]] - a repetitive animation about Badgers
* [[Bananaphone]]
* [[Big Ass Titties]] - video which fantasizes over [[Lindsay Lohan]], [[Halle Berry]], [[Mariah Carey]], [[Angelina Jolie]], and various other celebrities' "big ass titties". [http://www.lohanfreestyle.com View the official website here.]
* [[Dancing baby]] - a 3D-rendered dancing baby.
* [[The Demented Cartoon Movie]] [http://www.albinoblacksheep.com/flash/demented.html]
* [[The End of the World]] - a flash animation using [[Group X]]-style voices.[http://www.albinoblacksheep.com/flash/end.php]
* [[Final Fantasy A Plus|Final Fantasy A+]] - a [[school]] based adaptation of the [[Final Fantasy]] video game
* [[Group X]] - makers of the songs "Bang Bang Bang", "SchfiftyFive" and a parody of the "[[Mario Twins]]" A version of the [[Super Mario Bros.]] theme by [[Group X]], adapted into [[Macromedia Flash|Flash]] video.
* [[The Hampster Dance]] - a page filled with animated GIFs of hamsters, linking to other animated pages. It now has its own CD soundtrack.
* [[Happy Tree Friends]] - a series featuring cute animals that meet violent ends
* [[Homestar Runner]] - an online series that features a popular "[[Strong Bad]] Emails" in which viewers can ask the main character questions
* [[How to Kill a Mockingbird]] - A parody of the novel [[To Kill a Mockingbird]] that quickly deviates into a fantasy about pirates, dinosaurs, robots, and ninja.
* [[How to Open a Door, Anime Style]] - a stick figure tries to open the door by pushing, using brute force and variety of weapons, but at the end the door is opened only by pulling. [http://web.archive.org/web/20031222093227/http://www.generyx.de/anime.gif]
*[[I Hate Yuo Myg0t 2]] - a flash animation of a Counter-Strike player who blames 'hacking' (especially that of [[myg0t]]) for his lack of skill. He is then tricked into downloading a Counter-Strike 'hacking' program off of [[myg0t]] which infects his computer. [http://www.pwned.nl]
* [[JibJab]] [http://www.jibjab.com] - a Web site featuring many cartoons including those that [[satirize]] the [[U.S. presidential election, 2004 (detail)|2004 Presidential Election]]
* [[Madness (flash cartoon)|Madness]]
* [[Weebl's cartoons#Magical Trevor|Magical Trevor]] - A flash cartoon about a magician.
* [[Neurotically Yours]] - a series featuring a [[Goth]] and her pet [[squirrel]]
* [[Peanut Butter Jelly Time]] - A dancing banana that moves about the screen during a rap about [[peanut butter and jelly]]. [http://www.ebaumsworld.com/flash/peanutbutter.html Link to one version of the video here.]
* [[Prank flash]]s - flash animations that tend to catch people off guard. They can be heart jumping "screamers" or just simply annoying flashes such as "[[You Are An Idiot]]".
* [[Rathergood.com]], a website which produces Photoshopped creatures, usually singing, manned by [[Joel Veitch]].
* [[Red vs Blue]] - a popular [[machinima]] using the [[Microsoft]] [[Halo (video game series) |Halo]] video game engine. A popular, fan-created outgrowth is [http://www.redvsblue.net Sponsors vs Freeloaders]
* [[Rejected]] - a story of an animator and the effects of rejection
* [[Salad Fingers]] - a series featuring an odd character in a strange and creepy world
* [[There she is!!]] - a Korean series about a girl rabbit that fell in love with a cat
* [[We Drink Ritalin]]
* [[Weebl and Bob]] - a series about two egg shaped friends
* [[Weeeeee! (Gonads and Strife)]]
* [[Xiao Xiao]] - a set of stick figure action animations. Xiao Xiao #3 was particularly popular.
 
=== 1950s: early designs ===
==Anime==
* [[Nevada-tan]] - an [[imageboard]] [[meme]] featuring [[CG artwork]] of a Japanese schoolgirl who murdered her classmate
* [[OS-tan]] [http://nijiura-os.hp.infoseek.co.jp/] - [[operating system]]s personified as cute mascots by various [[Japan]]ese artists
 
Each of the computer designs of the early 1950s was a unique design; there were no upward-compatible machines or computer architectures with multiple, differing implementations. Programs written for one machine would not run on another kind, even other kinds from the same company. This was not a major drawback at the time because there was not a large body of software developed to run on computers, so starting programming from scratch was not seen as a large barrier.
==Images==
* [[Bert is Evil]] - [[Adobe Photoshop|Photoshopped]] pictures placing the [[Muppet]] Bert with questionable people and situations
* [[William Henry Gates III|Bill Gates]] Photos From 1985 - A [[Monkey Methods]] Blog entry hosts photos of Bill Gates. Though the blog says otherwise, these photos are publicity shots from 1985. Hundreds of sites now host these photos, such as [[Neowin]], [[Monkey Methods]], and [[eBaum's World]].
* [[Boilerplate (robot)|Boilerplate]]
* [[Bonsai Kitten]]
* [[Brian Peppers]], an unusually ugly sex offender.
* [[Ceciliantas]] - an ''[[EverQuest II]]'' player whose "cybering" was revealed by screenshots from an invisible player in the same room
* [[Clock Spider]] - who "ate" a clock and fought Limecat
* [[Dog poop girl]] - a woman's dog relieves itself on a subway car floor and she refuses to clean it up; the Internet hears about it and punishes her
* [[Every time you masturbate... God kills a kitten]]
* [[Everywhere girl]] Reported often in the [http://www.theinquirer.net/?article=24735 Inquirer] about the stock photo usage gone wild. A specific university girl image appears in various institution and company adverts.
* [[Fatmouse]] - a large mouse with large ambitions
* [[HA! HA! guy]] - a customizable image of a laughing Quaker minister
* [[Icy Hot Stuntaz]]
* Keep Sharpies Away From Mexicans [http://13gb.com/content/gallery/index.php?id=432] - A photo of two girls, of Mexican descent, who have drawn exceptionally dark and asymmetrical eyebrows on their faces.
* [[Limecat]] - a cat with a lime on its head (a.k.a. Meloncat)
* [[Moshzilla]] - a girl [[moshing]] at a show
* [[Mustard Man]] - picture of a supposed fast food employee dropping the mustard
* [[Ate My Balls|Mr. T Ate My Balls]] - a [[Yahoo!]] site with images of [[Mr. T]], captioned with various absurd and questionable statements. Repeatedly done with other subjects, both fictional and non-fictional. Spawned an entire Yahoo! category under Tasteless Humor &#8594; Ate My Balls.
* [[Oolong the Rabbit]] - a [[Japan]]ese rabbit whose owner placed various objects on top of its head (the most well-known being pancakes) and then posted pictures. Also known as Pancakebunny
* [[O RLY]]? - Various pictures (usually an owl) with "O RLY?" printed on the picture. Often used in conjunction with another picture and the letters "YA RLY"
* [[Phooning]] - Introduced by John Darrow in 1999. A [http://www.phoons.com phoon] is standing in a running position in an otherwise normal picture.
* [[Tourist guy]] (http://www.touristofdeath.com) - the same person [[Adobe Photoshop|Photoshopped]] into photos of different events, mostly disasters
* [[Tubcat]] - a very fat cat
 
The design freedom of the time was very important, for designers were very constrained by the cost of electronics, yet just beginning to explore how a computer could best be organized. Some of the basic features introduced during this period included [[index registers]] (on the [[Ferranti Mark I]]), a return-address saving instruction ([[UNIVAC I]]), immediate operands ([[IBM 704]]), and the detection of invalid operations ([[IBM 650]]).
==Websites==
* [[Stile Project]] [http://www.stileproject.com]
* [[Consumption Junction]] [http://consumptionjunction.com]
* [[Black People Love Us]] [http://www.blackpeopleloveus.com]
* [[Crying, while eating]] [http://cryingwhileeating.com] - Users submit 30 second videos of themselves crying while eating food.
* [[The Death Clock]] [http://www.deathclock.com] - Where you can personalize your clock for a countdown to your death
* [[Dickcream]] - a website containing various amusing video loops set to music, many of which have circulated around the Internet. [http://www.dickcream.com Click here for front page] and [http://dickcream.com/h/ here for song/video selection (click any number at top except date).]
* [[eBay]] auctions that are unusual, such as the 10 year old [[grilled cheese]] sandwich with a supposed semblance to the [[Virgin Mary]]
* [[Emotion Eric]] - a person that makes various emotional expressions by request
* [[Neurocam]] - [http://www.neurocam.com], art project / social experiment / life role play /mysterious unknown
* [[Newgrounds]] - [http://www.newgrounds.com], A site that allows users to submit Macromedia Flash animations, games and audio to the site
* [[Ninja Burger]] - ninja who deliver fast food
* [[T-Shirt Hell]] [http://www.tshirthell.com]
* [[Toothpaste for dinner]] - website featuring daily comics of a self-proclaimed "Inter-Net Superstar" known only as "drew". [http://www.toothpastefordinner.com See website here].
* [[Vin Diesel Fact Generator]] - A list of humorous "facts" about the actor [[Vin Diesel]]. [http://www.4q.cc/vin/]
* [[Webcomic|Webcomics]] - various comics and their characters have gained large followings. They come in multiple formats ranging from hand drawn illustrations to [[sprite comic]]s
* [[Wikipedia]]
* [[Y2Khai]] - One Loc'd Out Asian Going Crazy![http://www.y2khai.com]
* [[YTMND]] - a simple humorous website becomes a template for hundreds of similar sites and [[List of YTMND fads|many sub-memes]].
* Zefrank.com [http://www.zefrank.com] - A website that is a fun way to waste one's time with due to the many games, interactive toys, pictures and videos presented on it. Zefrank.com is known for its "How to Dance Properly" video. [http://www.zefrank.com/invite/swfs/index2.html]
* [[Zombo.com]] - parodies the dot-com boom
 
By the end of the [[1950]]s commercial builders had developed factory-constructed, truck-deliverable computers. The most widely installed computer was the [[IBM 650]], which used [[drum memory]] onto which programs were loaded using either [[punched tape|paper tape]] or [[punch card]]s. Some very high-end machines also included [[core memory]] which provided higher speeds. [[Hard disk]]s were also starting to become popular.
===Shock sites===
* [[Goatse.cx]], [[Lemonparty]] & [[Tubgirl]], are [[shock sites]] frequently linked from Internet forums and [[Internet Relay Chat|IRC]] channels
 
Computers are automatic [[Abacus|abaci]]. The type of number system affects the way they work. In the early [[1950s]] most computers were built for specific numerical processing tasks, and many machines used decimal numbers as their basic number system &ndash; that is, the mathematical functions of the machines worked in base-10 instead of base-2 as is common today. These were not merely [[binary coded decimal]]. The machines actually had ten vacuum tubes per digit in each [[Processor register|register]]. Some early [[Soviet Union|Soviet]] computer designers implemented systems based on ternary logic; that is, a bit could have three states: +1, 0, or -1, corresponding to positive, no, or negative voltage.
===Personal sites===
* [[Flying Spaghetti Monster|First United Church of the Flying Spaghetti Monster]] - [[parody religion]] set up to satirize [[intelligent design]].
* [[Fully Ramblomatic]] [http://www.fullyramblomatic.com (website here)] - Personal website of Benjamin '[[Yahtzee]]' Croshaw, a British teenager who rambles, reviews and entertains.
* [[Hello My Future Girlfriend]] [http://www.pr0k.net/thechilde/]
* [[Mahir Cagri]] - personal website of a Turkish man; has received mass adoration by fans, mainly for its overly enthusiastic text.
 
An early project for the [[U.S. Air Force]], [[BINAC]] attempted to make a lightweight, simple computer by using binary arithmetic. It deeply impressed the industry.
===Fan sites===
* ''[[Big Trouble in Little China]]'' - devoted to the [[Jack Burton]] movie.
* [[Real Ultimate Power]] - devoted to ninjas
 
As late as 1970, major computer languages such as "[[C_language|C]]" were unable to standardize their numeric behavior because decimal computers had groups of users too large to alienate.
===Blogs===
* [[The Best Page in the Universe]] – 100,000,000+ visits to a website operated by a pirate. The individual articles from this site often spread [[meme|memetically]].
* [[Currently Stationary]] - The original [[Engrish]] cult classic
* [[The Dullest Blog in the World]] - A very dull blog about the mundane happenings of a very dull person. [http://www.wibsite.com/wiblog/dull]
* [[Rachelle Waterman]] - The blog of a teenage girl who wrote "just to let everyone know, my mother was murdered," and was arrested shortly thereafter. Over 5,000 comments in her [[LiveJournal]] blog, before it was deleted. Mirrors still exist.
* [[Redneck Neighbor]] - one man's experiences with a bad neighbor [http://www.joespc.com/carlos/redneck.htm]
* [[Tucker Max]] - Millions of hits as well as a cult following to a "man's man," whose skill with alcohol, women, and witty insults is "supposedly" unmatched.
* [[PostSecret]] - An ongoing community art project where people mail-in their secrets anonymously on homemade postcards. [http://postsecret.blogspot.com]
 
Even when designers used a binary system, they still had many odd ideas. Some used sign-magnitude arthmetic (-1 = 10001), rather than modern [[two's complement]] arithmetic (-1 = 11111). Most computers used six-bit character sets, because they adequately encoded [[Hollerith]] cards. It was a major revelation to designers of this period to realize that the data word should be a multiple of the character size. They began to design computers with 12, 24 and 36 bit data words.
==Audio==
* [[Arnold Schwarzenegger prank calls]] - Various recorded prank calls (which spun off [[Macromedia Flash|Flash]] versions of these calls), from soundboards which were created from [[Arnold Schwarzenegger]]'s voice as a cop (Detective John Kimble) in the film ''[[Kindergarten Cop]]'' ([[1990]]).
* [[Asian Pride]] - a phenomenon which began with a [[rap music|rap]] song "[[Got Rice?]]" which represented [[Asia]] and the staple food of that continent ([[rice]]), with the backing instrumental being [[2Pac]]'s song "Changes". This spun off various songs representing Asian countries, sometimes credited to '''Asian Pride''', '''Azn Pride''' or '''AZN Pride''' and various "sequels" to "Got Rice?".
* [[Combo No. 5]] - a parody of "[[Mambo No. 5]]" by [[Lou Bega]] in a Chinese accent based on Chinese [[take-away]] meals
* The "[[Crazy Frog]]" [[ringtone]]/"The Annoying Thing" video which originated as a sound file spread on the Internet and used in Insanity Tests
* [[Kerpal]] - "You kicked my dog!" prank call
* [[Jared Smith|Jared: Butcher of Song]]
* [[MC Hawking]] - a text-to-speech rapper based off [[Stephen Hawking]]
* [[Schnappi|Schnappi das kleine Krokodil]] - a song about a crocodile sung by a very young German girl, where its huge commercial success would not have been possible without [[P2P]] networks.
* [[Tai Mai Shu]] - an Asian freestyle rapper
* [[Yatta]] - a song by the [[fig leaf]] wearing group Happa-tai that was also made into the animutation "Irrational Exuberance."
 
In this era, [[Grosch's law]] dominated computer design: Computer cost increased as the square of its speed.
==Text-based==
* [[All your base are belong to us]] - a phrase from the [[English language|English]] translation of the [[video game]] ''[[Zero Wing]]'', which later was adapted into a popular [[Macromedia Flash|Flash]] animation.
* [[Bash.org]] - A collection of online [[IRC]] [[chat log]]s at [http://www.bash.org bash.org]
* [[Bloodninja]] - a series of funny [[instant messaging|IM]] files of him messing with other people while Cybering
* [[Densha Otoko]] - online postings about a man who meets someone by saving her on a train, which was later adapted into a comic book and TV-movie
* [[First post]], participants strive to be the first person to add a comment (post) to a new article or discussion thread.
* [[Help I burned my girlfriends cooter]] or "HELP - I BURNED MY GIRLFRIENDSS COOTER" was the name of a joke created by a user named "ssl" on the [[IGN]] [[Vestibule]] [http://boards.ign.com/The_Vestibule/b5296/] [[Internet]] [[forum]] about a 22-year-old requesting assistance after burning a teenage girl's genitals. [http://www.dumpanimage.com/image.php?i=92F2riELXP A screenshot of the post can be found here.] Although the original thread was deleted, a slightly modified version of the post was entered into [http://www.tribalwar.com/forums/showthread.php?t=384959 this forum].
* [[Kurt Vonnegut]]'s supposed [[1997]] "wear sunscreen" commencement address at [[MIT]], widely circulated on the Internet. In fact, the commencement speaker at MIT in 1997 was [[Kofi Annan]] and the putative Vonnegut speech was an article published in the ''[[Chicago Tribune]]'' on [[June 1]], 1997 by columnist [[Mary Schmich]].
* [[Leet speak]] and [[AOL speak]] may also be considered forms of memetic Internet phenomenon.
* [[There Is No Cabal]] - a phrase used on [[Usenet]].
* [[Timecube]] - the profound yet incoherent rantings of a schizophrenic philosopher.
 
=== 1960s: the computer revolution and CISC ===
==Advertising==
* [[Anabukinchan]]
* [http://www.ilovebees.com ilovebees.com], A.K.A. the [[Haunted Apiary]], a viral marketing campaign/virtual reality game launched to advertise [[Halo 2]].
* The [[Six Flags]] adverts starring the Six Flags mascot [[Mr. Six]]. [http://www.thegreenhead.com/watercooler/2004/07/six-flags-dancing-old-guy-mr-six.php Article and clips can be found here.]
* The [[Spongmonkey]]s- bizarre creatures that sing, later used to advertise for [[Quiznos]]
* [[The Subservient Chicken]]- a [[Burger King]] promotional website that features a "live" chicken that can obey thousands of typed commands
* [[Whazzup?]] [http://www.bud-true.com/pages/movies7.htm] - [[Budweiser]] [[television commercial|commercial]] series that took a new life when it was parodied with the ''[[SuperFriends]]'' [http://whassup.com/spoofs/whassup.mpg] and [[Elián Wazzup]] [http://www.seanbonner.com/history.php] (which was also inspired by the [[Eli%C3%A1n Gonz%C3%A1lez]] debacle).
 
One major problem with early computers was that a program for one would not work on others. Computer companies found that their customers had little reason to remain loyal to a particular brand, as the next computer they purchased would be incompatible anyway. At that point price and performance were usually the only concerns.
==See also==
* [[List_of_clich%C3%A9s#Internet|Clichés on the Internet]]
* [[fad]]
 
In 1962, IBM tried a new approach to designing computers. The plan was to make an entire family of computers that could all run the same software, but with different performances, and at different prices. As users' requirements grew they could move up to larger computers, and still keep all of their investment in programs, data and storage media.
[[Category:Internet memes]]
 
In order to do this they designed a single ''reference computer'' called the '''[[System 360]]''' (or '''S/360'''). The System 360 was a virtual computer, a reference instruction set and capabilities that all machines in the family would support. In order to provide different classes of machines, each computer in the family would use more or less hardware emulation, and more or less [[microprogram]] emulation, to create a machine capable of running the entire System 360 [[instruction set]].
 
For instance a low-end machine could include a very simple processor for low cost. However this would require the use of a larger microcode emulator to provide the rest of the instruction set, which would slow it down. A high-end machine would use a much more complex processor that could directly process more of the System 360 design, thus running a much simpler and faster emulator.
 
IBM chose to make the reference [[instruction set]] quite complex, and very capable. This was a conscious choice. Even though the computer was complex, its "[[control store]]" containing the [[microprogram]] would stay relatively small, and could be made with very fast memory. Another important effect was that a single instruction could describe quite a complex sequence of operations. Thus the computers would generally have to fetch fewer instructions from the main memory, which could be made slower, smaller and less expensive for a given combination of speed and price.
 
As the S/360 was to be a successor to both scientific machines like the [[IBM 7090|7090]] and data processing machines like the [[IBM 1401|1401]], it needed a design that could reasonably support all forms of processing. Hence the instruction set was designed to manipulate not just simple binary numbers, but text, scientific floating-point (similar to the numbers used in a calculator), and the [[binary coded decimal]] arithmetic needed by accounting systems.
 
Almost all following computers included these innovations in some form. This basic set of features is now called a "[[complex instruction set computer]]," or CISC (pronounced "sisk"), a term not invented until many years later.
 
In many CISCs, an instruction could access either registers or memory, usually in several different ways.
This made the CISCs easier to program, because a programmer could remember just thirty to a hundred instructions, and a set of three to ten [[addressing mode]]s rather than thousands of distinct instructions.
This was called an "[[orthogonal instruction set]]."
The [[PDP-11]] and [[Motorola 68000]] architecture are examples of nearly orthogonal instruction sets.
 
There was also the ''BUNCH'' (Burroughs, Univac, NCR, CDC, and Honeywell) that competed against IBM at this time though IBM dominated the era with [[S/360]].
 
The Burroughs Corporation (which later became Unisys when they merged with Sperry/Univac) offered an alternative to S/360 with their [[Burroughs B5000|B5000]] series machines. The B5000 series [[1961]] had virtual memory, a multi-programming operating system (Master Control Program or MCP), written in [[ALGOL 60]], and the industry's first recursive-descent compilers as early as 1963.
 
=== 1970s: large scale integration ===
 
In the 1960s, the [[Apollo guidance computer]] and [[Minuteman missile]] made the [[integrated circuit]] economical and practical.
 
Around 1971, the first calculator and clock chips began to show that very small computers might be possible. The first [[microprocessor]] was the 4004, designed in 1971 for a calculator company, and produced by [[Intel]]. The 4004 is the direct ancestor of the [[Intel 80386]], even now maintaining some code compatibility. Just a few years later, the word size of the 4004 was doubled to form the 8008.
 
By the mid-1970s, the use of integrated circuits in computers was commonplace. The whole decade consists of upheavals caused by the shrinking price of transistors.
 
It became possible to put an entire CPU on a single printed circuit board. The result was that minicomputers, usually with 16-bit words, and 4k to 64K of memory, came to be commonplace.
 
CISCs were believed to be the most powerful types of computers, because their microcode was small and could be stored in very high-speed memory. The CISC architecture also addressed the "semantic gap" as it was perceived at the time. This was a defined distance between the machine language, and the higher level language people used to program a machine. It was felt that compilers could do a better job with a richer instruction set.
 
Custom CISCs were commonly constructed using "bit slice" computer logic such as the AMD 2900 chips, with custom microcode. A bit slice component is a piece of an [[ALU]], register file or microsequencer. Most bit-slice integrated circuits were 4-bits wide.
 
By the early 1970s, the [[PDP-11]] was developed, arguably the most advanced small computer of its day. Almost immediately, wider-word CISCs were introduced, the 32-bit [[VAX]] and 36-bit [[PDP-10]].
 
Also, to control a cruise missile, Intel developed a more-capable version of its 8008 microprocessor, the 8080.
 
IBM continued to make large, fast computers. However the definition of large and fast now meant more than a megabyte of RAM, clock speeds near one megahertz [http://www.hometoys.com/mentors/caswell/sep00/trends01.htm][http://research.microsoft.com/users/GBell/Computer_Structures_Principles_and_Examples/csp0727.htm], and tens of megabytes of disk drives.
 
IBM's System 370 was a version of the 360 tweaked to run virtual computing environments. The [[VM (Operating system) |virtual computer]] was developed in order to reduce the possibility of an unrecoverable software failure.
 
The Burroughs B5000/B6000/B7000 series reached its largest market share. It was a stack computer programmed in a dialect of Algol. It used 64-bit fixed-point arithmetic, rather than floating-point.
 
All these different developments competed madly for marketshare.
 
=== Early 1980s: the lessons of RISC ===
 
In the early [[1980s]], researchers at [[UC Berkeley]] and [[IBM]] both discovered that most computer language compilers and interpreters used only a small subset of the instructions of a [[CISC]]. Much of the power of the CPU was simply being ignored in real-world use. They realized that by making the computer simpler and less orthogonal, they could make it faster and less expensive at the same time.
 
At the same time, CPUs were growing faster in relation to the memory they addressed. Designers also experimented with using large sets of internal registers. The idea was to [[cache]] intermediate results in the registers under the control of the compiler.
This also reduced the number of [[addressing mode]]s and orthogonality.
 
The computer designs based on this theory were called [[Reduced Instruction Set Computer]]s, or RISC. RISCs generally had larger numbers of registers, accessed by simpler instructions, with a few instructions specifically to load and store data to memory. The result was a very simple core CPU running at very high speed, supporting the exact sorts of operations the compilers were using anyway.
 
A common variation on the RISC design employs the [[Harvard architecture]], as opposed to the [[Von Neumann architecture|Von Neumann]] or Stored Program architecture common to most other designs. In a Harvard Architecture machine, the program and data occupy separate memory devices and can be accessed simultaneously. In Von Neumann machines the data and programs are mixed in a single memory device, requiring sequential accessing which produces the so-called "Von Neumann bottleneck."
 
One downside to the RISC design has been that the programs that run on them tend to be larger. This is because [[compiler]]s have to generate longer sequences of the simpler instructions to accomplish the same results. Since these instructions need to be loaded from memory anyway, the larger code size offsets some of the RISC design's fast memory handling.
 
Recently, engineers have found ways to compress the reduced instruction sets so they fit in even smaller memory systems than CISCs. Examples of such compression schemes include [[ARM architecture|the ARM]]'s "Thumb" instruction set. In applications that do not need to run older binary software, compressed RISCs are coming to dominate sales.
 
Another approach to RISCs was the "[[niladic]]" or "zero-address" instruction set. This approach realized that the majority of space in an instruction was to identify the operands of the instruction. These machines placed the operands on a push-down (last-in, first out) [[stack (computing)|stack]]. The instruction set was supplemented with a few instructions to fetch and store memory. Most used simple caching to provide extremely fast RISC machines, with very compact code. Another benefit was that the interrupt latencies were extremely small, smaller than most CISC machines (a rare trait in RISC machines). The first zero-address computer was developed by [[Chuck Moore|Charles Moore]]. It placed six 5-bit instructions in a 32-bit word, and was a precursor to [[VLIW]] design (see below: 1990 to Today).
 
Commercial variants were mostly characterized as "[[FORTH]]" machines, and probably failed because that language became unpopular. Also, the machines were developed by defense contractors at exactly the time that the cold war ended. Loss of funding may have broken up the development teams before the companies could perform adequate commercial marketing.
 
RISC chips now dominate the market for 32-bit embedded systems. Smaller RISC chips are even becoming common in the cost-sensitive 8-bit embedded-system market. The main market for RISC CPUs has been systems that require low power or small size.
 
Even some CISC processors (based on architectures that were created before RISC became dominant) translate instructions internally into a RISC-like instruction set. These CISC chips include newer [[X86|x86]] and [[VAX]] models.
 
These numbers may surprise many, because the "market" is perceived to be desktop computers. With Intel x86 designs dominating the vast majority of all desktop sales, RISC is found only in the [[Apple Computer|Apple]] desktop computer lines. However, desktop computers are only a tiny fraction of the computers now sold. Most people own more computers in embedded systems in their car and house than on their desks.
 
=== Mid-1980s to today: exploiting instruction level parallelism ===
 
In the mid-to-late 1980s, designers began using a technique known as '''[[instruction pipelining]]''', in which the processor works on multiple instructions in different stages of completion. For example, the processor may be retrieving the operands for the next instruction while calculating the result of the current one. Modern CPUs may use over a dozen such stages.
 
A similar idea, introduced only a few years later, was to execute multiple instructions in parallel on separate arithmetic-logic units ([[ALU]]s). Instead of operating on only one instruction at a time, the CPU will look for several similar instructions that are not dependent on each other, and execute them in parallel. This approach is known as [[superscalar]] processor design.
 
Such techniques are limited by the degree of [[instruction level parallelism]] (ILP), the number of non-dependent instructions in the program code. Some programs are able to run very well on superscalar processors due to their inherent high ILP, notably graphics. However more general problems do not have such high ILP, thus making the achievable speedups due to these techniques to be lower.
 
Branching is one major culprit. For example, the program might add two numbers and branch to a different code segment if the number is bigger than a third number. In this case even if the branch operation is sent to the second ALU for processing, it still must wait for the results from the addition. It thus runs no faster than if there were only one ALU. The most common solution for this type of problem is to use a type of [[branch prediction]].
 
To further the efficiency of multiple functional units which are available in superscalar designs, operand register dependencies was found to be another limiting factor. To minimize these dependencies, [[out-of-order execution]] of instructions was introduced. In such a scheme, the instruction results which complete out-of-order must be re-ordered in program order by the processor for the program to be restartable after an exception. ''Out-of-Order'' execution was the main advancement of the computer industry during the [[1990s]].
A similar concept is [[speculative execution]], where instructions from both sides of a branch are executed at the same time, and the results of one side or the other are thrown out once the branch answer is known.
 
These advances, which were originally developed from research for RISC-style designs, allow modern CISC processors to execute twelve or more instructions per clock cycle, when traditional CISC designs could take twelve or more cycles to execute just one instruction.
 
The resulting instruction scheduling logic of these processors is large, complex and difficult to verify. Furthermore, the higher complexity requires more transistors, increasing power consumption and heat. In this respect RISC is superior because the instructions are simpler, have less interdependence and make superscalar implementations easier. However, as Intel has demonstrated, the concepts can be applied to a CISC design, given enough time and money.
 
:Historical note: Some of these techniques (e.g. pipelining) were originally developed in the late [[1950s]] by [[International Business Machines|IBM]] on their [[IBM 7030|Stretch]] mainframe computer.
 
=== 1990 to today: looking forward ===
 
====VLIW and EPIC====
 
The instruction scheduling logic that makes a superscalar processor is just boolean logic. In the early 1990s, a significant innovation was to realize that the coordination of a multiple-ALU computer could be moved into the [[compiler]], the software that translates a programmer's instructions into machine-level instructions.
 
This type of computer is called a '''[[very long instruction word]]''' (VLIW) computer.
 
Statically scheduling the instructions in the compiler (as opposed to letting the processor do the scheduling dynamically) has many practical advantages over doing so in the CPU.
 
Oddly, speed is not one of them. With enough transistors, the CPU could do everything at once. However all those transistors make the chip larger, and therefore more expensive. The transistors also use power, which means that they generate heat that must be removed. The heat also makes the design less reliable.
 
Since compiling happens only once on the developer's machine, the control logic is "canned" in the final realization of the program. This means that it consumes no transistors, and no power, and therefore is free, and generates no heat.
 
The resulting CPU is simpler, and runs at least as fast as if the scheduling were in the CPU.
 
There were several unsuccessful attempts to commercialize VLIW. The basic problem is that a VLIW computer does not scale to different price and performance points, as a dynamically scheduled computer can.
 
Also, VLIW computers optimise for throughput, not low latency, so they were not attractive to the engineers designing controllers and other computers embedded in machinery. The [[embedded system]]s markets had often pioneered other computer improvements by providing a large market that did not care about compatibility with older software.
 
In January [[2000]], a company called [[Transmeta]] took the interesting
step of placing a compiler in the central processing unit, and making the compiler translate from a reference byte code (in their case, [[x86]] instructions) to an internal VLIW instruction set. This approach
combines the hardware simplicity, low power and speed of VLIW RISC with
the compact main memory system and software reverse-compatibility provided
by popular CISC.
 
[[Intel]] released a chip, called the [[Itanium]], based on what they call an [[Explicitly Parallel Instruction Computing]] (EPIC) design. This design supposedly provides the VLIW advantage of increased instruction throughput. However, it avoids some of the issues of scaling and complexity, by explicitly providing in each "bundle" of instructions information concerning their dependencies. This information is calculated by the compiler, as it would be in a VLIW design. The early versions are also backward-compatible with current [[x86]] software by means of an on-chip [[emulation]] mode. Integer performance has been disappointing as have sales in volume markets.
 
====Multi-threading====
 
Also, we may soon see multi-threaded CPUs. Current designs work best when the computer is running only a single program, however nearly all modern [[operating system]]s allow the user to run multiple programs at the same time. For the CPU to change over and do work on another program requires expensive [[context switching]]. In contrast, a multi-threaded CPU could handle instructions from multiple programs at once.
 
To do this, such CPUs include several sets of registers. When a context switch occurs, the contents of the "working registers" are simply copied into one of a set of registers for this purpose.
 
Such designs often include thousands of registers instead of hundreds as in a typical design. On the downside, registers tend to be somewhat expensive in chip space needed to implement them. This chip space might otherwise be used for some other purpose.
 
====Reconfigurable logic====
 
Another track of development is to combine reconfigurable logic with a general-purpose CPU. In this scheme, a special computer language compiles fast-running subroutines into a bit-mask to configure the logic. Slower, or less-critical parts of the program can be run by sharing their time on the CPU. This process has the capability to create devices such as software [[radio]]s, by using digital signal processing to perform functions usually performed by analog [[electronics]].
 
====Public ___domain processors====
 
As the lines between hardware and software increasingly blur due to progress in design methodology and availability of chips such as [[FPGA]]s and cheaper production processes, even [[open source hardware]] has begun to appear. Loosely-knit communities like [[OpenCores]] have recently announced completely open CPU architectures such as the [[OpenRISC]] which can be readily implemented on FPGAs or in custom produced chips, by anyone, without paying license fees.
 
====High end processor economics====
 
Developing new, high-end CPUs is a '''very''' expensive proposition. Both the logical complexity (needing very large logic design and logic verification teams and simulation farms with perhaps thousands of computers) and the high operating frequencies (needing large circuit design teams and access to the state-of-the-art fabrication process) account for the high cost of design for this type of chip. The design cost of a high-end CPU will be on the order of US $100 million. Since the design of such high-end chips nominally take about five years to complete, to stay competitive a company has to fund at least two of these large design teams to release products at the rate of 2.5 years per product generation. Only the personal computer mass market (with production rates in the hundreds of millions, producing billions of dollars in revenue) can support such economics. As of 2004, only four companies are actively designing and fabricating state of the art general purpose computing CPU chips: [[Intel]], [[AMD]], [[IBM]] and [[Fujitsu]]. [[Motorola]] has spun off its semiconductor division as [[Freescale]] as that division was dragging down profit margins for the rest of the company. [[Texas Instruments]], [[TSMC]] and [[Toshiba]] are a few examples of a companies doing manufacturing for another company's CPU chip design.
 
== Embedded design ==
 
The majority of computer systems in use today are embedded in other machinery, such as telephones, clocks, appliances, vehicles, and infrastructure. An [[embedded system]] usually has minimal requirements for memory and program length and may require simple but unusual input/output systems. For example, most embedded systems lack keyboards, screens, disks, printers, or other recognizable I/O devices of a personal computer. They may control electric motors, relays or voltages, and reed switches, variable resistors or other electronic devices. Often, the only I/O device readable by a human is a single light-emitting diode, and severe cost or power constraints can even eliminate that.
 
In contrast to general-purpose computers, embedded systems often seek to minimize [[interrupt latency]] over instruction throughput.
 
When an electronic device causes an interrupt, the intermediate results, the registers, have to be saved before the software responsible for handling the interrupt can run, and then must be put back after it is finished. If there are more registers, this saving and restoring process takes more time, increasing the latency.
 
Low-latency CPUs generally have relatively few registers in their central processing units, or they have "shadow registers" that are only used by the interrupt software.
 
=== Other design issues ===
 
<!-- [[virtual memory]] moved to [[Computer architecture]] -->
 
One interesting near-term possibility would be to eliminate the bus. Modern vertical [[laser diode]]s enable this change. In theory, an optical computer's components could directly connect through a holographic or phased open-air switching system. This would provide a large increase in effective speed and design flexibility, and a large reduction in cost. Since a computer's connectors are also its most likely failure point, a busless system might be more reliable, as well.
 
Another farther-term possibility is to use light instead of electricity for the digital logic itself.
In theory, this could run about 30% faster and use less power, as well as permit a direct interface with quantum computational devices.
The chief problem with this approach is that for the foreseeable future, electronic devices are faster, smaller (i.e. cheaper) and more reliable.
An important theoretical problem is that electronic computational elements are already smaller than some wavelengths of light, and therefore even wave-guide based optical logic may be uneconomic compared to electronic logic.
We can therefore expect the majority of development to focus on electronics, no matter how unfair it might seem.
See also [[optical computing]].
 
Yet another possibility is the "clockless CPU" (asynchronous CPU). Unlike conventional processors, clockless processors have no central clock to coordinate the progress of data through the pipeline.
Instead, stages of the CPU are coordinated using logic devices called "pipe line controls" or "FIFO sequencers." Basically, the pipeline controller clocks the next stage of logic when the existing stage is complete. In this way, a central clock is unnecessary. There are two advantages to clockless CPUs over clocked CPUs:
* components can run at different speeds in the clockless CPU. In a clocked CPU, no component can run faster than the clock rate.
* In a clocked CPU, the clock can go no faster than the worst-case performance of the slowest stage. In a clockless CPU, when a stage finishes quicker than normal, the next stage can immediately take the results rather than waiting for the next clock tick. A stage might finish quicker than normal because of the particular data inputs (multiplication can be very fast if it is multiplying by 0 or 1), or because it is running at a higher voltage or lower temperature than normal.
 
Two examples of asynchronous CPUs are the [[ARM_architecture|ARM]]-implementing [[AMULET_microprocessor|AMULET]] and the asynchronous implementation of [[MIPS_architecture|MIPS]] R3000, dubbed [http://www.async.caltech.edu/mips.html MiniMIPS].
 
The biggest disadvantage of the clockless CPU is that most CPU design tools assume a clocked CPU, so making a clockless CPU involves modifying the design tools to handle clockless logic and doing extra testing to ensure the design avoids [[Metastability in electronics|metastable]] problems. For example, the group that designs the aforementioned AMULET developed a tool called [http://www.cs.man.ac.uk/apt/projects/tools/lard/ LARD] to cope with the complex design of AMULET3.
 
==Design concepts==
In general, all processors, micro or otherwise, run the same sort of task over and over:
 
#read an instruction and decode it
#find any associated data that is needed to process the instruction
#process the instruction
#write the results out
 
Complicating this simple-looking series of events is the fact that [[main memory]] has always been slower than the processor itself. Step (2) often introduces a lengthy (in CPU terms) delay while the data arrives over the [[computer bus]]. A considerable amount of research has been put into designs that avoid these delays as much as possible. This often requires complex circuitry and was at one time found only on hand-wired [[supercomputer]] designs. However, as the manufacturing processes have improved, they have become a common feature of almost all designs.
 
===RISC===
The basic concept of [[RISC]] is to clearly identify what step 2 does. In older processor designs, now retroactively known as [[CISC]], the instructions were offered in a number of different modes that meant that step 2 took an unknown length of time to complete. In RISC, almost all instructions come in exactly one mode that reads data from one place -- the registers. These ''addressing modes'' are then handled by the [[compiler]], which writes code to load the data into the registers and store it back out. For this reason the term '''load-store''' is often used to describe this philosophy in design; there are many processors with limited instruction sets that are not really RISC.
 
The side effect of this change is twofold. One is that the resulting logic core is much smaller, largely by making step 1 and 2 much simpler. Secondly it means that step 2 always takes one cycle, also reducing the complexity of the overall chip design which would otherwise require complex "locks" that ensure the processor completes one instruction before starting the other. For any given level of performance, a RISC design will have a much smaller "gate count" (number of transistors), the main driver in overall cost -- in other words a fast RISC chip is much cheaper than a fast CISC chip.
 
The downside is that the program gets much longer as a side effect of the compiler having to write out explicit instructions for memory handling, the "code density" is lower. This increases the number of instructions that have to be read over the computer bus. When RISC was first being introduced there were arguments that the increased bus access would overwhelm the speed, and that such designs would actually be slower. In theory this might be true, but the real reason for RISC was to allow [[instruction pipeline]]s to be built much more easily.
 
===Instruction pipelining===
One of the first, and most powerful, techniques to improve performance is the [[instruction pipeline]]. Early microcoded designs would carry out all of the steps above for one instruction before moving onto the next. Large portions of the circuitry were left idle at any one step, for instance, the instruction decoding circuitry would be idle during execution and so on.
 
Pipelines improve performance by allowing a number of instructions to work their way through the processor at the same time. In the same basic example, the processor would start to decode (step 1) a new instruction while the last one was waiting for results. This would allow up to four instructions to be "in flight" at one time, making the processor look four times as fast. Although any one instruction takes just as long to complete, there's still four steps, the CPU as a whole "retires" instructions much faster and can be run at a much higher clock speed.
 
RISC make pipelines smaller, and much easier to construct by cleanly separating each stage of the instruction process and making them take the same amount of time -- one cycle. The processor as a whole operates in an [[assembly line]] fashion, with instructions coming in one side and results out the other. Due to the reduced complexity of the [[Classic RISC pipeline]], the pipelined core and an instruction cache could be placed on the same size die that would otherwise fit the core alone on a CISC design. This was the real reason that RISC was faster, early designs like the [[SPARC]] and [[MIPS architecture|MIPS]] often running over 10 times as fast as [[Intel]] and [[Motorola]] CISC solutions at the same clock speed and price.
 
Pipelines are by no means limited to RISC designs. By 1986 the top-of-the-line VAX (the 8800) was a heavily pipelined design, slightly predating the first commercial MIPS and SPARC designs. Most modern CPUs (even embedded CPUs) are now pipelined, and microcoded CPUs with no pipelining are seen only in the most area-constrained embedded processors. Large CISC machines, from the VAX 8800 to the modern Pentium 4 and Athlon, are implemented with both microcode and pipelines. Improvements in pipelining and caching are the two major microarchitectural advances that have enabled processor performance to keep pace with the circuit technology on which they are based.
 
===Speculative execution===
One problem with an instruction pipeline is that there are a class of instructions that must make their way entirely through the pipeline before execution can continue. In particular, conditional branches need to know the result of some prior instruction before "which side" of the branch to run is known. For instance, an instruction that says "if x is larger than 5 then do this, otherwise do that" will have to wait for the results of x to be known before it knows if the instructions for this or that can be fetched.
 
For a small four-deep pipeline this means a delay of up to three cycles -- the decode can still happen. But as clock speeds increase the depth of the pipeline increases with it, and modern processors may have 20 stages or more. In this case the CPU is being stalled for the vast majority of its cycles every time one of these instructions is encountered.
 
The solution, or one of them, is ''[[speculative execution]]'', also known as ''branch prediction''. In reality one side or the other of the branch will be called much more often than the other, so it is often correct to simply go ahead and say "x will likely be smaller than five, start processing that". If the prediction turns out to be correct, a huge amount of time will be saved. Modern designs have rather complex prediction systems, which watch the results of past branches to predict the future with greater accuracy.
 
===Cache===
It was not long before improvements in chip manufacturing allowed for even more circuitry to be placed on the die, and designers started looking for ways to use it. One of the most common was to add an ever-increasing amount of [[CPU cache|cache memory]] on-die. Cache is simply very fast memory, memory that can be accessed in a few cycles as opposed to "many" needed to talk to main memory. The CPU includes a cache controller which automates reading and writing from the cache, if the data is already in the cache it simply "appears", whereas if it is not the processor is "stalled" while the cache controller reads it in.
 
RISC designs started adding cache in the mid-to-late 1980s, often only 4k in total. This number grew over time, and modern CPU's typically include about 512kbytes, while CPU's intended for server use come with 1 or 2 Mbytes. Generally speaking, more cache means more speed.
 
===Out-of-order execution===
Use of cache also introduces a new delay when the data asked for by the CPU is not already in the cache. In early designs this would force the cache controller to stall the processor and wait. Of course there may be some other instruction in the program whose data ''is'' available in the cache at that point. [[out-of-order execution]] allows that instruction to be processed while the processor waits on the cache, then re-orders the results to make it appear that everything happened in the normal order.
 
===Superscalar designs===
Even with all of the added complexity and gates needed to support the concepts outlined above, chip manufacturing had soon made even them have room left over. This led to the rise of [[superscalar]] processors in the early 1990s, processors that could run more than one instruction at once.
 
In the outline above the processor runs parts of a single instruction at a time. If one were simply to place two entire cores on a die, then the processor would be able to run two instructions at once. However this is not actually required, as in the average program certain instructions are much more common than others. For instance, the load-store instructions on a RISC design are more common than [[floating point]], so building two complete cores isn't as efficient a use of space as building two load-store units and only one floating point.
 
In modern designs it is common to find two load units, one store (many instructions have no results to store), two or more integer math units, two or more floating point units, and often a [[SIMD]] unit of some sort. The decoder grows in complexity by reading in a huge list of instructions from memory and handing them off to the different units that are idle at that point. The results are then collected and re-ordered at the end, as in out-of-order.
 
===Simultaneous multithreading===
One of the newest techniques in high-speed processor design is [[simultaneous multithreading]]. Oddly it may have been easier to add this in the past than some of the other techniques described above.
 
The cache controller knows where in main memory any piece of data came from. It therefore "knows" that different data in the cache are actually from different programs entirely, a side effect of modern [[computer multitasking|multitasking]] [[operating system]]s. In simultaneous multithreading designs, the cache controller will not look just for the instruction that is ready, but the program (or thread) that is "most ready". This can be quite effective in many cases, as programs often switch between handling data and processing, simultaneous multithreading can make more effecient use of the various units in these cases by going out and finding entirely different programs to run while the "running one" waits for data.
 
== See also ==
 
* [[Microprocessor]]
* [[Moore's Law]]
* [[Amdahl's law]]
* [[Simultaneous multithreading]]
* [[RISC]]
* [[CISC]]
 
[[Category:Computer architecture]]
 
[[nl:Processorarchitectuur]]