Compressed instruction set: Difference between revisions

Browse history interactively

← Previous edit

Content deleted Content added

VisualWikitext

Revision as of 09:54, 6 July 2021 edit Narky Blert (talk \| contribs) Autopatrolled, Extended confirmed users, Page movers 419,246 edits Link to DAB page repaired ← Previous edit		Latest revision as of 02:25, 28 February 2025 edit undo RastaKins (talk \| contribs) Extended confirmed users 2,129 edits →Concept: 8086 instructions can be up to six bytes, not including segment override.
(16 intermediate revisions by 9 users not shown)
Line 1: {{short description\|~~Variation~~Compact onformat aof microprocessor~~'s instruction set architecture (ISA) that allows~~ instructions ~~to be represented in a more compact format~~}} A '''compressed instruction set''', or simply '''compressed instructions''', are a variation on a [[microprocessor]]'s [[instruction set architecture]] (ISA) that allows instructions to be represented in a more compact format. In most real-world examples, compressed instructions are 16 bits long in a processor that would otherwise use 32-bit instructions. ItThe is16-bit ~~distinct~~ISA ~~from~~is ~~the~~a ~~concept~~subset of ~~[[variable~~the ~~length~~full ~~instructions]]~~32-bit ISA, ~~where~~not ~~any~~a separate instruction ~~can~~set. ~~have~~The ~~any~~smaller ~~length;~~format inrequires asome ~~compressed~~tradeoffs: ~~set~~generally, ~~the~~there are fewer instructions ~~are~~available, ~~alternative~~and ~~forms~~fewer ~~for~~[[processor ~~existing~~register]]s ~~32-bit~~can be ~~versions~~used. The concept was originally introduced ~~in the 1980s~~ by [[Hitachi]] as a way to improve the [[code density]] of their [[SuperH]] [[RISC]] processor design ~~and~~as ~~thereby~~it ~~allow~~moved ~~programs~~from 16-bit to ~~run~~32-bit instructions in ~~smaller~~the ~~amounts~~SH-5 version. The new design had two instruction sets, one giving access to the entire ISA of ~~[[main~~the ~~memory]].~~new ~~Today~~design, ~~these~~and a smaller ~~instructions~~16-bit ~~are~~set known as SHcompact that allowed programs to run in smaller amounts of [[main memory]]. As the memory of even the smallest systems is now [[order-of-magnitude\|orders of magnitude]] larger than the systems that spawned the concept, ~~this~~size is no longer the main concern;. ~~today~~Today the advantage is that it reduces the number of accesses to main memory and thereby reduces energy use in [[mobile device]]s~~. Hitachi's patents were licensed by [[Arm Ltd.]] for their processors, where it was known as "Thumb". Similar systems are found in MIPS16e and Power ISA~~. Hitachi's patents were licensed by [[Arm Ltd.]] for their processors, where it was known as "Thumb". Similar systems are found in MIPS16e and [[PowerPC]] VLE. The original patents have expired and the concept can be found in a number of modern designs, including [[RISC-V]], which was designed from the outset to use it. The introduction of [[64-bit computing]] has led to the term no longer being as widely used; these processors generally use 32-bit instructions and are technically a form of compressed ISA, but as they are mostly modified versions of an older ~~32-bit~~ ISA from a 32-bit version of the same processor family; there is no real compression~~. The original patents have expired and the concept can be found in a number of modern designs, including the [[RISC-V]], which was designed from the outset to use it~~. ==Concept== [[Microprocessor]]s encode their instructions as a series of [[bit]]s, normally divided into a number of 8-bit [[byte]]s. For instance, in the [[MOS 6502]], the {{code\|ADC}} instruction performs binary addition between an operand value and the value already stored in the [[Accumulator (computing)\|accumulator]]. There are a variety of places the processor might find the operand; it might be located in [[main memory]], or in the special [[zero page]], or be an explicit constant like "10". Each of these variations used a different 8-bit instruction, or [[opcode]]; if one wanted to add the constant 10 to the accumulator the instruction would be encoded in memory as {{code\|$69 $0A}}, with ~~the hexadecimal~~ $0A being [[hexadecimal]] for ~~10,~~the ~~whereas~~decimal value 10. ifIf it was instead adding the value stored in main memory at ___location $4400, it would be {{code\|$6D $~~0044~~00 $44}}, with a [[Endianness\|little -endian]] address.{{sfn\|Verts\|2004}} Note that the second instruction requires three bytes because the memory address is 16- bits long. Depending on the instruction, it might use one, two, or three bytes.{{sfn\|Verts\|2004}} This is now known as a [[variable length instruction set]], although that term was not common at the time as most processors, including [[mainframe]]s and [[minicomputer]]s, normally used some variation of this concept. Even in the late 1970s, as microprocessors began to move from 8-bit formats to 16, this concept remained common; the [[Intel ~~8088~~8086]] continued to use 8-bit opcodes which could be followed by zero to ~~six~~five additional bytes depending on the [[addressing mode]].<ref>{{cite magazine \|title=Understanding ARM Architectures \|url=https://www.informit.com/articles/article.aspx?p=1620207&seqNum=3 \|date=23 August 2010 \|website=informIT}}</ref> It was during the move to 32-bit systems, and especially as the [[RISC]] concept began to take over processor design, that variable length instructions began to go away. In the [[MIPS architecture]], for instance, all instructions are a single 32-bit value, with a 6-bit opcode in the [[most significant bit]]s and the remaining 26 bits used in various ways representing its limited set of addressing modes. Most RISC designs are similar. Moving to a fixed-length instruction format was one of the key design concepts behind the performance of early RISC designs; in earlier systems the instruction might take one to six memory cycles to read, requiring wiring between various parts of the logic to ensure the processor didn't attempt to perform the instruction before the data was ready. In RISC designs, operations normally take one cycle, greatly simplifying the decoding. The savings in these interlocking circuits is instead applied to additional logic or adding [[processor register]]s, which have a direct impact on performance.<ref>{{cite web \|url=http://www.cs.uwm.edu/classes/cs315/Bacon/Lecture/HTML/ch05s07.html \|title=MIPS Instruction Code Formats \|website=Computer Science 315 Lecture Notes \|first=Jason \|last=Bacon \|access-date=2021-04-09 \|archive-date=2019-07-17 \|archive-url=https://web.archive.org/web/20190717133605/http://www.cs.uwm.edu/classes/cs315/Bacon/Lecture/HTML/ch05s07.html \|url-status=dead }}</ref> ===Code density=== The downside to the RISC approach is that many instructions simply do not require four bytes. For instance, the [[Logical shift\|Logical Shift Left]] instruction shifts the bits in a register to the left~~, and only requires the instruction opcode and a register number~~. In the 6502, which has only a single arithmetic register A, ~~and~~this ~~thus~~instruction ~~does~~can ~~not~~be ~~even~~represented ~~need~~entirely aby ~~register~~its ~~number,~~8-bit ~~the~~opcode {{code\|~~ASL~~$06}} ~~instruction takes up one byte~~.{{sfn\|Verts\|2004}} InOn processors with ~~theory~~more registers, ~~MIPS~~all ~~needs~~that ~~only~~is aneeded ~~6-bit~~is the opcode and ~~a 5-bit~~ register number, ~~for~~another ~~this~~4 ~~same~~or ~~operation,~~5 ~~and~~bits. inOn ~~theory~~MIPS, ~~could~~for ~~fit~~instance, the instruction inneeds only a 166-bit opcode and a 5-bit ~~value~~register number. But as is the case for most RISC designs, the instruction still takes up a full 32- bits. As these sorts of instructions are relatively common, RISC programs generally take up more memory than the same program on a variable length processor.{{sfn\|Weaver\|McKee\|2009}} One notable, and particularly early, exception amongst RISC designs is the [[IBM 801]] architecture which maintains five instruction formats: two utilising a 16-bit instruction length, and three utilising a 32-bit instruction length.<ref name="ibm_801_system_architecture">{{ cite book \| url=http://bitsavers.org/pdf/ibm/system801/System_801_Principles_of_Operation_Jan76.pdf \| title=System 801 Principles of Operation \| publisher=International Business Machines \| date=16 January 1976 \| access-date=11 November 2024 }}</ref>{{rp\|pages=10}} For instructions requiring less space, such as shift instructions employing only register operands, the shorter 16-bit instruction formats are used.<ref name="ibm_801_system_architecture"/>{{rp\|pages=51–58}} In the 1980s, when the RISC concept was first emerging, this was a common point of complaint. As the instructions took up more room, the system would have to spend more time reading instructions from memory. It was suggested these extra accesses might actually slow the program down. Extensive [[benchmarking]] eventually demonstrated RISC was faster in almost all cases, and this argument faded. However, there are cases where memory use remains a concern regardless of performance, and that is in small systems and embedded applications. Even in the early 2000s, the price of [[DRAM]] was enough that cost-sensitive devices had limited memory. It was for this market that [[Hitachi]] developed the [[SuperH]] design.<ref>{{cite web \|url=http://resource.renesas.com/lib/eng/e_learnig/sh4/02/index.html \|title=Effects of 16-bit instructions \|website=Renesas}}</ref>▼ ▲In the 1980s, when the RISC concept was first emerging, ~~this~~increased program size was a common point of complaint. As the instructions took up more room, the system would have to spend more time reading instructions from memory. It was suggested these extra accesses might actually slow the program down. Extensive [[benchmarking]] eventually demonstrated RISC was faster in almost all cases, and this argument faded. However, there are cases where memory use remains a concern regardless of performance, and that is in small systems and embedded applications. Even in the early 2000s, the price of [[DRAM]] was enough that cost-sensitive devices had limited memory. It was for this market that [[Hitachi]] developed the [[SuperH]] design.<ref>{{cite web \|url=http://resource.renesas.com/lib/eng/e_learnig/sh4/02/index.html \|title=Effects of 16-bit instructions \|website=Renesas}}</ref> In the earlier SuperH designs, SH-1 through SH-4, instructions always take up 16-bits. The resulting instruction set has real-world limitations; for instance, it can only perform two-operand math of the form {{code\|A {{=}} A + B}}, whereas most processors of the era allowed {{code\|A {{=}} B + C}}, the three-operand format. By removing one operand, four bits are removed from the instruction (there are 16 registers, needing 4 bits), although this is at the cost of making math code somewhat more complex to write. For the markets targeted by the SuperH, this was an easy tradeoff to make. A significant advantage of the 16-bit format is that the [[instruction cache]] now holds twice as many instructions for any given amount of [[Static random-access memory\|SRAM]]. This allows the system to perform at higher speeds, although some of that might be mitigated by the use of additional instructions needed to perform operations that might be performed by a single 3-operand instruction.{{sfn\|SuperH\|1996}}▼ ▲In the earlier SuperH designs, SH-1 through SH-4, instructions always take up 16- bits. The resulting instruction set has real-world limitations; for instance, it can only perform two-operand math of the form {{code\|A {{=}} A + B}}, whereas most processors of the era ~~allowed~~used the three-operand format, {{code\|A {{=}} B + C}}~~, the three-operand format~~. By removing one operand, four bits are removed from the instruction (there are 16 registers, needing 4 bits), although this is at the cost of making math code somewhat more complex to write. For the markets targeted by the SuperH, this was an easy tradeoff to make. A significant advantage of the 16-bit format is that the [[instruction cache]] now holds twice as many instructions for any given amount of [[Static random-access memory\|SRAM]]. This allows the system to perform at higher speeds, although some of that might be mitigated by the use of additional instructions needed to perform operations that might be performed by a single 3-operand instruction.{{sfn\|SuperH\|1996}} For the SH-5, Hitachi moved to a 32-bit instruction format. In order to provide [[backward compatibility]] with their earlier designs, they included a second instruction set, SHcompact. SHcompact mapped the original 16-bit instructions one-way onto the internal 32-bit instruction; it did not perform multiple instructions as would be the case in earlier [[microcode]]d processors, it was simply a smaller format for the same instruction. This allowed the original small-format programs to be easily ported to the new SH-5, while adding little to the complexity of the [[instruction decoder]].<ref>{{cite book \|title=SH-5 CPU Core, Volume 1: Architecture \|url=http://Flars.nocrew.org/computers/processors/SuperH/cpush5v1.pdf \|page=8}}</ref> ARM licensed a number of Hitachi's patents on aspects of the instruction design and used them to implement their Thumb instructions. ARM processors with a "T" in the name included this instruction set in addition to their original 32-bit versions, and could be switched from 32- to 16-bit mode on the fly using the {{code\|BX}} command. When in Thumb mode, only the top eight registers of the ARM's normal sixteen registers are visible, but these are the same registers as in 32-bit mode and thus data can be passed between Thumb and normal code using those registers. Every Thumb instruction was a counterpart of a 32-bit version, so Thumb was a strict subset of the original ISA.{{sfn\|Lemieux\|2004}} One key difference between ARM's model and SuperH is that Thumb retains some three-operand instructions in the 16-bit format, which it accomplished by reducing the visible register file to eight, so only 3- bits are required to select a register.<ref>{{cite web \|url=https://developer.arm.com/documentation/ddi0210/c/Introduction/Instruction-set-summary/Thumb-instruction-summary?lang=en \|title=Thumb instruction summary \|website=ARM7TDMI Technical Reference Manual}}</ref> The [[MIPS architecture]] also added a similar compressed set in their MIPS16e, which is very similar to Thumb. It too allows only eight registers to be used, although these are not simply the first eight; the MIPS design uses register 0 as the [[zero register]], so registers 0 and 1 in 16-bit mode are instead mapped onto MIPS32 registers 16 and 17. Most other details of the system are similar to Thumb.<ref>{{cite book \|title=MIPS16e2 Application-Specific Extension Technical Reference Manual \|date=26 April 2016 \|publisher=MIPS}}</ref> Likewise, the latest version of the [[Power ISA]], formerly [[PowerPC]], include the "VLE" instructions which are essentially identical. These were added at the behest of [[Freescale Semiconductor]], whose interest in Power is mostly aimed at the embedded market.<ref>{{cite book \|title=Power ISA V2.07 \|publisher=IBM \|url=https://www.ibm.com/support/pages/power-isa-version-207-latest-power-instruction-set-architecture}}</ref> ===Modern use=== Starting around 2015, many processors have moved to a 64-bit format. These generally retained a 32-bit instruction format, while expanding the internal registers to a 64-bit format. By the original definition, these are compressed instructions, as they are smaller than the basic data word size. However, this term is not used in this context,; references to compressed instructions invariably refer to 16-bit versions.<ref>{{cite book \|title=Alpha Architecture Handbook \|publisher=DEC \|url=http://www.cs.cmu.edu/afs/cs/academic/class/15740-f03/public/doc/alpha-ref.pdf \|date=October 1996 \|page=1.4}}</ref> ==References==