In a banked cache, the cache is divided into a cache dedicated to [[machine code|instruction]] storage and a cache dedicated to data. In contrast, a unified cache contains both the instructions and data in the same cache.<ref>Yan Solihin, 2015. Fundamentals of Parallel Multicore Architecture. CRC Press. p. 150. {{ISBN|978-1-4822-1119-1}}.</ref> While a program executes, the processor accesses the L1 cache (the uppermost-level cache, closest to the processor) to fetch both instructions and data. Performing both accesses at the same time in a unified cache requires multiple ports, which increases access time. Multiple ports also require additional hardware and wiring, leading to significant extra structure between the caches and the processing units.<ref>Steve Heath, 2002. Embedded Systems Design. Elsevier. p. 106. {{ISBN|978-0-08-047756-5}}.</ref> To avoid this, the L1 cache is often organized as a banked cache, which results in fewer ports, less hardware, and generally lower access times.<ref name=":1" />
Modern processors have split L1 caches, and in systems with multilevel caches the higher-level caches (L2 and beyond) are commonly unified.
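On Linux, the organization of each cache level is exposed through the sysfs cache interface, so the split or unified nature of a processor's caches can be listed directly. The following sketch, which assumes a Linux system with <code>/sys/devices/system/cpu/cpu0/cache/</code> populated, prints the level, type (Data, Instruction or Unified) and size of every cache reported for CPU 0; a split L1 appears as separate Data and Instruction entries, while L2 and L3 usually report Unified.

<syntaxhighlight lang="c">
/* Minimal sketch: list the cache levels of CPU 0 via the Linux sysfs
 * interface (assumes /sys/devices/system/cpu/cpu0/cache/ is present).
 * A split L1 shows up as separate "Data" and "Instruction" entries,
 * while L2/L3 typically report "Unified". */
#include <stdio.h>
#include <string.h>

static void read_attr(const char *dir, const char *attr, char *buf, size_t len)
{
    char path[256];
    snprintf(path, sizeof path, "%s/%s", dir, attr);
    FILE *f = fopen(path, "r");
    if (!f) { buf[0] = '\0'; return; }
    if (!fgets(buf, (int)len, f))
        buf[0] = '\0';
    fclose(f);
    buf[strcspn(buf, "\n")] = '\0';   /* strip trailing newline */
}

int main(void)
{
    for (int i = 0; ; i++) {
        char dir[128], level[16], type[16], size[16];
        snprintf(dir, sizeof dir,
                 "/sys/devices/system/cpu/cpu0/cache/index%d", i);
        read_attr(dir, "level", level, sizeof level);
        if (level[0] == '\0')
            break;                    /* no more cache indices */
        read_attr(dir, "type", type, sizeof type);
        read_attr(dir, "size", size, sizeof size);
        printf("L%s %-11s %s\n", level, type, size);
    }
    return 0;
}
</syntaxhighlight>

On a typical x86-64 desktop this prints entries such as <code>L1 Data 32K</code>, <code>L1 Instruction 32K</code>, <code>L2 Unified 256K</code> and <code>L3 Unified 8192K</code>; exact sizes vary by processor.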
=== Inclusion policies ===
=== Intel i5 Raptor Lake-HX (2024) ===
6-core (performance | efficiency):
* L1 cache – 128 {{abbr|KB|kilobytes}} per core
* L2 cache – 2 {{abbr|MB|megabytes}} per core | 4–8 {{abbr|MB|megabytes}} semi-shared
* L3 cache – 20–24 {{abbr|MB|megabytes}} shared
=== AMD EPYC 9684X (Zen 4, 2023) ===
96-core:
* L1 cache – 64 {{abbr|KB|kilobytes}} per core
* L3 cache – 96 {{abbr|MB|megabytes}} shared
=== AMD Ryzen 7000 series (Zen 4, 2022) ===
6- to 16-core:
* L1 cache – 64 {{abbr|KB|kilobytes}} per core
* L3 cache – 32 to 128 {{abbr|MB|megabytes}} shared
=== AMD Zen 2 (2019) ===
* L1 cache – 32 KB data & 32 KB instruction per core, 8-way
* L2 cache – 512 KB per core, 8-way inclusive
* L3 cache – 16 MB local per 4-core CCX, 2 CCXs per chiplet, 16-way non-inclusive. Up to 64 MB on desktop CPUs and 256 MB on server CPUs
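The chip-level maxima quoted above follow from multiplying the per-CCX figure by the CCX and chiplet counts; assuming two chiplets for the largest desktop parts and eight for the largest server parts (chiplet counts are not stated in the list above):

:<math>16\,\text{MB} \times 2\,\tfrac{\text{CCX}}{\text{chiplet}} \times 2\,\text{chiplets} = 64\,\text{MB}</math>
:<math>16\,\text{MB} \times 2\,\tfrac{\text{CCX}}{\text{chiplet}} \times 8\,\text{chiplets} = 256\,\text{MB}</math>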
=== AMD Zen (2017) ===
* L1 cache – 32 KB data & 64 KB instruction per core, 4-way
* L2 cache – 512 KB per core, 4-way inclusive
* L3 cache – 4 MB local & remote per 4-core CCX, 2 CCXs per chiplet, 16-way non-inclusive. Up to 16 MB on desktop CPUs and 64 MB on server CPUs
=== Intel Kaby Lake (2016) ===
* L1 cache (instruction and data) – 64 KB per core
* L2 cache – 256 KB per core
* L3 cache – 2 MB to 8 MB shared<ref name=":3">{{Cite web|url=https://ark.intel.com/|title=Intel Kaby Lake Microarchitecture}}</ref>
=== Intel Broadwell (2014) ===
* L1 cache (instruction and data) – 64 {{abbr|KB|kilobytes}} per core
* L2 cache – 256 {{abbr|KB|kilobytes}} per core
=== IBM POWER7 (2010) ===
* L1 cache (instruction and data) – 32 KB each per core, each 64-banked
* L2 cache – 256 KB, 8-way, 128B block, write back, inclusive of L1, 2 ns access latency
* L3 cache – 8 regions of 4 MB (total 32 MB), local region 6 ns, remote 30 ns, each region 8-way associative, DRAM data array, SRAM tag array<ref>{{Cite web|url=https://www-03.ibm.com/systems/power/hardware/795/specs.html|archive-url=https://web.archive.org/web/20100821102938/http://www-03.ibm.com/systems/power/hardware/795/specs.html|url-status=dead|archive-date=August 21, 2010|title=IBM Power7}}</ref>
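Access latencies such as those quoted for POWER7 can be observed on most machines with a pointer-chasing microbenchmark: a chain of dependent loads over a working set that is stepped past each cache capacity shows the average load latency rising at every level boundary. The sketch below is a generic illustration in C, not a POWER7-specific tool; the working-set sizes and iteration count are arbitrary illustrative choices.

<syntaxhighlight lang="c">
/* Minimal pointer-chasing sketch: average load-to-load latency for
 * increasing working-set sizes. As the working set outgrows L1, L2
 * and L3 in turn, the measured latency steps up toward DRAM latency.
 * Sizes and iteration counts are illustrative, not tuned. */
#include <stdio.h>
#include <stdlib.h>
#include <time.h>

#define ITERS 10000000UL

static double chase(size_t bytes)
{
    size_t n = bytes / sizeof(size_t);
    size_t *next  = malloc(n * sizeof(size_t));
    size_t *order = malloc(n * sizeof(size_t));

    /* Build a random cyclic permutation so the hardware prefetcher
     * cannot predict the next address. */
    for (size_t i = 0; i < n; i++) order[i] = i;
    for (size_t i = n - 1; i > 0; i--) {
        size_t j = (size_t)rand() % (i + 1);
        size_t t = order[i]; order[i] = order[j]; order[j] = t;
    }
    for (size_t i = 0; i < n; i++)
        next[order[i]] = order[(i + 1) % n];

    struct timespec t0, t1;
    volatile size_t p = 0;
    clock_gettime(CLOCK_MONOTONIC, &t0);
    for (unsigned long i = 0; i < ITERS; i++)
        p = next[p];                  /* each load address depends on the previous load */
    clock_gettime(CLOCK_MONOTONIC, &t1);

    free(next);
    free(order);
    double ns = (t1.tv_sec - t0.tv_sec) * 1e9 + (t1.tv_nsec - t0.tv_nsec);
    return ns / ITERS;                /* average latency per load */
}

int main(void)
{
    /* Step the working set from well inside L1 to well past L3. */
    for (size_t kb = 16; kb <= 64 * 1024; kb *= 4)
        printf("%8zu KB : %.1f ns per load\n", kb, chase(kb * 1024));
    return 0;
}
</syntaxhighlight>

Randomizing the pointer chain defeats the hardware prefetcher, so once the working set exceeds a given cache level the measured loads genuinely miss that level rather than being hidden by prefetching.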
== See also ==
* CPU microarchitectures mentioned in this article:
** [[POWER7]]
** [[Broadwell (microarchitecture)|Intel Broadwell]]
** [[Zen (microarchitecture)|AMD Zen]]
** [[Apple silicon|Apple Silicon]]
* [[CPU cache]]
* [[Memory hierarchy]]
* [[CAS latency|CAS latency (RAM)]]
* [[Cache (computing)]]