Content deleted Content added
m fix |
Nimarazmjoo (talk | contribs) m minor writing correction |
||
Line 20:
=== Minimize the distance over which data is moved ===
Moving data in general-purpose [[Memory hierarchy|memory hierarchies]] requires a remarkable amount of energy in order to attempt to minimize the latency to access data. In the case of Domain-Specific Architectures, it is expected that understanding
=== Invest saved resources into arithmetic units or bigger memories ===
Since a remarkable amount of hardware resources can be saved by dropping general-purpose architectural optimizations such as out-of-order execution, [[Prefetching (computing)|prefetching]], address [[Coalescing (computer science)|coalescing]], and hardware speculation, the resources saved should be re-invested to maximally exploit the available [[Parallelism (computing)|parallelism]], for example, by adding more arithmetic units
=== Use the easiest form of parallelism that matches the ___domain ===
|