Content deleted Content added
No edit summary |
m Task 18 (cosmetic): eval 5 templates: hyphenate params (5×); |
||
Line 93:
inserting the appropriate code to forward the output of one processor to the next processor.
Recent research focuses on using the power of GPU's<ref>J Anantpur, R Govindarajan, Runtime dependence computation and execution of loops on heterogeneous systems {{cite web|url=http://hpc.serc.iisc.ernet.in/~jayvant/papers/CGO-2013.pdf |title=Archived copy |
The memory accessed (whether direct or indirect) can be simply marked for different iterations of a loop and can be compared for dependency detection. Using this information, the iterations are grouped into levels such that iterations belonging to the same level are independent of each other, and can be executed in parallel.
==Difficulties==
Automatic parallelization by compilers or tools is very difficult due to the following reasons:<ref>{{cite web|url=http://blitzprog.org/posts/automatic-parallelism-and-data-dependency |title=Automatic parallelism and data dependency |url-status=dead |
* dependence analysis is hard for code that uses indirect addressing, pointers, recursion, or indirect function calls because it is difficult to detect such dependencies at compile time;
* loops have an unknown number of iterations;
|