Revision as of 17:10, 27 February 2015 edit 137.44.6.37 (talk) No edit summary ← Previous edit		Revision as of 22:58, 15 March 2015 edit undo Wootery (talk \| contribs) Extended confirmed users 868 edits Improved discussion of masking Next edit →
Line 9: <!-- Strictly, the latency-hiding is a feature of the zero-overhead scheduling implemented by modern GPUs... this might or might not be considered to be a property of 'SIMT' itself --> A downside of SIMT execution is the fact that ~~control~~thread-specific control-flow has to be ~~simulated~~performed using "masking:", ~~when~~leading to poor utilisation where control-flow is not coherent for all threads of a processor. ~~hits~~For instance, to handle an ''if~~''-''then~~''-''else'' block~~, and~~ ~~its~~where various threads of a processor execute ~~the~~ different paths ~~though the block~~, all threads must actually ~~pass~~process ~~through~~both paths (as all threads of ~~the~~a ~~block~~processor always execute in lock-step), but ~~for~~"masking" ~~processors~~is ~~that~~used ~~hit~~to ~~the~~disable ~~''if''~~and ~~part~~enable the ~~''else''~~various ~~part~~threads isas ~~"masked~~appropriate. This ~~out~~"masking" strategy is what distinguishes SIMT from ordinary SIMD, and ~~vice~~has ~~versa. A~~the benefit of itinexpensive ~~this~~synchronization isbetween ~~inexpensive~~the ~~synchronization~~threads of a processor.<ref name="spp">{{cite book \|author1=Michael McCool \|author2=James Reinders \|author3=Arch Robison \|title=Structured Parallel Programming: Patterns for Efficient Computation \|publisher=Elsevier \|year=2013 \|pages=209 ff.}}</ref> == See also ==

Single instruction, multiple threads: Difference between revisions