Revision as of 06:41, 12 November 2015 edit SwisterTwister (talk \| contribs) 187,094 edits Cleaning up accepted Articles for creation submission (AFCH 0.9) ← Previous edit		Revision as of 12:26, 12 November 2015 edit undo Sing0512 (talk \| contribs) 143 edits m →Examples of GPU Programming for Multidimensional DSP Tag: Visual edit Next edit →
Line 67: \end{pmatrix},\quad C_{ij}=\sum_{k=1}^m A_{ik}B_{kj}</math> To compute each element in {{math\|'''C'''}} takes {{math\|''m''}} multiplications and {{math\|(''m'' - ''1'')}} additions. Therefore, with a CPU implementation, the time complexity to achieve this computation is ''Θ(n''<sup href="Category:GPGPU">''3''</sup>'')'' in the following C example''.'' However, we have known that elements in {{math\|'''C'''}} are independent to each ~~others~~other. Hence, the computation can be fully parallelized by SIMD processors, such as GPGPU devices. With a GPGPU implementation, the time complexity significantly reduces to ''Θ(n)'' by unrolling the for-loop showing in the following OpenCL example''.''<source lang="c" line="1"> // MxM matrix multiplication in C void matrixMul(

Multidimensional DSP with GPU acceleration: Difference between revisions