Quantcast
Channel: Intel® Integrated Performance Primitives
Viewing all articles
Browse latest Browse all 1294

performance issue ippiCrossCorrNorm_8u32f_C1R

$
0
0

Hello

I compared the ipp703 call ippiCrossCorrValid_NormLevel_8u32f_C1R

to the 802 call ippiCrossCorrNorm_8u32f_C1R

and measured Timing in endless Loops (all buffers pre-allocated, 1000x1000 Image, 10x10 template)

Results s. Attachment

First Trial: in the Loop a sleep(0) directive was used

The 703 turns out to be 4x faster (!) than the new 802 function, but cpu load is extreme and would not give space to other Tasks in complex applications

Second Trial: in the Loop a sleep(100) directive was used

The 703 still Shows an extremly high cpu load though my calculated cpu use time is only at 5% !! 802 works as expected with few remaining load.

 

Now what shall i do ? I don´t want to use 703 because it seems to be bugous that cpu load is constantly high even after the call finished and the thread is sleeping. The 802 Performance is way poorer however.

Stefan


Viewing all articles
Browse latest Browse all 1294

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>