Hello
I compared the ipp703 call ippiCrossCorrValid_NormLevel_8u32f_C1R
to the 802 call ippiCrossCorrNorm_8u32f_C1R
and measured Timing in endless Loops (all buffers pre-allocated, 1000x1000 Image, 10x10 template)
Results s. Attachment
First Trial: in the Loop a sleep(0) directive was used
The 703 turns out to be 4x faster (!) than the new 802 function, but cpu load is extreme and would not give space to other Tasks in complex applications
Second Trial: in the Loop a sleep(100) directive was used
The 703 still Shows an extremly high cpu load though my calculated cpu use time is only at 5% !! 802 works as expected with few remaining load.
Now what shall i do ? I don´t want to use 703 because it seems to be bugous that cpu load is constantly high even after the call finished and the thread is sleeping. The 802 Performance is way poorer however.
Stefan