Hello,
The latest community version of IPP has a 4x performance regression in the ippiDilateBorder_16u_C1R function for large neighborhoods. In our use case, a 221x221 neighborhood applied to a 7002x8998 image is affected, though I expect the slowdown is measurable for smaller neighborhoods as well. I've seen this regression on Windows; I haven't tested on Linux yet. The CPU is a Haswell. I'm not sure how much it matters, but every value in the neighborhood mask is set to 1.
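
Since the mask is all ones, the dilation is a plain rectangular maximum filter, which can be split into a horizontal pass followed by a vertical pass. Below is a minimal sketch of such a reference in plain C, in case it helps as an independent baseline for checking output or timing. It is not IPP code and makes no claim about how IPP implements the function; the names `max_1d` and `dilate_rect_u16` are made up for illustration. It assumes a tightly packed image (row stride equal to the width), odd mask dimensions, and it ignores samples outside the image.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical helper: maximum over a centered window of k samples (k odd)
   along one line of n samples spaced `stride` elements apart.
   Samples that fall outside the line are ignored. */
static void max_1d(const uint16_t *src, uint16_t *dst,
                   size_t n, size_t stride, size_t k)
{
    size_t r = k / 2;
    for (size_t i = 0; i < n; ++i) {
        size_t lo = (i >= r) ? i - r : 0;
        size_t hi = (i + r < n) ? i + r : n - 1;
        uint16_t m = 0;
        for (size_t j = lo; j <= hi; ++j) {
            uint16_t v = src[j * stride];
            if (v > m) m = v;
        }
        dst[i * stride] = m;
    }
}

/* Hypothetical reference: dilation of a 16-bit single-channel image with an
   all-ones kw x kh mask, done as a row pass followed by a column pass.
   Returns 0 on success, -1 if the temporary buffer cannot be allocated. */
int dilate_rect_u16(const uint16_t *src, uint16_t *dst,
                    size_t width, size_t height, size_t kw, size_t kh)
{
    if (width == 0 || height == 0) return 0;

    uint16_t *tmp = malloc(width * height * sizeof *tmp);
    if (tmp == NULL) return -1;

    /* Horizontal pass: each row is filtered independently. */
    for (size_t y = 0; y < height; ++y)
        max_1d(src + y * width, tmp + y * width, width, 1, kw);

    /* Vertical pass: each column of the intermediate image is filtered. */
    for (size_t x = 0; x < width; ++x)
        max_1d(tmp + x, dst + x, height, width, kh);

    free(tmp);
    return 0;
}
```

For the case above this would be called as `dilate_rect_u16(src, dst, 7002, 8998, 221, 221)`. The two-pass form does work proportional to kw + kh per pixel instead of kw * kh, so it is only a rough yardstick, not a tuned implementation.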