Ok guys. I fully removed some HADDPS (sse3)and HSUBPS (sse3) from the sampler rutine and added some "standart" ADDPS, SUBPS and PSHUFD and the result is like... WTF !!! The demo runs with the sse3 instructions on 32.89 FPS and now with the sse instructions 33.60. In the normal logic should be the sse3 instructions faster but ... they are slower. So yes... its true.. Better use some sse2 and lower version of sse instructions as the instructions from the higher versions. But still its good to use some benchmarks. I am surprised of the result. But i was reading similar messages about the speed of new sse instructions in other forums, so the surrprise was not so big. But i am not sure about the speed on new processor. As i said... better use a benchmark.