For games, FPC users have a particular advantage because the compiler can generate SSE2 floating-point code, which speeds up floating-point math by double-digit percentages.
So... the FPC compiler generates assembly code that uses the newer CPU instructions? I wish that were true, but I find it hard to believe, since I haven't seen a compiler that implements even MMX.