I see 2 problems with your test code:

First of all, you are using GetTickCount. It's probably the least precise
timer available. Do this instead:

[pascal]uses Windows;

var
  t1, t2, fq: Int64;

begin
  QueryPerformanceFrequency(fq);   { counter ticks per second }
  QueryPerformanceCounter(t1);     { start }
  YourTestCode;
  QueryPerformanceCounter(t2);     { stop }
  writeln(((t2 - t1) / fq):8:6);   { elapsed time in seconds }
end.[/pascal]

And second:
The parameters you pass are far too big to fit in 2 (or 3, in some
Pascal implementations) registers, so the performance improvement,
if any, is very small.
It could be that splitting the parameters between the stack and registers
costs the compiler some extra bookkeeping.

And yes: try to make the functions actually do something. If you pass less
data (2x32 bit), the data is available immediately because it does not have
to be read from the stack first.
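
Something along these lines is what I mean. It's only a rough sketch: AddReg, AddStack and Loops are names I made up, and it assumes a 32-bit x86 target where the register and stdcall directives really differ (on 64-bit targets the compiler typically ignores them and both calls end up the same). It also assumes your Windows unit accepts Int64 for the performance counter calls, as in the snippet above.

[pascal]program RegVsStack;

uses
  Windows;

const
  Loops = 100000000;  { hypothetical iteration count, tune to taste }

{ Two 32-bit parameters: small enough to be passed in registers. }
function AddReg(a, b: LongInt): LongInt; register;
begin
  AddReg := a + b;
end;

{ Same work, but stdcall passes both parameters on the stack. }
function AddStack(a, b: LongInt): LongInt; stdcall;
begin
  AddStack := a + b;
end;

var
  t1, t2, fq: Int64;
  i, acc: LongInt;

begin
  QueryPerformanceFrequency(fq);

  acc := 0;
  QueryPerformanceCounter(t1);
  for i := 1 to Loops do
    acc := AddReg(acc, i) and $FFFF;   { mask keeps acc small, no overflow }
  QueryPerformanceCounter(t2);
  writeln('register: ', ((t2 - t1) / fq):8:6, ' s (acc = ', acc, ')');

  acc := 0;
  QueryPerformanceCounter(t1);
  for i := 1 to Loops do
    acc := AddStack(acc, i) and $FFFF;
  QueryPerformanceCounter(t2);
  writeln('stdcall:  ', ((t2 - t1) / fq):8:6, ' s (acc = ', acc, ')');
end.[/pascal]

Printing acc is there so the result is actually used; otherwise an optimizing compiler might throw part of the work away. Also check that the calls are not being inlined, or you are no longer measuring the calling convention at all.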

Edit:
I see you are using const. In that case, forget my point about size.
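
To show why const changes the picture: as far as I know, both Delphi and Free Pascal pass a large record declared as a const parameter by reference, so only a pointer travels to the function, while a plain value parameter means a copy of the whole record. A small sketch, with TBig, SumByValue and SumByConst being hypothetical names of mine:

[pascal]program ConstParam;

type
  TBig = record
    Data: array[0..255] of LongInt;   { 1 KB, far too big for registers }
  end;

{ Value parameter: the callee works on its own copy of the record. }
function SumByValue(r: TBig): LongInt;
var
  i, s: LongInt;
begin
  s := 0;
  for i := 0 to 255 do
    s := s + r.Data[i];
  SumByValue := s;
end;

{ const parameter: read-only, typically passed as a hidden pointer. }
function SumByConst(const r: TBig): LongInt;
var
  i, s: LongInt;
begin
  s := 0;
  for i := 0 to 255 do
    s := s + r.Data[i];
  SumByConst := s;
end;

var
  b: TBig;
  i: LongInt;

begin
  for i := 0 to 255 do
    b.Data[i] := i;
  writeln(SumByValue(b));
  writeln(SumByConst(b));
end.[/pascal]

Both calls print the same sum; the difference is only in how much data has to be moved for each call, which is what a timing test would pick up.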