Attached are 32 bit and 64 bit compilation results of my OpenMP benchmark on a Phenom II X4 CPU using 64-Bit Windows 7. The 32 bit version uses old i387 floating point instructions with the 64 bit variety running SSE instructions, but not very efficiently due to compiler limitations.
The main observation is that, with 4 CPUs, a single program using OpenMP can produce speeds almost four times faster than a single CPU, where there are not cache size limitations.
Note software information shown by the benchmark:
Windows 7 is identified as Version 6.1, build 7600.
64 Bit Version correctly shows 8 GB RAM and 8388 GB Virtual Space
32 Bit Version at least shows 4 GB RAM available and 4 GB Virtual Space
Roy
|