CPU2006 license: | 9006 | Test date: | Jan-2009 |
---|---|---|---|
Test sponsor: | NEC Corporation | Hardware Availability: | Nov-2008 |
Tested by: | Bull SAS | Software Availability: | Nov-2008 |
Hardware | |
---|---|
CPU Name: | Intel Xeon E7440 |
CPU Characteristics: | 1066 MHz system bus |
CPU MHz: | 2400 |
FPU: | Integrated |
CPU(s) enabled: | 16 cores, 4 chips, 4 cores/chip |
CPU(s) orderable: | 1,2,3,4 chips |
Primary Cache: | 32 KB I + 32 KB D on chip per core |
Secondary Cache: | 6 MB I+D on chip per chip, 3 MB shared / 2 cores |
L3 Cache: | 16 MB I+D on chip per chip |
Other Cache: | None |
Memory: | 32 GB (16 x 2GB DDR2-667 FBDIMM) |
Disk Subsystem: | 1x146 GB SAS, 10000 RPM |
Other Hardware: | None |
Software | |
---|---|
Operating System: | SUSE Linux Enterprise Server 10 (x86_64) SP2, Kernel 2.6.16.60-0.21-smp |
Compiler: | Intel C++ and Fortran Compiler 11.0 for Linux Build 20080730 Package ID: l_cproc_b_11.0.042, l_fproc_b_11.0.042 |
Auto Parallel: | Yes |
File System: | ReiserFS |
System State: | Run level 3 (multi-user) |
Base Pointers: | 64-bit |
Peak Pointers: | 32/64-bit |
Other Software: | Binutils 2.18.50.0.7.20080502 |
Benchmark | Base | Peak | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | |
Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||
410.bwaves | 16 | 3851 | 56.5 | 3841 | 56.6 | 3848 | 56.5 | 16 | 3852 | 56.4 | 3834 | 56.7 | 3848 | 56.5 |
416.gamess | 16 | 1097 | 286 | 1096 | 286 | 1099 | 285 | 16 | 1080 | 290 | 1076 | 291 | 1079 | 290 |
433.milc | 16 | 2288 | 64.2 | 2285 | 64.3 | 2282 | 64.4 | 16 | 2282 | 64.4 | 2278 | 64.5 | 2277 | 64.5 |
434.zeusmp | 16 | 1296 | 112 | 1297 | 112 | 1298 | 112 | 16 | 1293 | 113 | 1302 | 112 | 1305 | 112 |
435.gromacs | 16 | 492 | 232 | 492 | 232 | 493 | 232 | 16 | 488 | 234 | 488 | 234 | 488 | 234 |
436.cactusADM | 16 | 1287 | 149 | 1280 | 149 | 1282 | 149 | 1 | 81.0 | 148 | 81.0 | 148 | 81.2 | 147 |
437.leslie3d | 16 | 2730 | 55.1 | 2747 | 54.8 | 2738 | 54.9 | 16 | 2682 | 56.1 | 2694 | 55.8 | 2691 | 55.9 |
444.namd | 16 | 621 | 207 | 621 | 207 | 627 | 205 | 16 | 625 | 205 | 633 | 203 | 632 | 203 |
447.dealII | 16 | 938 | 195 | 932 | 196 | 932 | 196 | 16 | 914 | 200 | 906 | 202 | 903 | 203 |
450.soplex | 16 | 2107 | 63.3 | 2108 | 63.3 | 2104 | 63.4 | 16 | 1917 | 69.6 | 1916 | 69.6 | 1917 | 69.6 |
453.povray | 16 | 275 | 310 | 270 | 316 | 271 | 314 | 16 | 230 | 371 | 230 | 371 | 230 | 371 |
454.calculix | 16 | 547 | 241 | 558 | 237 | 546 | 242 | 16 | 558 | 236 | 546 | 242 | 544 | 243 |
459.GemsFDTD | 16 | 3704 | 45.8 | 3700 | 45.9 | 3694 | 46.0 | 16 | 3742 | 45.4 | 3707 | 45.8 | 3706 | 45.8 |
465.tonto | 16 | 886 | 178 | 889 | 177 | 890 | 177 | 16 | 836 | 188 | 837 | 188 | 837 | 188 |
470.lbm | 16 | 5838 | 37.7 | 5822 | 37.8 | 5836 | 37.7 | 8 | 1561 | 70.4 | 1558 | 70.6 | 1559 | 70.5 |
481.wrf | 16 | 2008 | 89.0 | 2003 | 89.2 | 1972 | 90.6 | 16 | 2008 | 89.0 | 2003 | 89.2 | 1972 | 90.6 |
482.sphinx3 | 16 | 2210 | 141 | 2214 | 141 | 2198 | 142 | 8 | 977 | 160 | 984 | 158 | 992 | 157 |
The config file option 'submit' was used. taskset was used to bind processes to cores except for 436.cactusADM peak For peak modules using 1/2 the number of available cores, copies were each assigned to a single L2 cache using mysubmit.pl script. See the flags description file for mysubmit.pl details.
'ulimit -s unlimited' was used to set the stacksize to unlimited prior to run OMP_NUM_THREADS set to number of cores KMP_AFFINITY set to physical,0 KMP_STACKSIZE set to 64M
BIOS Settings: Adjacent Cache Line Prefetch = Disabled Hardware Prefetcher = Disabled High Bandwidth option = Enabled
The NEC Express5800/R140a-4(Intel Xeon E7440) and the Bull NovaScale R480 E1(Intel Xeon E7440, 2.40 GHz) models are electronically equivalent. The results have been measured on a Bull NovaScale R480 E1(Intel Xeon E7440, 2.40 GHz) model.
icc |
icpc |
ifort |
icc ifort |
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
447.dealII: | -DSPEC_CPU_LP64 |
450.soplex: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
-xSSE4.1 -ipo -O3 -no-prec-div -static -opt-prefetch |
-xSSE4.1 -ipo -O3 -no-prec-div -static -opt-prefetch |
-xSSE4.1 -ipo -O3 -no-prec-div -static -opt-prefetch |
-xSSE4.1 -ipo -O3 -no-prec-div -static -opt-prefetch |
icc | |
482.sphinx3: | /opt/intel/Compiler/11.0/042/bin/ia32/icc -L/opt/intel/Compiler/11.0/042/ipp/ia32/lib -I/opt/intel/Compiler/11.0/042/ipp/ia32/include |
icpc | |
450.soplex: | /opt/intel/Compiler/11.0/042/bin/ia32/icpc -L/opt/intel/Compiler/11.0/042/ipp/ia32/lib -I/opt/intel/Compiler/11.0/042/ipp/ia32/include |
ifort | |
437.leslie3d: | /opt/intel/Compiler/11.0/042/bin/ia32/ifort -L/opt/intel/Compiler/11.0/042/ipp/ia32/lib -I/opt/intel/Compiler/11.0/042/ipp/ia32/include |
icc ifort |
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
444.namd: | -DSPEC_CPU_LP64 |
447.dealII: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
433.milc: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -fno-alias |
470.lbm: | -xSSE4.1 -ipo -O3 -no-prec-div -static -opt-prefetch -auto-ilp32 |
482.sphinx3: | -xSSE4.1 -ipo -O3 -no-prec-div -static -unroll2 |
444.namd: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -fno-alias -auto-ilp32 |
447.dealII: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -unroll2 -ansi-alias -scalar-rep- |
450.soplex: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -opt-malloc-options=3 |
453.povray: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -unroll4 -ansi-alias |
410.bwaves: | -xSSE4.1 -ipo -O3 -no-prec-div -static -opt-prefetch |
416.gamess: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -unroll2 -Ob0 -ansi-alias -scalar-rep- |
434.zeusmp: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static |
437.leslie3d: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -opt-malloc-options=3 -opt-prefetch |
459.GemsFDTD: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -unroll2 -Ob0 -opt-prefetch |
465.tonto: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -unroll4 -auto |
435.gromacs: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -opt-prefetch -auto-ilp32 |
436.cactusADM: | -prof-gen(pass 1) -prof-use(pass 2) -xSSE4.1 -ipo -O3 -no-prec-div -static -unroll2 -opt-prefetch -parallel -auto-ilp32 |
454.calculix: | -xSSE4.1 -ipo -O3 -no-prec-div -static -auto-ilp32 |
481.wrf: | basepeak = yes |