CPU2006 license: | 11 | Test date: | Feb-2009 |
---|---|---|---|
Test sponsor: | IBM Corporation | Hardware Availability: | Mar-2009 |
Tested by: | Advanced Micro Devices | Software Availability: | May-2008 |
Hardware | |
---|---|
CPU Name: | AMD Opteron 8378 |
CPU Characteristics: | |
CPU MHz: | 2400 |
FPU: | Integrated |
CPU(s) enabled: | 16 cores, 4 chips, 4 cores/chip |
CPU(s) orderable: | 1,2,3,4 chips |
Primary Cache: | 64 KB I + 64 KB D on chip per core |
Secondary Cache: | 512 KB I+D on chip per core |
L3 Cache: | 6 MB I+D on chip per chip |
Other Cache: | None |
Memory: | 64 GB (16 x 4 GB, DDR2-667 CL5 Reg Dual Rank) |
Disk Subsystem: | 1 x 73.4 GB SAS, 15000 RPM |
Other Hardware: | None |
Software | |
---|---|
Operating System: | SuSE Linux Enterprise Server 10 (x86_64) SP1, Kernel 2.6.16.46-0.12-smp |
Compiler: | PGI Server Complete Version 7.2 |
Auto Parallel: | Yes |
File System: | ReiserFS |
System State: | Run level 3 (Full multiuser with network) |
Base Pointers: | 32/64-bit |
Peak Pointers: | 64-bit |
Other Software: | binutils 2.18.50 |
Benchmark | Base | Peak | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | |
Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||
410.bwaves | 303 | 44.9 | 264 | 51.4 | 261 | 52.1 | 303 | 44.9 | 264 | 51.4 | 261 | 52.1 |
416.gamess | 1348 | 14.5 | 1344 | 14.6 | 1344 | 14.6 | 1272 | 15.4 | 1277 | 15.3 | 1281 | 15.3 |
433.milc | 631 | 14.6 | 633 | 14.5 | 631 | 14.5 | 617 | 14.9 | 619 | 14.8 | 619 | 14.8 |
434.zeusmp | 696 | 13.1 | 697 | 13.0 | 697 | 13.1 | 624 | 14.6 | 616 | 14.8 | 619 | 14.7 |
435.gromacs | 523 | 13.7 | 523 | 13.7 | 523 | 13.7 | 433 | 16.5 | 433 | 16.5 | 435 | 16.4 |
436.cactusADM | 102 | 117 | 91.6 | 131 | 93.0 | 129 | 96.6 | 124 | 92.2 | 130 | 93.3 | 128 |
437.leslie3d | 672 | 14.0 | 677 | 13.9 | 675 | 13.9 | 704 | 13.4 | 761 | 12.3 | 614 | 15.3 |
444.namd | 690 | 11.6 | 690 | 11.6 | 692 | 11.6 | 601 | 13.3 | 601 | 13.3 | 601 | 13.3 |
447.dealII | 656 | 17.4 | 656 | 17.4 | 656 | 17.4 | 592 | 19.3 | 591 | 19.4 | 592 | 19.3 |
450.soplex | 783 | 10.6 | 781 | 10.7 | 782 | 10.7 | 783 | 10.6 | 781 | 10.7 | 782 | 10.7 |
453.povray | 350 | 15.2 | 352 | 15.1 | 351 | 15.2 | 325 | 16.4 | 329 | 16.2 | 326 | 16.3 |
454.calculix | 546 | 15.1 | 541 | 15.3 | 543 | 15.2 | 438 | 18.8 | 438 | 18.8 | 437 | 18.9 |
459.GemsFDTD | 382 | 27.8 | 372 | 28.5 | 377 | 28.1 | 382 | 27.8 | 372 | 28.5 | 377 | 28.1 |
465.tonto | 692 | 14.2 | 692 | 14.2 | 689 | 14.3 | 609 | 16.2 | 610 | 16.1 | 610 | 16.1 |
470.lbm | 530 | 25.9 | 531 | 25.9 | 531 | 25.9 | 530 | 25.9 | 531 | 25.9 | 531 | 25.9 |
481.wrf | 550 | 20.3 | 550 | 20.3 | 552 | 20.2 | 607 | 18.4 | 607 | 18.4 | 612 | 18.3 |
482.sphinx3 | 1160 | 16.8 | 1097 | 17.8 | 1162 | 16.8 | 983 | 19.8 | 984 | 19.8 | 985 | 19.8 |
The config file option 'submit' was used. 'numactl' was used to bind copies to the cores.
Environment stack size set to 'unlimited'. The powersaved was disabled, set the CPU frequency to its maximum. Total number of huge pages available is 14336. 'ulimit -l 2097152' was used to set environment locked pages in memory quantity. Set vm/nr_hugepages=14336 in /etc/sysctl.conf mount -t hugetlbfs nodev /mnt/hugepages
Environment variables set by runspec before the start of the run: LD_LIBRARY_PATH = "/root/work/cpu2006v1.1/pgi72/linux_lib64:/root/work/cpu2006v1.1/pgi72/linux_lib32" NCPUS = "16"
pgcc |
pgcpp |
pgf95 |
pgcc pgf95 |
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -Mnomain |
436.cactusADM: | -DSPEC_CPU_LP64 -Mnomain |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
447.dealII: | -DSPEC_CPU_LP64 |
450.soplex: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -Mnomain |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
-Mvect=cachesize:6291456 -fastsse -Msmartalloc=huge -Mconcur -Mfprelaxed -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi |
-Mvect=cachesize:6291456 -fastsse -Msmartalloc=huge -Mfprelaxed -Mconcur --zc_eh -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi |
-Mvect=cachesize:6291456 -fastsse -Mfprelaxed -Msmartalloc=huge -Mconcur -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi |
-Mvect=cachesize:6291456 -fastsse -Msmartalloc=huge -Mconcur -Mfprelaxed -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi |
-Mipa=jobs:8 |
-Mipa=jobs:8 |
-Mipa=jobs:8 |
-Mipa=jobs:8 |
pgcc |
pgcpp |
pgf95 |
pgcc pgf95 |
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -Mnomain |
436.cactusADM: | -DSPEC_CPU_LP64 -Mnomain |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
450.soplex: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -Mnomain |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
433.milc: | -Mvect=cachesize:6291456 -fastsse -Msmartalloc=huge -Msafeptr -Mconcur -Mfprelaxed -Mipa=inline -Mipa=arg -Mipa=const -Mipa=ptr -Mipa=shape -tp barcelona-64 -Bstatic_pgi |
470.lbm: | basepeak = yes |
482.sphinx3: | -Mpfi(pass 1) -Mpfo(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -Mvect=cachesize:6291456 -fastsse -Mfprelaxed -Msmartalloc -tp barcelona-64 -Bstatic_pgi |
444.namd: | -Mpfi(pass 1) -Mpfo(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -Mvect=cachesize:6291456 -fastsse -Munroll=n:4 -Munroll=m:8 -Msmartalloc=huge -Mnodepchk -Mfprelaxed --zc_eh -tp barcelona-64 -Bstatic_pgi |
447.dealII: | -Mvect=cachesize:6291456 -fastsse -alias=ansi -Msmartalloc=huge -Mprefetch=t0 -Mnovect -Mfprelaxed --zc_eh -Mipa=fast -Mipa=inline -tp barcelona-32 -Bstatic_pgi |
450.soplex: | basepeak = yes |
453.povray: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inlinenopfo:3(pass 2) -Mipa=staticfunc(pass 2) -Mvect=cachesize:6291456 -fastsse -Msmartalloc=huge -Mprefetch=t0 -Mfprelaxed -tp barcelona-64 -Bstatic_pgi |
410.bwaves: | basepeak = yes |
416.gamess: | -Mpfi(pass 1) -Mpfo(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -Mvect=cachesize:6291456 -fastsse -Msmartalloc=huge -Mvect=noaltcode -Mprefetch=t0 -Mfprelaxed -tp barcelona-64 -Bstatic_pgi |
434.zeusmp: | -Mvect=cachesize:6291456 -fastsse -Mfprelaxed -Mconcur -Mprefetch=distance:8 -Mprefetch=t0 -Msmartalloc=huge -Msmartalloc=hugebss -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi |
437.leslie3d: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mconcur=noaltcode(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -Mvect=cachesize:6291456 -fastsse -Mvect=fuse -Msmartalloc=huge -Mprefetch=distance:8 -Mprefetch=t0 -Mfprelaxed -tp barcelona-64 -Bstatic_pgi |
459.GemsFDTD: | basepeak = yes |
465.tonto: | -Mvect=cachesize:6291456 -fastsse -O4 -Mvect=noaltcode -Msmartalloc=huge -Mprefetch=distance:8 -Mprefetch=t0 -Mfprelaxed -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi |
-Mipa=jobs:8(pass 2) |
-Mipa=jobs:8(pass 2) |
-Mipa=jobs:8 |
-Mipa=jobs:8(pass 2) | |
481.wrf: | No flags used |