CPU2006 license: | 49 | Test date: | May-2009 |
---|---|---|---|
Test sponsor: | Advanced Micro Devices | Hardware Availability: | Jun-2009 |
Tested by: | Advanced Micro Devices | Software Availability: | Apr-2009 |
Hardware | |
---|---|
CPU Name: | AMD Opteron 8435 |
CPU Characteristics: | |
CPU MHz: | 2600 |
FPU: | Integrated |
CPU(s) enabled: | 24 cores, 4 chips, 6 cores/chip |
CPU(s) orderable: | 2,4 chips |
Primary Cache: | 64 KB I + 64 KB D on chip per core |
Secondary Cache: | 512 KB I+D on chip per core |
L3 Cache: | 6 MB I+D on chip per chip |
Other Cache: | None |
Memory: | 64 GB (16x4 GB, DDR2-800, CL5, Reg, Dual Rank) |
Disk Subsystem: | 1 x 250 GB SATA, 7200 RPM |
Other Hardware: | None |
Software | |
---|---|
Operating System: | Red Hat Enterprise Linux Server release 5.3, Advanced Platform, Kernel 2.6.18-128.el5 |
Compiler: | PGI Server Complete Version 8.0; x86 Open64 4.2.2 Compiler Suite (from AMD) |
Auto Parallel: | Yes |
File System: | ext3 |
System State: | Run level 3 (Full multiuser with network) |
Base Pointers: | 64-bit |
Peak Pointers: | 32/64-bit |
Other Software: | binutils 2.18 |
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
Benchmark | Base Copies | Base Seconds | Base Ratio | Base Seconds | Base Ratio | Base Seconds | Base Ratio | Peak Copies | Peak Seconds | Peak Ratio | Peak Seconds | Peak Ratio | Peak Seconds | Peak Ratio |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
410.bwaves | 24 | 1576 | 207 | 1577 | 207 | 1577 | 207 | 24 | 1535 | 213 | 1534 | 213 | 1538 | 212 |
416.gamess | 24 | 1191 | 395 | 1192 | 394 | 1195 | 393 | 24 | 1166 | 403 | 1116 | 421 | 1111 | 423 |
433.milc | 24 | 1385 | 159 | 1385 | 159 | 1384 | 159 | 24 | 1385 | 159 | 1385 | 159 | 1384 | 159 |
434.zeusmp | 24 | 749 | 292 | 745 | 293 | 741 | 295 | 24 | 741 | 295 | 746 | 293 | 744 | 293 |
435.gromacs | 24 | 509 | 337 | 512 | 334 | 513 | 334 | 24 | 419 | 409 | 425 | 404 | 432 | 396 |
436.cactusADM | 24 | 940 | 305 | 936 | 306 | 940 | 305 | 4 | 131 | 364 | 131 | 365 | 132 | 363 |
437.leslie3d | 24 | 1709 | 132 | 1700 | 133 | 1700 | 133 | 24 | 1610 | 140 | 1603 | 141 | 1607 | 140 |
444.namd | 24 | 617 | 312 | 617 | 312 | 618 | 312 | 24 | 570 | 337 | 560 | 344 | 560 | 344 |
447.dealII | 24 | 640 | 429 | 651 | 422 | 640 | 429 | 24 | 476 | 577 | 470 | 584 | 470 | 585 |
450.soplex | 24 | 1216 | 165 | 1209 | 165 | 1212 | 165 | 24 | 1133 | 177 | 1128 | 177 | 1119 | 179 |
453.povray | 24 | 325 | 393 | 331 | 386 | 321 | 397 | 24 | 294 | 434 | 299 | 427 | 268 | 476 |
454.calculix | 24 | 466 | 425 | 467 | 424 | 467 | 424 | 24 | 415 | 477 | 413 | 480 | 414 | 478 |
459.GemsFDTD | 24 | 1981 | 129 | 1976 | 129 | 1982 | 128 | 24 | 1903 | 134 | 1908 | 133 | 1904 | 134 |
465.tonto | 24 | 736 | 321 | 735 | 321 | 736 | 321 | 24 | 624 | 378 | 626 | 377 | 621 | 380 |
470.lbm | 24 | 2652 | 124 | 2654 | 124 | 2653 | 124 | 24 | 2646 | 125 | 2643 | 125 | 2646 | 125 |
481.wrf | 24 | 1109 | 242 | 1110 | 241 | 1109 | 242 | 24 | 1076 | 249 | 1077 | 249 | 1074 | 250 |
482.sphinx3 | 24 | 1627 | 287 | 1583 | 296 | 1579 | 296 | 24 | 1486 | 315 | 1494 | 313 | 1479 | 316 |
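The overall rate metric is not listed in this section. As a rough sketch of how the table can be summarized, the Python below takes the median of the three base ratios for each benchmark and then forms the geometric mean of those medians, which is how SPEC CPU2006 defines its overall metrics. The ratios are copied from the table above and are already rounded, so the result may differ slightly from the officially published figure. The variable and function names are illustrative only.

```python
from math import exp, log
from statistics import median

# Base ratios for the three runs of each benchmark, copied from the table above.
base_ratios = {
    "410.bwaves": (207, 207, 207),
    "416.gamess": (395, 394, 393),
    "433.milc": (159, 159, 159),
    "434.zeusmp": (292, 293, 295),
    "435.gromacs": (337, 334, 334),
    "436.cactusADM": (305, 306, 305),
    "437.leslie3d": (132, 133, 133),
    "444.namd": (312, 312, 312),
    "447.dealII": (429, 422, 429),
    "450.soplex": (165, 165, 165),
    "453.povray": (393, 386, 397),
    "454.calculix": (425, 424, 424),
    "459.GemsFDTD": (129, 129, 128),
    "465.tonto": (321, 321, 321),
    "470.lbm": (124, 124, 124),
    "481.wrf": (242, 241, 242),
    "482.sphinx3": (287, 296, 296),
}


def geometric_mean(values):
    """Return the geometric mean of a collection of positive numbers."""
    values = list(values)
    return exp(sum(log(v) for v in values) / len(values))


# One median ratio per benchmark, then a single summary figure.
medians = {name: median(runs) for name, runs in base_ratios.items()}
overall_base = geometric_mean(medians.values())

print(f"Geometric mean of median base ratios: {overall_base:.1f}")
```

The same two steps apply to the peak columns; only the input ratios change.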
The config file option 'submit' was used. 'numactl' was used to bind copies to the cores. See the configuration file for details.
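The configuration file itself is not reproduced here. Purely as an illustration of what binding copies to cores with 'numactl' can look like, the sketch below builds one command line per copy, pinning copy N to core N and keeping its memory on the local node. The one-to-one copy-to-core mapping, the choice of '--physcpubind' and '--localalloc', and the placeholder benchmark command are all assumptions, not the settings used for this result.

```python
import shlex


def bound_command(copy_number, command):
    """Return an argv list that runs `command` pinned to one core via numactl."""
    return ["numactl", f"--physcpubind={copy_number}", "--localalloc", *command]


COPIES = 24  # number of copies in the base runs, from the results table
benchmark_command = ["./benchmark_exe", "input.dat"]  # hypothetical placeholder

# Assumed mapping: copy N is bound to core N.
commands = [bound_command(n, benchmark_command) for n in range(COPIES)]

for argv in commands[:3]:
    print(shlex.join(argv))
```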
'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152' was used to set environment locked pages in memory limit
Set vm/nr_hugepages=10800 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages
The tested system can be assembled using an SSI-MEB case and a Zippy PSL-6850P 850W power supply.
Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "450"
LD_LIBRARY_PATH = "/root/work/cpu2006v1.1/amd0905is-libs/64:/root/work/cpu2006v1.1/amd0905is-libs/32"
NCPUS = "6"
PGI_HUGE_PAGES = "450"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at http://developer.amd.com/cpu/open64.
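To put the huge-page settings in the notes above in context, a back-of-the-envelope calculation follows. It assumes a 2 MiB huge-page size (the usual default on x86-64 Linux, not stated in this report), treats the 64 GB of installed memory as 64 GiB, and reads the 'ulimit -l' value in KiB. It also checks whether the value 450 in HUGETLB_LIMIT and PGI_HUGE_PAGES, multiplied by 24 copies, matches the reserved page count. Reading 450 as a per-copy limit is an interpretation, not something the report states.

```python
HUGE_PAGE_MIB = 2          # assumed huge-page size; not stated in the report
NR_HUGEPAGES = 10800       # vm/nr_hugepages from /etc/sysctl.conf
INSTALLED_GIB = 64         # installed memory, treated here as GiB
MEMLOCK_KIB = 2097152      # argument of 'ulimit -l', interpreted as KiB
PAGES_PER_COPY = 450       # HUGETLB_LIMIT / PGI_HUGE_PAGES, assumed per copy
COPIES = 24                # copies in the base runs

reserved_gib = NR_HUGEPAGES * HUGE_PAGE_MIB / 1024
reserved_share = reserved_gib / INSTALLED_GIB
memlock_gib = MEMLOCK_KIB / (1024 * 1024)

print(f"Huge-page pool: {reserved_gib:.2f} GiB ({reserved_share:.0%} of memory)")
print(f"Locked-memory limit: {memlock_gib:.1f} GiB")
print(f"Per-copy limit x copies = {PAGES_PER_COPY * COPIES} pages")
```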
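Base Compiler Invocation | |
---|---|
C benchmarks: | pgcc |
C++ benchmarks: | pgcpp |
Fortran benchmarks: | pgf95 |
Benchmarks using both Fortran and C: | pgcc pgf95 |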
Base Portability Flags | |
---|---|
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -Mnomain |
436.cactusADM: | -DSPEC_CPU_LP64 -Mnomain |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
447.dealII: | -DSPEC_CPU_LP64 |
450.soplex: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -Mnomain |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
Base Optimization Flags | |
---|---|
C benchmarks: | -fastsse -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
C++ benchmarks: | -fastsse -Msmartalloc=huge -Mfprelaxed --zc_eh -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
Fortran benchmarks: | -fastsse -Msmartalloc=huge -Mfprelaxed -Mvect=short -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
Benchmarks using both Fortran and C: | -fastsse -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Mvect=short -Bstatic_pgi |
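Base Other Flags | |
---|---|
C benchmarks: | -Mipa=jobs:4 |
C++ benchmarks: | -Mipa=jobs:4 |
Fortran benchmarks: | -Mipa=jobs:4 |
Benchmarks using both Fortran and C: | -Mipa=jobs:4 |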
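Peak Compiler Invocation | |
---|---|
C benchmarks: | pgcc |
C++ benchmarks (except as noted below): | openCC |
444.namd: | pgcpp |
Fortran benchmarks (except as noted below): | openf95 |
410.bwaves: | pgf95 |
434.zeusmp: | pgf95 |
437.leslie3d: | pgf95 |
Benchmarks using both Fortran and C (except as noted below): | pgcc pgf95 |
435.gromacs: | opencc openf95 |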
Peak Portability Flags | |
---|---|
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 |
436.cactusADM: | -DSPEC_CPU_LP64 -Mnomain |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -Mnomain |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
Peak Optimization Flags | |
---|---|
433.milc: | basepeak = yes |
470.lbm: | -fastsse -Msmartalloc=huge -Mprefetch=t0 -Mloop32 -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
482.sphinx3: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Mfprelaxed -Msmartalloc -tp shanghai-64 -Bstatic_pgi |
410.bwaves: | -fastsse -Msmartalloc -Mprefetch=nta -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
416.gamess: | -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O2 -OPT:Ofast -OPT:ro=3 -OPT:unroll_size=256 -HP:bdt=2m:heap=2m |
434.zeusmp: | -fastsse -Mfprelaxed -Mprefetch=distance:8 -Mprefetch=t0 -Msmartalloc=huge -Msmartalloc=hugebss -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
437.leslie3d: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Mvect=fuse -Msmartalloc=huge -Mprefetch=distance:8 -Mprefetch=t0 -Mfprelaxed -tp shanghai-64 -Bstatic_pgi |
459.GemsFDTD: | -march=barcelona -Ofast -LNO:fission=2 -LNO:simd=2 -LNO:prefetch_ahead=1 -CG:load_exe=0 -HP |
465.tonto: | -march=barcelona -Ofast -OPT:alias=no_f90_pointer_alias -LNO:blocking=off -CG:load_exe=1 -IPA:plimit=525 -HP |
435.gromacs: | -march=barcelona -Ofast -OPT:rsqrt=2 -HP:bdt=2m:heap=2m |
436.cactusADM: | -fastsse -Mconcur -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
454.calculix: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Mvect=short -Msmartalloc=huge -Mprefetch=t0 -Mpre -Mfprelaxed -tp shanghai-64 -Bstatic_pgi |
481.wrf: | -fastsse -Mvect=noaltcode -Msmartalloc=huge -Mprefetch=distance:8 -Mfprelaxed -tp shanghai-64 -Bstatic_pgi |
Peak Other Flags | |
---|---|
-Mipa=jobs:4(pass 2) |
444.namd: | -Mipa=jobs:4(pass 2) |
410.bwaves: | -Mipa=jobs:4 |
434.zeusmp: | -Mipa=jobs:4 |
437.leslie3d: | -Mipa=jobs:4(pass 2) |
436.cactusADM: | -Mipa=jobs:4 |
454.calculix: | -Mipa=jobs:4(pass 2) |