CPU2006 license: | 6 | Test date: | Aug-2009 |
---|---|---|---|
Test sponsor: | Sun Microsystems | Hardware Availability: | Jul-2009 |
Tested by: | Sun Microsystems | Software Availability: | Apr-2009 |
Hardware | |
---|---|
CPU Name: | AMD Opteron 8435 |
CPU Characteristics: | |
CPU MHz: | 2600 |
FPU: | Integrated |
CPU(s) enabled: | 24 cores, 4 chips, 6 cores/chip |
CPU(s) orderable: | 2 or 4 chips |
Primary Cache: | 64 KB I + 64 KB D on chip per core |
Secondary Cache: | 512 KB I+D on chip per core |
L3 Cache: | 6 MB I+D on chip per chip |
Other Cache: | None |
Memory: | 64 GB (16x4GB, DDR2-667, CL5, Reg, Dual Rank) |
Disk Subsystem: | 48 x 250GB 7200RPM SATA via NFS |
Other Hardware: | See additional details below |
Software | |
---|---|
Operating System: | SuSE Linux Enterprise Server 10 (x86_64) SP2, Kernel 2.6.16.60-0.21-smp |
Compiler: | PGI Server Complete Version 8.0 x86 Open64 4.2.2 Compiler Suite (from AMD) |
Auto Parallel: | Yes |
File System: | NFSv3 |
System State: | Run level 3 (Full multiuser with network) |
Base Pointers: | 64-bit |
Peak Pointers: | 32/64-bit |
Other Software: | binutils 2.18 |
Benchmark | Base | Peak | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | |
Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||
410.bwaves | 24 | 1899 | 172 | 1899 | 172 | 1898 | 172 | 24 | 1843 | 177 | 1842 | 177 | 1841 | 177 |
416.gamess | 24 | 1205 | 390 | 1201 | 391 | 1202 | 391 | 24 | 1121 | 419 | 1158 | 406 | 1145 | 410 |
433.milc | 24 | 1693 | 130 | 1692 | 130 | 1692 | 130 | 24 | 1693 | 130 | 1692 | 130 | 1692 | 130 |
434.zeusmp | 24 | 850 | 257 | 847 | 258 | 848 | 258 | 24 | 847 | 258 | 850 | 257 | 849 | 257 |
435.gromacs | 24 | 527 | 325 | 519 | 330 | 516 | 332 | 24 | 426 | 402 | 428 | 400 | 428 | 400 |
436.cactusADM | 24 | 1084 | 265 | 1075 | 267 | 1074 | 267 | 4 | 146 | 327 | 144 | 332 | 145 | 331 |
437.leslie3d | 24 | 2104 | 107 | 2105 | 107 | 2108 | 107 | 24 | 1986 | 114 | 1984 | 114 | 1981 | 114 |
444.namd | 24 | 621 | 310 | 620 | 310 | 627 | 307 | 24 | 564 | 341 | 563 | 342 | 563 | 342 |
447.dealII | 24 | 673 | 408 | 674 | 407 | 672 | 409 | 24 | 494 | 556 | 489 | 562 | 490 | 560 |
450.soplex | 24 | 1452 | 138 | 1448 | 138 | 1444 | 139 | 24 | 1349 | 148 | 1322 | 151 | 1319 | 152 |
453.povray | 24 | 321 | 398 | 323 | 395 | 350 | 365 | 24 | 270 | 472 | 292 | 437 | 269 | 474 |
454.calculix | 24 | 478 | 414 | 481 | 412 | 478 | 414 | 24 | 426 | 465 | 425 | 465 | 425 | 466 |
459.GemsFDTD | 24 | 2454 | 104 | 2450 | 104 | 2448 | 104 | 24 | 2377 | 107 | 2383 | 107 | 2376 | 107 |
465.tonto | 24 | 803 | 294 | 791 | 298 | 798 | 296 | 24 | 683 | 346 | 684 | 345 | 695 | 340 |
470.lbm | 24 | 3300 | 99.9 | 3299 | 100 | 3300 | 99.9 | 24 | 3290 | 100 | 3289 | 100 | 3287 | 100 |
481.wrf | 24 | 1358 | 197 | 1358 | 197 | 1361 | 197 | 24 | 1318 | 203 | 1318 | 203 | 1319 | 203 |
482.sphinx3 | 24 | 1923 | 243 | 1856 | 252 | 1881 | 249 | 24 | 1728 | 271 | 1733 | 270 | 1733 | 270 |
The config file option 'submit' was used. 'numactl' was used to bind copies to the cores. See the configuration file for details.
'ulimit -s unlimited' was used to set environment stack size 'ulimit -l 2097152' was used to set environment locked pages in memory limit Set vm/nr_hugepages=10800 in /etc/sysctl.conf mount -t hugetlbfs nodev /mnt/hugepages
Default BIOS settings used except: DCT Unganged Mode set to "Always" to enable Unganged Mode
Environment variables set by runspec before the start of the run: HUGETLB_LIMIT = "450" LD_LIBRARY_PATH = "/ctmp0/gnana/pegasus-istanbul-sles10sp2/amd0905is-libs/64:/ctmp0/gnana/pegasus-istanbul-sles10sp2/amd0905is-libs/32" NCPUS = "6" PGI_HUGE_PAGES = "450" The NFS server used was a Sun Fire X4540 containing 48 x 250GB 7200RPM SATA disks. Connections to the clients were via gigabit ethernet. The x86 Open64 Compiler Suite is only available from (and supported by) AMD at http://developer.amd.com/cpu/open64.
pgcc |
pgcpp |
pgf95 |
pgcc pgf95 |
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -Mnomain |
436.cactusADM: | -DSPEC_CPU_LP64 -Mnomain |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
447.dealII: | -DSPEC_CPU_LP64 |
450.soplex: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -Mnomain |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
-fastsse -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
-fastsse -Msmartalloc=huge -Mfprelaxed --zc_eh -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
-fastsse -Msmartalloc=huge -Mfprelaxed -Mvect=short -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
-fastsse -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Mvect=short -Bstatic_pgi |
-Mipa=jobs:4 |
-Mipa=jobs:4 |
-Mipa=jobs:4 |
-Mipa=jobs:4 |
pgcc |
openCC | |
444.namd: | pgcpp |
openf95 | |
410.bwaves: | pgf95 |
434.zeusmp: | pgf95 |
437.leslie3d: | pgf95 |
pgcc pgf95 | |
435.gromacs: | opencc openf95 |
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 |
436.cactusADM: | -DSPEC_CPU_LP64 -Mnomain |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -Mnomain |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
433.milc: | basepeak = yes |
470.lbm: | -fastsse -Msmartalloc=huge -Mprefetch=t0 -Mloop32 -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
482.sphinx3: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Mfprelaxed -Msmartalloc -tp shanghai-64 -Bstatic_pgi |
410.bwaves: | -fastsse -Msmartalloc -Mprefetch=nta -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
416.gamess: | -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O2 -OPT:Ofast -OPT:ro=3 -OPT:unroll_size=256 -HP:bdt=2m:heap=2m |
434.zeusmp: | -fastsse -Mfprelaxed -Mprefetch=distance:8 -Mprefetch=t0 -Msmartalloc=huge -Msmartalloc=hugebss -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
437.leslie3d: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Mvect=fuse -Msmartalloc=huge -Mprefetch=distance:8 -Mprefetch=t0 -Mfprelaxed -tp shanghai-64 -Bstatic_pgi |
459.GemsFDTD: | -march=barcelona -Ofast -LNO:fission=2 -LNO:simd=2 -LNO:prefetch_ahead=1 -CG:load_exe=0 -HP |
465.tonto: | -march=barcelona -Ofast -OPT:alias=no_f90_pointer_alias -LNO:blocking=off -CG:load_exe=1 -IPA:plimit=525 -HP |
435.gromacs: | -march=barcelona -Ofast -OPT:rsqrt=2 -HP:bdt=2m:heap=2m |
436.cactusADM: | -fastsse -Mconcur -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp shanghai-64 -Bstatic_pgi |
454.calculix: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Mvect=short -Msmartalloc=huge -Mprefetch=t0 -Mpre -Mfprelaxed -tp shanghai-64 -Bstatic_pgi |
481.wrf: | -fastsse -Mvect=noaltcode -Msmartalloc=huge -Mprefetch=distance:8 -Mfprelaxed -tp shanghai-64 -Bstatic_pgi |
-Mipa=jobs:4(pass 2) |
444.namd: | -Mipa=jobs:4(pass 2) |
410.bwaves: | -Mipa=jobs:4 |
434.zeusmp: | -Mipa=jobs:4 |
437.leslie3d: | -Mipa=jobs:4(pass 2) |
436.cactusADM: | -Mipa=jobs:4 |
454.calculix: | -Mipa=jobs:4(pass 2) |