SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

IBM Corporation

IBM System x3755 M3 (AMD Opteron 6180 SE)

SPECfp®2006 = 33.7

CPU2006 license: 11
Test date: Mar-2011
Test sponsor: IBM Corporation
Hardware Availability: May-2011
Tested by: IBM Corporation
Software Availability: May-2010
Hardware
CPU Name: AMD Opteron 6180 SE
CPU Characteristics:
CPU MHz: 2500
FPU: Integrated
CPU(s) enabled: 48 cores, 4 chips, 12 cores/chip
CPU(s) orderable: 2,4 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 12 MB I+D on chip per chip, 6 MB shared / 6 cores
Other Cache: None
Memory: 128 GB (32 x 4 GB 2Rx4 PC3-10600R-9, ECC)
Disk Subsystem: 1 x 250 GB SATA, 7200 RPM
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 5.5, Kernel 2.6.18-194.el5
Compiler: x86 Open64 4.2.3.2 Compiler Suite (from AMD)
Auto Parallel: Yes
File System: ext3
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

Benchmark      Base                                         Peak
               Seconds Ratio  Seconds Ratio  Seconds Ratio  Seconds Ratio  Seconds Ratio  Seconds Ratio
410.bwaves         222  61.3      221  61.5      221  61.6     60.1   226     59.7   227     59.9   227
416.gamess        1228  15.9     1228  15.9     1226  16.0     1118  17.5     1123  17.4     1120  17.5
433.milc           551  16.7      547  16.8      549  16.7      398  23.1      396  23.2      396  23.2
434.zeusmp         246  36.9      246  36.9      247  36.9      239  38.1      239  38.0      239  38.0
435.gromacs        540  13.2      541  13.2      540  13.2      422  16.9      422  16.9      421  17.0
436.cactusADM      151  79.2      153  78.3      151  79.3     98.8   121     99.7   120     99.3   120
437.leslie3d       542  17.3      537  17.5      543  17.3      497  18.9      494  19.0      499  18.8
444.namd           625  12.8      626  12.8      627  12.8      570  14.1      570  14.1      569  14.1
447.dealII         491  23.3      487  23.5      492  23.3      417  27.4      416  27.5      417  27.5
450.soplex         626  13.3      626  13.3      625  13.4      543  15.4      546  15.3      541  15.4
453.povray         285  18.6      288  18.5      284  18.7      276  19.3      276  19.2      276  19.3
454.calculix       381  21.6      382  21.6      383  21.6      360  22.9      360  22.9      363  22.7
459.GemsFDTD       355  29.9      355  29.9      355  29.9      337  31.5      337  31.5      337  31.5
465.tonto          541  18.2      538  18.3      539  18.3      527  18.7      526  18.7      522  18.8
470.lbm            575  23.9      578  23.8      577  23.8     23.3   589     23.2   592     23.4   587
481.wrf            362  30.8      364  30.7      362  30.9      362  30.8      364  30.7      362  30.9
482.sphinx3        822  23.7      822  23.7      820  23.8      781  25.0      784  24.9      780  25.0
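
The overall SPEC CPU2006 metric is the geometric mean of the per-benchmark median ratios. The sketch below recomputes it from the ratios in the table above. The dictionary layout and function names are illustrative choices, not part of the SPEC tools, and because the published ratios are rounded, the recomputed value may differ from the official figure in the last digit.

```python
import math
import statistics

# Ratios copied from the results table: (three base runs, three peak runs).
RATIOS = {
    "410.bwaves":    ((61.3, 61.5, 61.6), (226, 227, 227)),
    "416.gamess":    ((15.9, 15.9, 16.0), (17.5, 17.4, 17.5)),
    "433.milc":      ((16.7, 16.8, 16.7), (23.1, 23.2, 23.2)),
    "434.zeusmp":    ((36.9, 36.9, 36.9), (38.1, 38.0, 38.0)),
    "435.gromacs":   ((13.2, 13.2, 13.2), (16.9, 16.9, 17.0)),
    "436.cactusADM": ((79.2, 78.3, 79.3), (121, 120, 120)),
    "437.leslie3d":  ((17.3, 17.5, 17.3), (18.9, 19.0, 18.8)),
    "444.namd":      ((12.8, 12.8, 12.8), (14.1, 14.1, 14.1)),
    "447.dealII":    ((23.3, 23.5, 23.3), (27.4, 27.5, 27.5)),
    "450.soplex":    ((13.3, 13.3, 13.4), (15.4, 15.3, 15.4)),
    "453.povray":    ((18.6, 18.5, 18.7), (19.3, 19.2, 19.3)),
    "454.calculix":  ((21.6, 21.6, 21.6), (22.9, 22.9, 22.7)),
    "459.GemsFDTD":  ((29.9, 29.9, 29.9), (31.5, 31.5, 31.5)),
    "465.tonto":     ((18.2, 18.3, 18.3), (18.7, 18.7, 18.8)),
    "470.lbm":       ((23.9, 23.8, 23.8), (589, 592, 587)),
    "481.wrf":       ((30.8, 30.7, 30.9), (30.8, 30.7, 30.9)),
    "482.sphinx3":   ((23.7, 23.7, 23.8), (25.0, 24.9, 25.0)),
}


def geometric_mean(values):
    """Geometric mean computed through logarithms to avoid overflow."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


def overall(column):
    """column 0 = base, column 1 = peak; uses the median of the three runs."""
    medians = [statistics.median(runs[column]) for runs in RATIOS.values()]
    return geometric_mean(medians)


base_score = overall(0)
peak_score = overall(1)
print(f"recomputed base: {base_score:.1f}")
print(f"recomputed peak: {peak_score:.1f}")
```

The recomputed peak value should land close to the SPECfp2006 = 33.7 reported at the top of this result. The large peak ratios for 410.bwaves, 436.cactusADM, and 470.lbm pull the geometric mean up noticeably.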

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.

Operating System Notes

'ulimit -s unlimited' was used to set the stack size limit to unlimited.
'ulimit -l 2097152' was used to set the locked-pages-in-memory limit.

Set vm/nr_hugepages=8000 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

cpuspeed stop was used to set the CPU frequency to its maximum.
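
As a rough sizing check on the settings above, the following sketch works out how much memory the huge page pool and the locked-memory limit correspond to. It assumes 2 MiB huge pages, which is the usual x86-64 default and matches the 2m values in the -HP flags but is not stated in these notes, and it treats the 128 GB of installed memory as 128 GiB. The `ulimit -l` value is interpreted in 1024-byte units, as bash does.

```python
# Values taken from the notes in this result.
NR_HUGEPAGES = 8000            # vm/nr_hugepages
LOCKED_LIMIT_KIB = 2097152     # ulimit -l (bash reports this in 1024-byte units)
INSTALLED_MEMORY_GIB = 128     # assumption: "128 GB" read as 128 GiB

# Assumption: 2 MiB huge pages (not stated in the notes).
HUGEPAGE_SIZE_MIB = 2

reserved_mib = NR_HUGEPAGES * HUGEPAGE_SIZE_MIB
reserved_gib = reserved_mib / 1024
locked_gib = LOCKED_LIMIT_KIB / (1024 * 1024)
pool_share = reserved_gib / INSTALLED_MEMORY_GIB

print(f"huge page pool: {reserved_mib} MiB ({reserved_gib:.3f} GiB)")
print(f"share of installed memory: {pool_share:.1%}")
print(f"locked-memory limit: {locked_gib:.1f} GiB")
```

Under these assumptions the pool reserves a little over 15 GiB, roughly an eighth of the installed memory.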

Platform Notes

BIOS Settings:
 Operating Mode set to Performance Mode

General Notes

Environment variables set by runspec before the start of the run:
LD_LIBRARY_PATH = "/root/speccpu_speed_revA-6/amd1002mc-speed-libs-revA/64:/root/speccpu_speed_revA-6/amd1002mc-speed-libs-revA/32"
O64_OMP_AFFINITY_MAP = "0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47"
O64_OMP_SPIN_USER_LOCK = "true"
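
The affinity map above simply enumerates CPUs 0 through 47, one entry per enabled core (4 chips x 12 cores/chip). A minimal sketch of generating such a string for a different core count follows. The helper name is hypothetical, and the reading of O64_OMP_AFFINITY_MAP as a list of CPU numbers for the OpenMP threads is an assumption based on its name and value.

```python
def affinity_map(chips, cores_per_chip):
    """Return a comma-separated list of CPU numbers, 0 .. total cores - 1.

    Hypothetical helper: it only reproduces the string format seen in
    this result and does not query the actual CPU topology.
    """
    total_cores = chips * cores_per_chip
    return ",".join(str(cpu) for cpu in range(total_cores))


# Topology from the Hardware section: 4 chips, 12 cores/chip.
o64_map = affinity_map(chips=4, cores_per_chip=12)
print(f'O64_OMP_AFFINITY_MAP = "{o64_map}"')
```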

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64

Binaries were compiled on SLES10 SP2 with binutils 2.18

Base Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -march=barcelona   -Ofast   -HP:bdt=2m:heap=2m 

C++ benchmarks:

 -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -HP:bdt=2m:heap=2m 

Fortran benchmarks:

 -march=barcelona   -Ofast   -apo   -LNO:parallel_overhead=10000   -LNO:fusion_peeling_limit=0   -HP:bdt=2m:heap=2m 

Benchmarks using both Fortran and C:

 -march=barcelona   -Ofast   -HP:bdt=2m:heap=2m   -apo   -LNO:parallel_overhead=10000   -LNO:fusion_peeling_limit=0 

Peak Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -march=barcelona   -Ofast   -apo   -CG:movnti=1   -CG:local_sched_alg=1   -CG:locs_shallow_depth=1   -CG:compute_to=on   -HP:bdt=2m:heap=2m   -LNO:prefetch=3 
470.lbm:  -march=barcelona   -Ofast   -mso   -apo   -CG:sse_cse_regs=0   -LNO:prefetch_ahead=4   -CG:locs_shallow_depth=1   -CG:cmp_peep=on   -CG:compute_to=on   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -OPT:keep_ext=on   -OPT:alias=restricted   -m3dnow   -IPA:inline=off 
482.sphinx3:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:malloc_alg=2   -CG:sse_cse_regs=0   -CG:locs_shallow_depth=1   -CG:cmp_peep=on   -CG:local_sched_alg=1   -INLINE:aggressive=on 

C++ benchmarks:

444.namd:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:ignore_feedback=off   -CG:local_sched_alg=2   -CG:load_exe=0   -CG:compute_to=on   -OPT:unroll_size=256   -fno-exceptions   -HP:bdt=2m:heap=2m 
447.dealII:  -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -LNO:opt=0   -fno-emit-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -TENV:frame_pointer=off 
450.soplex:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -INLINE:aggressive=on   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -CG:load_exe=0   -fno-exceptions   -m32   -HP:bdt=2m:heap=2m 
453.povray:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -INLINE:aggressive=on   -HP:bdt=2m:heap=2m 

Fortran benchmarks:

410.bwaves:  -march=barcelona   -Ofast   -apo   -OPT:malloc_alg=2   -CG:use_prefetchnta=on   -CG:cmp_peep=on   -LNO:blocking=off   -LNO:prefetch=3   -LNO:prefetch_ahead=5   -LNO:ignore_feedback=off   -LNO:apo_use_feedback=on   -WOPT:aggstr=0 
416.gamess:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:fu=6   -LNO:blocking=0   -LNO:prefetch=0   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256   -HP:bdt=2m:heap=2m 
434.zeusmp:  -march=barcelona   -Ofast   -apo   -LNO:blocking=off   -LNO:interchange=off   -LNO:fusion_peeling_limit=0   -OPT:treeheight=on   -OPT:unroll_size=256   -CG:cmp_peep=on   -CG:compute_to=on   -GRA:prioritize_by_density=on   -HP:bdt=2m:heap=2m 
437.leslie3d:  -march=barcelona   -Ofast   -apo   -OPT:unroll_size=256   -LNO:prefetch_ahead=4   -LNO:parallel_overhead=32768   -GRA:prioritize_by_density=on   -m3dnow   -HP:bdt=2m:heap=2m 
459.GemsFDTD:  -march=barcelona   -Ofast   -apo   -LNO:fission=2   -LNO:prefetch_ahead=1   -CG:load_exe=0   -CG:local_sched_alg=1   -HP 
465.tonto:  -march=barcelona   -Ofast   -apo   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -HP 

Benchmarks using both Fortran and C:

435.gromacs:  -march=barcelona   -Ofast   -apo   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m 
436.cactusADM:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -apo   -LANG:heap_allocation_threshold=1000   -LNO:prefetch_ahead=1   -HP:bdt=2m:heap=2m 
454.calculix:  -march=barcelona   -Ofast   -LNO:prefetch_ahead=30   -CG:load_exe=0   -CG:ptr_load_use=0   -CG:local_sched_alg=2   -CG:compute_to=on   -WOPT:unroll=2   -GRA:optimize_boundary=on   -HP:bdt=2m:heap=2m   -apo 
481.wrf:  basepeak = yes 
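
The peak flag listings can be cross-checked against the "Peak Pointers: 32/64-bit" entry: the two benchmarks missing from the peak portability list (which sets -DSPEC_CPU_LP64) are the two built with -m32 in the peak optimization flags. The sketch below makes that check explicit. The sets are transcribed by hand from this result, and the variable names are illustrative.

```python
# All 17 CFP2006 benchmarks, as listed under Base Portability Flags.
ALL_BENCHMARKS = {
    "410.bwaves", "416.gamess", "433.milc", "434.zeusmp", "435.gromacs",
    "436.cactusADM", "437.leslie3d", "444.namd", "447.dealII", "450.soplex",
    "453.povray", "454.calculix", "459.GemsFDTD", "465.tonto", "470.lbm",
    "481.wrf", "482.sphinx3",
}

# Benchmarks listed with -DSPEC_CPU_LP64 under Peak Portability Flags.
PEAK_LP64 = {
    "410.bwaves", "416.gamess", "433.milc", "434.zeusmp", "435.gromacs",
    "436.cactusADM", "437.leslie3d", "444.namd", "453.povray",
    "454.calculix", "459.GemsFDTD", "465.tonto", "470.lbm", "481.wrf",
    "482.sphinx3",
}

# Benchmarks whose Peak Optimization Flags include -m32.
PEAK_M32 = {"447.dealII", "450.soplex"}

not_lp64_at_peak = ALL_BENCHMARKS - PEAK_LP64
print("32-bit at peak:", sorted(not_lp64_at_peak))
print("matches -m32 list:", not_lp64_at_peak == PEAK_M32)
```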

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.html,
http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.xml,
http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.xml.