SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Supermicro (Test Sponsor: Advanced Micro Devices)

Supermicro A+ Server 1022G-NTF,
AMD Opteron 6134

SPECfp®2006 = 26.0

CPU2006 license: 49
Test date: Sep-2010
Test sponsor: Advanced Micro Devices
Hardware Availability: Mar-2010
Tested by: Advanced Micro Devices
Software Availability: May-2010
Hardware
CPU Name: AMD Opteron 6134
CPU Characteristics:
CPU MHz: 2300
FPU: Integrated
CPU(s) enabled: 16 cores, 2 chips, 8 cores/chip
CPU(s) orderable: 1,2 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 12 MB I+D on chip per chip, 6 MB shared / 4 cores
Other Cache: None
Memory: 64 GB (16 x 4 GB 2Rx4 PC3-10600R-9, ECC)
Disk Subsystem: 1 x 250 GB SATA, 7200 RPM
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 5.4,
Advanced Platform with patch RHSA-2009:1670,
Kernel 2.6.18-164.9.1.el5
Compiler: x86 Open64 4.2.3.2 Compiler Suite (from AMD)
Auto Parallel: Yes
File System: ext3
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: binutils 2.18

Results Table

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

                Base                                         Peak
Benchmark       Seconds Ratio  Seconds Ratio  Seconds Ratio  Seconds Ratio  Seconds Ratio  Seconds Ratio
410.bwaves          222  61.2      222  61.2      220  61.7      214  63.4      215  63.2      215  63.3
416.gamess         1319  14.8     1324  14.8     1326  14.8     1209  16.2     1208  16.2     1214  16.1
433.milc            537  17.1      537  17.1      540  17.0      391  23.5      391  23.5      391  23.5
434.zeusmp          281  32.3      281  32.4      282  32.3      272  33.4      272  33.4      272  33.5
435.gromacs         580  12.3      582  12.3      579  12.3      451  15.8      450  15.9      450  15.9
436.cactusADM       200  59.8      202  59.3      198  60.3      105   113      106   112      107   112
437.leslie3d        502  18.7      505  18.6      504  18.7      497  18.9      499  18.9      495  19.0
444.namd            678  11.8      677  11.8      676  11.9      615  13.0      617  13.0      618  13.0
447.dealII          507  22.6      507  22.5      506  22.6      440  26.0      442  25.9      440  26.0
450.soplex          622  13.4      620  13.5      623  13.4      544  15.3      545  15.3      545  15.3
453.povray          310  17.1      311  17.1      311  17.1      301  17.7      300  17.7      300  17.8
454.calculix        405  20.4      405  20.4      406  20.3      385  21.4      387  21.3      383  21.6
459.GemsFDTD        349  30.4      349  30.4      350  30.4      330  32.2      329  32.2      329  32.2
465.tonto           524  18.8      521  18.9      522  18.8      499  19.7      497  19.8      496  19.9
470.lbm             581  23.6      581  23.7      580  23.7      294  46.8      293  46.9      293  46.9
481.wrf             362  30.9      362  30.8      362  30.9      362  30.9      362  30.8      362  30.9
482.sphinx3         858  22.7      857  22.8      858  22.7      822  23.7      820  23.8      822  23.7
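
SPEC CPU2006 overall metrics are the geometric mean of the per-benchmark median ratios. The Python sketch below recomputes that mean from the ratios in the table above. It is an illustration only: the names RATIOS, geometric_mean and overall are made up for this example, and because the table prints rounded ratios, the result may differ from an officially reported figure in the last digit.

```python
from math import exp, log
from statistics import median

# Ratios for the three base runs and the three peak runs of each benchmark,
# copied from the results table (rounded as printed there).
RATIOS = {
    "410.bwaves":    ((61.2, 61.2, 61.7), (63.4, 63.2, 63.3)),
    "416.gamess":    ((14.8, 14.8, 14.8), (16.2, 16.2, 16.1)),
    "433.milc":      ((17.1, 17.1, 17.0), (23.5, 23.5, 23.5)),
    "434.zeusmp":    ((32.3, 32.4, 32.3), (33.4, 33.4, 33.5)),
    "435.gromacs":   ((12.3, 12.3, 12.3), (15.8, 15.9, 15.9)),
    "436.cactusADM": ((59.8, 59.3, 60.3), (113.0, 112.0, 112.0)),
    "437.leslie3d":  ((18.7, 18.6, 18.7), (18.9, 18.9, 19.0)),
    "444.namd":      ((11.8, 11.8, 11.9), (13.0, 13.0, 13.0)),
    "447.dealII":    ((22.6, 22.5, 22.6), (26.0, 25.9, 26.0)),
    "450.soplex":    ((13.4, 13.5, 13.4), (15.3, 15.3, 15.3)),
    "453.povray":    ((17.1, 17.1, 17.1), (17.7, 17.7, 17.8)),
    "454.calculix":  ((20.4, 20.4, 20.3), (21.4, 21.3, 21.6)),
    "459.GemsFDTD":  ((30.4, 30.4, 30.4), (32.2, 32.2, 32.2)),
    "465.tonto":     ((18.8, 18.9, 18.8), (19.7, 19.8, 19.9)),
    "470.lbm":       ((23.6, 23.7, 23.7), (46.8, 46.9, 46.9)),
    "481.wrf":       ((30.9, 30.8, 30.9), (30.9, 30.8, 30.9)),
    "482.sphinx3":   ((22.7, 22.8, 22.7), (23.7, 23.8, 23.7)),
}


def geometric_mean(values):
    """Return the geometric mean of an iterable of positive numbers."""
    values = list(values)
    return exp(sum(log(v) for v in values) / len(values))


def overall(column):
    """Geometric mean of the median ratios; column 0 is base, 1 is peak."""
    return geometric_mean(median(runs[column]) for runs in RATIOS.values())


if __name__ == "__main__":
    print(f"base geometric mean: {overall(0):.1f}")
    print(f"peak geometric mean: {overall(1):.1f}")
```

Taking the median of three runs before averaging keeps a single outlier run from moving the overall figure.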

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.
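
The actual submit command lives in the configuration file and is not reproduced in this report. As a rough illustration of what core binding with numactl can look like, the Python sketch below builds one command line per core. The function name bind_command and the executable and input names are hypothetical; --physcpubind and --localalloc are standard numactl options, but whether the tested configuration used exactly these is an assumption.

```python
import shlex


def bind_command(core, command):
    """Prefix `command` so it runs pinned to one core with node-local memory.

    Hypothetical helper: the real submit line is in the SPEC config file.
    """
    return ["numactl", f"--physcpubind={core}", "--localalloc", *command]


if __name__ == "__main__":
    # "./benchmark_exe" and "input.dat" are placeholder names.
    for core in range(16):  # 16 cores enabled on the tested system
        print(shlex.join(bind_command(core, ["./benchmark_exe", "input.dat"])))
```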

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152'  was used to set environment locked pages in memory limit

Set vm/nr_hugepages=4000 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

cpuspeed stop was used to set the CPU frequency to its maximum.
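
To put the settings above in perspective, the short Python calculation below converts them to sizes. It assumes 2 MiB huge pages (consistent with the 2m values in the -HP flags, but not stated in these notes) and that the 'ulimit -l' value is in KiB, as it is in bash. The constant and function names are invented for this example.

```python
KIB = 1024
MIB = 1024 * KIB
GIB = 1024 * MIB

HUGE_PAGE_BYTES = 2 * MIB  # assumption: 2 MiB huge pages
NR_HUGEPAGES = 4000        # vm/nr_hugepages from the notes
MEMLOCK_KIB = 2097152      # 'ulimit -l' value, assumed to be in KiB
TOTAL_RAM_GIB = 64         # installed memory from the hardware section


def hugepage_pool_bytes(pages=NR_HUGEPAGES, page_bytes=HUGE_PAGE_BYTES):
    """Memory reserved for the huge page pool, in bytes."""
    return pages * page_bytes


def memlock_bytes(limit_kib=MEMLOCK_KIB):
    """Locked-memory limit, in bytes."""
    return limit_kib * KIB


if __name__ == "__main__":
    pool_gib = hugepage_pool_bytes() / GIB
    print(f"huge page pool: {pool_gib:.2f} GiB "
          f"({100 * pool_gib / TOTAL_RAM_GIB:.1f}% of RAM)")
    print(f"locked-memory limit: {memlock_bytes() / GIB:.0f} GiB")
```

Under these assumptions the pool reserves a little under 8 GiB of the 64 GB installed, and the locked-memory limit works out to 2 GiB.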

General Notes

Environment variables set by runspec before the start of the run:
LD_LIBRARY_PATH = "/root/work/cpu2006/amd1002mc-speed-libs-revA/64:/root/work/cpu2006/amd1002mc-speed-libs-revA/32"
O64_OMP_AFFINITY_MAP = "0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15"
O64_OMP_SPIN_USER_LOCK = "true"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64

Base Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -march=barcelona   -Ofast   -HP:bdt=2m:heap=2m 

C++ benchmarks:

 -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -HP:bdt=2m:heap=2m 

Fortran benchmarks:

 -march=barcelona   -Ofast   -apo   -LNO:parallel_overhead=10000   -LNO:fusion_peeling_limit=0   -HP:bdt=2m:heap=2m 

Benchmarks using both Fortran and C:

 -march=barcelona   -Ofast   -HP:bdt=2m:heap=2m   -apo   -LNO:parallel_overhead=10000   -LNO:fusion_peeling_limit=0 

Peak Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -march=barcelona   -Ofast   -apo   -CG:movnti=1   -CG:local_sched_alg=1   -CG:locs_shallow_depth=1   -CG:compute_to=on   -HP:bdt=2m:heap=2m   -LNO:prefetch=3 
470.lbm:  -march=barcelona   -Ofast   -mso   -apo   -CG:sse_cse_regs=0   -LNO:prefetch_ahead=4   -CG:locs_shallow_depth=1   -CG:cmp_peep=on   -CG:compute_to=on   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -OPT:keep_ext=on   -OPT:alias=restricted   -m3dnow   -IPA:inline=off 
482.sphinx3:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:malloc_alg=2   -CG:sse_cse_regs=0   -CG:locs_shallow_depth=1   -CG:cmp_peep=on   -CG:local_sched_alg=1   -INLINE:aggressive=on 

C++ benchmarks:

444.namd:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:ignore_feedback=off   -CG:local_sched_alg=2   -CG:load_exe=0   -CG:compute_to=on   -OPT:unroll_size=256   -fno-exceptions   -HP:bdt=2m:heap=2m 
447.dealII:  -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -LNO:opt=0   -fno-emit-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -TENV:frame_pointer=off 
450.soplex:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -INLINE:aggressive=on   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -CG:load_exe=0   -fno-exceptions   -m32   -HP:bdt=2m:heap=2m 
453.povray:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -INLINE:aggressive=on   -HP:bdt=2m:heap=2m 

Fortran benchmarks:

410.bwaves:  -march=barcelona   -Ofast   -apo   -OPT:malloc_alg=2   -CG:use_prefetchnta=on   -CG:cmp_peep=on   -LNO:blocking=off   -LNO:prefetch=3   -LNO:prefetch_ahead=5   -LNO:ignore_feedback=off   -LNO:apo_use_feedback=on   -WOPT:aggstr=0 
416.gamess:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:fu=6   -LNO:blocking=0   -LNO:prefetch=0   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256   -HP:bdt=2m:heap=2m 
434.zeusmp:  -march=barcelona   -Ofast   -apo   -LNO:blocking=off   -LNO:interchange=off   -LNO:fusion_peeling_limit=0   -OPT:treeheight=on   -OPT:unroll_size=256   -CG:cmp_peep=on   -CG:compute_to=on   -GRA:prioritize_by_density=on   -HP:bdt=2m:heap=2m 
437.leslie3d:  -march=barcelona   -Ofast   -apo   -OPT:unroll_size=256   -LNO:prefetch_ahead=4   -LNO:parallel_overhead=32768   -GRA:prioritize_by_density=on   -m3dnow   -HP:bdt=2m:heap=2m 
459.GemsFDTD:  -march=barcelona   -Ofast   -apo   -LNO:fission=2   -LNO:prefetch_ahead=1   -CG:load_exe=0   -CG:local_sched_alg=1   -HP 
465.tonto:  -march=barcelona   -Ofast   -apo   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -HP 

Benchmarks using both Fortran and C:

435.gromacs:  -march=barcelona   -Ofast   -apo   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m 
436.cactusADM:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -apo   -LANG:heap_allocation_threshold=1000   -LNO:prefetch_ahead=1   -HP:bdt=2m:heap=2m 
454.calculix:  -march=barcelona   -Ofast   -LNO:prefetch_ahead=30   -CG:load_exe=0   -CG:ptr_load_use=0   -CG:local_sched_alg=2   -CG:compute_to=on   -WOPT:unroll=2   -GRA:optimize_boundary=on   -HP:bdt=2m:heap=2m   -apo 
481.wrf:  basepeak = yes 
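
Several peak entries above list -fb_create fbdata (pass 1) and -fb_opt fbdata (pass 2). This is feedback-directed optimization: the benchmark is first built with instrumentation, run on a training workload to collect a profile, and then rebuilt using that profile. The Python sketch below only assembles the two compiler command lines, using the 453.povray flags as an example. The helper name fdo_build_commands, the source file name and the output name are hypothetical, and the SPEC tools drive the real builds and the training run.

```python
def fdo_build_commands(compiler, sources, output, common_flags, profile="fbdata"):
    """Return (pass1, pass2) command lines for a two-pass feedback build.

    pass1 builds an instrumented binary that writes profile data;
    pass2 rebuilds using the collected profile. A training run of the
    pass1 binary must happen between the two.
    """
    pass1 = [compiler, *common_flags, "-fb_create", profile, "-o", output, *sources]
    pass2 = [compiler, *common_flags, "-fb_opt", profile, "-o", output, *sources]
    return pass1, pass2


if __name__ == "__main__":
    # Flags taken from the 453.povray peak entry; file names are placeholders.
    flags = ["-march=barcelona", "-Ofast", "-INLINE:aggressive=on", "-HP:bdt=2m:heap=2m"]
    first, second = fdo_build_commands("openCC", ["example.cpp"], "example", flags)
    print(" ".join(first))
    print("# ... run the instrumented binary on the training workload ...")
    print(" ".join(second))
```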

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.html,
http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.20101207.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.xml,
http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.20101207.xml.