SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Supermicro

Supermicro A+ Server 1012G-MTF
(H8SGL-F, AMD Opteron 6272)

CPU2006 license: 001176 Test date: May-2012
Test sponsor: Supermicro Hardware Availability: Nov-2011
Tested by: Supermicro Software Availability: Dec-2011
Benchmark results graph
Hardware
CPU Name: AMD Opteron 6272
CPU Characteristics: AMD Turbo CORE technology up to 3.00 GHz
CPU MHz: 2100
FPU: Integrated
CPU(s) enabled: 16 cores, 1 chip, 16 cores/chip
CPU(s) orderable: 1 chip
Primary Cache: 512 KB I on chip per chip,
64 KB I shared / 2 cores;
16 KB D on chip per core
Secondary Cache: 16 MB I+D on chip per chip, 2 MB shared / 2 cores
L3 Cache: 16 MB I+D on chip per chip, 8 MB shared / 8 cores
Other Cache: None
Memory: 64 GB (8 x 8 GB 2Rx4 PC3-12800R-11, ECC)
Disk Subsystem: 1 x 300 GB SATA, 10000 RPM
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 6.2,
Kernel 2.6.32-220.el6.x86_64
Compiler: C/C++/Fortran: Version 4.5.1 of x86 Open64
Compiler Suite (from AMD)
Auto Parallel: No
File System: ext3
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 16 1373 158 1371 159 1368 159 16 1354 161 1357 160 1355 160
416.gamess 16 2189 143 2209 142 2187 143 16 2067 152 2087 150 2067 152
433.milc 16 1100 134 1100 134 1100 133 16 941 156 941 156 941 156
434.zeusmp 16 775 188 778 187 773 188 16 757 192 750 194 752 194
435.gromacs 16 670 170 679 168 673 170 16 513 223 513 223 512 223
436.cactusADM 16 853 224 857 223 854 224 16 800 239 805 237 804 238
437.leslie3d 16 1358 111 1360 111 1360 111 16 1082 139 1080 139 1083 139
444.namd 16 878 146 875 147 875 147 16 862 149 863 149 860 149
447.dealII 16 572 320 575 318 570 321 16 551 332 548 334 560 327
450.soplex 16 1105 121 1014 132 1013 132 16 1044 128 997 134 977 137
453.povray 16 457 186 456 186 459 186 16 411 207 412 207 412 207
454.calculix 16 538 245 541 244 545 242 16 520 254 519 254 517 256
459.GemsFDTD 16 1682 101 1681 101 1684 101 16 1530 111 1530 111 1530 111
465.tonto 16 980 161 977 161 977 161 16 903 174 906 174 902 175
470.lbm 16 1009 218 1012 217 1018 216 16 1008 218 1014 217 1007 218
481.wrf 16 937 191 940 190 935 191 16 936 191 938 190 940 190
482.sphinx3 16 1824 171 1814 172 1812 172 16 1603 195 1583 197 1574 198

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152'  was used to set environment locked pages in memory limit

Set transparent_hugepage=never as a boot parameter in /boot/grub/menu.lst

Set vm/nr_hugepages=14336 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "896"
LD_LIBRARY_PATH = "/usr/cpu2006/amd1104-rate-libs-revC/32:/usr/cpu2006/amd1104-rate-libs-revC/64"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64

Binaries were compiled on a system with 2x AMD Opteron 6274 chips + 64GB Memory using RHEL 6.1

Base Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LP64   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -march=bdver1   -Ofast   -OPT:malloc_alg=1   -HP:bd=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -mso 

C++ benchmarks:

 -march=bdver1   -Ofast   -static   -CG:load_exe=0   -OPT:malloc_alg=1   -INLINE:aggressive=on   -HP:bd=2m:heap=2m   -D__OPEN64_FAST_SET 

Fortran benchmarks:

 -march=bdver1   -Ofast   -LNO:blocking=off   -OPT:rsqrt=2   -OPT:unroll_size=256   -HP:bd=2m:heap=2m   -mso 

Benchmarks using both Fortran and C:

 -march=bdver1   -Ofast   -OPT:malloc_alg=1   -HP:bd=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -mso   -LNO:blocking=off   -OPT:rsqrt=2   -OPT:unroll_size=256 

Peak Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LP64   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -march=bdver1   -Ofast   -CG:movnti=1   -CG:locs_best=on   -HP:bdt=2m:heap=2m   -IPA:plimit=7000   -IPA:callee_limit=1200   -OPT:struct_array_copy=2   -OPT:alias=field_sensitive   -mso 
470.lbm:  -march=bdver1   -Ofast   -CG:cmp_peep=on   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -OPT:keep_ext=on   -HP:bdt=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -mso 
482.sphinx3:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -IPA:plimit=1000   -OPT:malloc_alg=2   -CG:cmp_peep=on   -CG:local_sched_alg=2   -CG:p2align=0   -INLINE:aggressive=on   -LNO:prefetch=2   -LNO:prefetch_ahead=4   -mso 

C++ benchmarks:

444.namd:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -IPA:plimit=3000   -LNO:ignore_feedback=off   -CG:local_sched_alg=2   -CG:load_exe=0   -OPT:unroll_size=256   -fno-exceptions   -HP:bdt=2m:heap=2m 
447.dealII:  -march=bdver1   -Ofast   -D__OPEN64_FAST_SET   -static   -INLINE:aggressive=on   -LNO:opt=0   -LNO:simd=0   -fno-emit-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -CG:movext_icmp=off   -TENV:frame_pointer=off 
450.soplex:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:ignore_feedback=off   -INLINE:aggressive=on   -OPT:RO=1   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -fno-exceptions   -CG:p2align=0   -m32   -HP:bdt=2m:heap=2m   -WOPT:sib=on 
453.povray:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -CG:pre_local_sched=off   -CG:p2align=0   -CG:p2align_split=on   -CG:dsched=on   -INLINE:aggressive=on   -HP:bd=2m:heap=2m   -OPT:transform=2   -OPT:alias=disjoint   -WOPT:aggcm=0 

Fortran benchmarks:

410.bwaves:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:Ofast   -OPT:treeheight=on   -LNO:blocking=off   -LNO:ignore_feedback=off   -LNO:fu=4   -LNO:loop_model_simd=on   -LNO:simd_rm_unity_remainder=on   -WOPT:aggstr=0   -HP:bdt=2m:heap=2m   -CG:cmp_peep=on 
416.gamess:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:fu=6   -LNO:blocking=0   -LNO:simd=0   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256   -OPT:unroll_times_max=2   -CG:local_sched_alg=1   -HP:bdt=2m:heap=2m   -WOPT:sib=on 
434.zeusmp:  -march=bdver1   -Ofast   -LNO:blocking=off   -LNO:interchange=off   -IPA:plimit=1500   -HP:bdt=2m:heap=2m 
437.leslie3d:  -march=bdver1   -Ofast   -CG:pre_minreg_level=2   -LNO:simd=0   -LNO:fusion=2   -HP:bdt=2m:heap=2m   -mso 
459.GemsFDTD:  -march=bdver1   -Ofast   -IPA:plimit=1500   -OPT:unroll_size=0   -LNO:fission=2   -CG:load_exe=0   -CG:local_sched_alg=2   -HP 
465.tonto:  -march=bdver1   -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -HP:bdt=2m:heap=2m 

Benchmarks using both Fortran and C:

435.gromacs:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m   -CG:local_sched_alg=2   -GRA:unspill=ON   -CG:load_exe=3   -LNO:simd=3 
436.cactusADM:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:blocking=off   -LNO:prefetch=2   -HP   -CG:locs_shallow_depth=1   -CG:load_exe=0   -CG:dsched=on   -WOPT:sib=on 
454.calculix:  -march=bdver1   -Ofast   -OPT:unroll_size=256   -GRA:optimize_boundary=on   -CG:dsched=on   -HP:bdt=2m:heap=2m 
481.wrf:  -march=bdver1   -Ofast   -LNO:blocking=off   -LANG:copyinout=off   -IPA:callee_limit=5000   -GRA:prioritize_by_density=on   -CG:load_exe=1   -HP   -WOPT:sib=on 

The flags file that was used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-451-flags-rate-revC-I.html.

You can also download the XML flags source by saving the following link:
http://www.spec.org/cpu2006/flags/x86-open64-451-flags-rate-revC-I.xml.