SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

MSI (Test Sponsor: Advanced Micro Devices)

MS-S0231, AMD Opteron 3280

CPU2006 license: 49
Test date: Mar-2012
Test sponsor: Advanced Micro Devices
Hardware Availability: Mar-2012
Tested by: Advanced Micro Devices
Software Availability: Dec-2011
Hardware
CPU Name: AMD Opteron 3280
CPU Characteristics: AMD Turbo CORE technology up to 3.40 GHz
CPU MHz: 2400
FPU: Integrated
CPU(s) enabled: 8 cores, 1 chip, 8 cores/chip
CPU(s) orderable: 1 chip
Primary Cache: 256 KB I on chip per chip, 64 KB shared / 2 cores; 16 KB D on chip per core
Secondary Cache: 8 MB I+D on chip per chip, 2 MB shared / 2 cores
L3 Cache: 8 MB I+D on chip per chip
Other Cache: None
Memory: 32 GB (8 x 8 GB 2Rx4 PC3-12800U-11, running at 1333MHz)
Disk Subsystem: 1 x 500 GB SATA, 7200 RPM
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 6.2, Kernel 2.6.32-220.el6.x86_64
Compiler: C/C++/Fortran: Version 4.5.1 of x86 Open64 Compiler Suite (from AMD)
Auto Parallel: No
File System: ext3
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
Columns, left to right: Benchmark; Base Copies; Base Seconds and Ratio for runs 1, 2, and 3; Peak Copies; Peak Seconds and Ratio for runs 1, 2, and 3.
410.bwaves 8 1456 74.7 1451 74.9 1458 74.6 8 1443 75.4 1450 75.0 1448 75.1
416.gamess 8 1899 82.5 1900 82.5 1911 82.0 8 1810 86.5 1811 86.5 1813 86.4
433.milc 8 1201 61.1 1227 59.8 1213 60.6 8 1038 70.7 1028 71.4 1029 71.4
434.zeusmp 8 770 94.5 768 94.8 762 95.6 8 754 96.5 750 97.1 750 97.0
435.gromacs 8 598 95.5 598 95.5 598 95.5 8 471 121   470 121   471 121  
436.cactusADM 8 859 111   852 112   852 112   8 785 122   788 121   785 122  
437.leslie3d 8 1488 50.6 1488 50.5 1488 50.5 8 1167 64.4 1168 64.4 1166 64.5
444.namd 8 775 82.7 791 81.2 778 82.5 8 771 83.3 756 84.8 758 84.7
447.dealII 8 540 169   541 169   539 170   8 505 181   504 181   507 181  
450.soplex 8 1067 62.5 1069 62.4 1070 62.4 8 1029 64.8 1027 64.9 1029 64.9
453.povray 8 402 106   403 106   402 106   8 361 118   361 118   360 118  
454.calculix 8 477 138   478 138   478 138   8 458 144   458 144   458 144  
459.GemsFDTD 8 1846 46.0 1844 46.0 1848 45.9 8 1691 50.2 1691 50.2 1691 50.2
465.tonto 8 837 94.0 838 93.9 836 94.2 8 768 103   766 103   767 103  
470.lbm 8 1121 98.1 1124 97.8 1133 97.0 8 1114 98.7 1114 98.7 1113 98.8
481.wrf 8 1002 89.2 998 89.5 993 90.0 8 1002 89.2 998 89.5 993 90.0
482.sphinx3 8 1824 85.5 1824 85.5 1825 85.4 8 1522 102   1518 103   1523 102  
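
Each row reports three runs per tuning level. As a rough sketch of how a per-benchmark figure and an aggregate are typically derived, the snippet below takes the median of the three ratios and combines several medians with a geometric mean, which is how SPEC rate metrics are aggregated. The helper names are illustrative, and only three rows are transcribed, so the printed aggregate is not the official score for this result.

```python
import math
from statistics import median

# Base ratios for three benchmarks, transcribed from the table above
# (three runs each, in run order).
base_ratios = {
    "410.bwaves": [74.7, 74.9, 74.6],
    "433.milc": [61.1, 59.8, 60.6],
    "444.namd": [82.7, 81.2, 82.5],
}

def median_ratios(runs):
    """Return the median ratio per benchmark."""
    return {name: median(values) for name, values in runs.items()}

def geometric_mean(values):
    """Geometric mean, computed via logarithms."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))

medians = median_ratios(base_ratios)
subset_score = geometric_mean(medians.values())
print(medians)
print(round(subset_score, 1))
```

Extending `base_ratios` to all 17 rows would give the geometric mean over the full suite under the same assumptions.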

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.
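
The exact submit line lives in the configuration file and is not reproduced in this report. As an illustration only, the sketch below builds one numactl command per copy, pinning copy N to core N. The function name, the choice of `--physcpubind` and `--localalloc`, and the placeholder command are assumptions rather than the tested configuration.

```python
def build_submit_commands(copies, command):
    """Return one numactl-wrapped command line per benchmark copy.

    Assumes copy i is pinned to core i, which may differ from the
    binding used in the actual config file.
    """
    return [
        f"numactl --physcpubind={core} --localalloc {command}"
        for core in range(copies)
    ]

# Hypothetical usage: 8 copies, matching the 8 cores enabled on this system.
for line in build_submit_commands(8, "./benchmark_binary"):
    print(line)
```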

Operating System Notes

'ulimit -s unlimited' was used to set the environment stack size.
'ulimit -l 2097152' was used to set the environment locked-pages-in-memory limit.

Set transparent_hugepage=never as a boot parameter in /boot/grub/menu.lst

Set vm/nr_hugepages=3840 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "480"
LD_LIBRARY_PATH = "/root/work/cpu2006v1.2/amd1104-rate-libs-revC/32:/root/work/cpu2006v1.2/amd1104-rate-libs-revC/64"
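
The huge page numbers in these notes line up arithmetically. A small sketch, assuming 2 MiB huge pages (consistent with the 2m values passed to -HP in the optimization flags, but not stated outright) and assuming HUGETLB_LIMIT is a per-copy page budget, which this report does not confirm:

```python
HUGE_PAGE_MIB = 2        # assumed huge page size (2 MiB)
NR_HUGEPAGES = 3840      # vm/nr_hugepages from /etc/sysctl.conf
HUGETLB_LIMIT = 480      # environment variable set by runspec
COPIES = 8               # copies per benchmark in the results table

def pool_size_gib(pages, page_mib=HUGE_PAGE_MIB):
    """Total memory reserved for the huge page pool, in GiB."""
    return pages * page_mib / 1024

print(pool_size_gib(NR_HUGEPAGES))             # 7.5
print(HUGETLB_LIMIT * COPIES == NR_HUGEPAGES)  # True
```

Under those assumptions the pool reserves 7.5 GiB of the 32 GB installed, and 480 pages for each of the 8 copies accounts for the whole pool.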

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64

Binaries were compiled on a system with 2x AMD Opteron 6274 chips and 64 GB of memory, running RHEL 6.1.

Base Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LP64   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -march=bdver1   -Ofast   -OPT:malloc_alg=1   -HP:bd=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -mso 

C++ benchmarks:

 -march=bdver1   -Ofast   -static   -CG:load_exe=0   -OPT:malloc_alg=1   -INLINE:aggressive=on   -HP:bd=2m:heap=2m   -D__OPEN64_FAST_SET 

Fortran benchmarks:

 -march=bdver1   -Ofast   -LNO:blocking=off   -OPT:rsqrt=2   -OPT:unroll_size=256   -HP:bd=2m:heap=2m   -mso 

Benchmarks using both Fortran and C:

 -march=bdver1   -Ofast   -OPT:malloc_alg=1   -HP:bd=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -mso   -LNO:blocking=off   -OPT:rsqrt=2   -OPT:unroll_size=256 
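
Many of the Open64 flags listed here use a grouped form, -GROUP:name=value:name=value. To make that structure easier to read, here is a minimal parsing sketch. The function name and the returned shape are illustrative choices, and it handles only this grouped form plus plain flags.

```python
def parse_flag(flag):
    """Split an Open64-style flag into (group, settings).

    '-HP:bd=2m:heap=2m' -> ('HP', {'bd': '2m', 'heap': '2m'})
    Plain flags such as '-Ofast' return (flag_name, {}).
    """
    body = flag.lstrip("-")
    group, _, rest = body.partition(":")
    settings = {}
    # Each remaining colon-separated item is a name=value pair.
    for item in filter(None, rest.split(":")):
        name, _, value = item.partition("=")
        settings[name] = value
    return group, settings

# Base optimization flags for the C benchmarks, as listed above.
base_c_flags = (
    "-march=bdver1 -Ofast -OPT:malloc_alg=1 -HP:bd=2m:heap=2m "
    "-IPA:plimit=8000 -IPA:small_pu=100 -mso"
)
for flag in base_c_flags.split():
    print(parse_flag(flag))
```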

Peak Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LP64   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -march=bdver1   -Ofast   -CG:movnti=1   -CG:locs_best=on   -HP:bdt=2m:heap=2m   -IPA:plimit=7000   -IPA:callee_limit=1200   -OPT:struct_array_copy=2   -OPT:alias=field_sensitive   -mso 
470.lbm:  -march=bdver1   -Ofast   -CG:cmp_peep=on   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -OPT:keep_ext=on   -HP:bdt=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -mso 
482.sphinx3:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -IPA:plimit=1000   -OPT:malloc_alg=2   -CG:cmp_peep=on   -CG:local_sched_alg=2   -CG:p2align=0   -INLINE:aggressive=on   -LNO:prefetch=2   -LNO:prefetch_ahead=4   -mso 

C++ benchmarks:

444.namd:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -IPA:plimit=3000   -LNO:ignore_feedback=off   -CG:local_sched_alg=2   -CG:load_exe=0   -OPT:unroll_size=256   -fno-exceptions   -HP:bdt=2m:heap=2m 
447.dealII:  -march=bdver1   -Ofast   -D__OPEN64_FAST_SET   -static   -INLINE:aggressive=on   -LNO:opt=0   -LNO:simd=0   -fno-emit-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -CG:movext_icmp=off   -TENV:frame_pointer=off 
450.soplex:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:ignore_feedback=off   -INLINE:aggressive=on   -OPT:RO=1   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -fno-exceptions   -CG:p2align=0   -m32   -HP:bdt=2m:heap=2m   -WOPT:sib=on 
453.povray:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -CG:pre_local_sched=off   -CG:p2align=0   -CG:p2align_split=on   -CG:dsched=on   -INLINE:aggressive=on   -HP:bd=2m:heap=2m   -OPT:transform=2   -OPT:alias=disjoint   -WOPT:aggcm=0 

Fortran benchmarks:

410.bwaves:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:Ofast   -OPT:treeheight=on   -LNO:blocking=off   -LNO:ignore_feedback=off   -LNO:fu=4   -LNO:loop_model_simd=on   -LNO:simd_rm_unity_remainder=on   -WOPT:aggstr=0   -HP:bdt=2m:heap=2m   -CG:cmp_peep=on 
416.gamess:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:fu=6   -LNO:blocking=0   -LNO:simd=0   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256   -OPT:unroll_times_max=2   -CG:local_sched_alg=1   -HP:bdt=2m:heap=2m   -WOPT:sib=on 
434.zeusmp:  -march=bdver1   -Ofast   -LNO:blocking=off   -LNO:interchange=off   -IPA:plimit=1500   -HP:bdt=2m:heap=2m 
437.leslie3d:  -march=bdver1   -Ofast   -CG:pre_minreg_level=2   -LNO:simd=0   -LNO:fusion=2   -HP:bdt=2m:heap=2m   -mso 
459.GemsFDTD:  -march=bdver1   -Ofast   -IPA:plimit=1500   -OPT:unroll_size=0   -LNO:fission=2   -CG:load_exe=0   -CG:local_sched_alg=2   -HP 
465.tonto:  -march=bdver1   -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -HP:bdt=2m:heap=2m 

Benchmarks using both Fortran and C:

435.gromacs:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m   -CG:local_sched_alg=2   -GRA:unspill=ON   -CG:load_exe=3   -LNO:simd=3 
436.cactusADM:  -march=bdver1   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:blocking=off   -LNO:prefetch=2   -HP   -CG:locs_shallow_depth=1   -CG:load_exe=0   -CG:dsched=on   -WOPT:sib=on 
454.calculix:  -march=bdver1   -Ofast   -OPT:unroll_size=256   -GRA:optimize_boundary=on   -CG:dsched=on   -HP:bdt=2m:heap=2m 
481.wrf:  basepeak = yes 
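
The entry basepeak = yes means 481.wrf reuses its base build for peak, which is why its base and peak columns in the results table are identical. A quick sketch comparing median ratios for a few rows makes this visible. The helper name is illustrative and only three benchmarks are transcribed.

```python
from statistics import median

# (base ratios, peak ratios) transcribed from the results table.
runs = {
    "435.gromacs": ([95.5, 95.5, 95.5], [121, 121, 121]),
    "470.lbm": ([98.1, 97.8, 97.0], [98.7, 98.7, 98.8]),
    "481.wrf": ([89.2, 89.5, 90.0], [89.2, 89.5, 90.0]),
}

def peak_over_base(base, peak):
    """Ratio of median peak throughput to median base throughput."""
    return median(peak) / median(base)

for name, (base, peak) in runs.items():
    print(f"{name}: {peak_over_base(base, peak):.3f}")
```

A value of 1.000 for 481.wrf reflects the shared build, while benchmarks with dedicated peak flags show values above 1.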

The flags file that was used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-451-flags-rate-revC-I.html

You can also download the XML flags source by saving the following link:
http://www.spec.org/cpu2006/flags/x86-open64-451-flags-rate-revC-I.xml