                           SPEC(R) CFP2006 Summary
                 Tyan Tyan YR190-B8228, AMD Opteron 4228 HE
                     Test Sponsor: Advanced Micro Devices
                           Fri Jan 27 02:59:35 2012

CPU2006 License: 49                                      Test date: Jan-2012
Test sponsor: Advanced Micro Devices         Hardware availability: Nov-2011
Tested by:    Advanced Micro Devices         Software availability: Jul-2011

                 Base       Base       Base       Peak       Peak       Peak
Benchmarks       Ref.   Run Time      Ratio       Ref.   Run Time      Ratio
-------------- ------  ---------  ---------     ------  ---------  ---------
410.bwaves      13590       95.9        142 *    13590       85.0        160 *
410.bwaves      13590       96.0        142 S    13590       85.1        160 S
410.bwaves      13590       95.9        142 S    13590       84.8        160 S
416.gamess      19580        970       20.2 S    19580        913       21.4 *
416.gamess      19580        970       20.2 S    19580        913       21.4 S
416.gamess      19580        970       20.2 *    19580        913       21.4 S
433.milc         9180        273       33.6 *     9180        228       40.3 *
433.milc         9180        273       33.6 S     9180        228       40.3 S
433.milc         9180        273       33.6 S     9180        228       40.3 S
434.zeusmp       9100        137       66.3 S     9100        134       67.8 S
434.zeusmp       9100        137       66.5 *     9100        135       67.6 S
434.zeusmp       9100        137       66.5 S     9100        135       67.6 *
435.gromacs      7140        322       22.2 *     7140        306       23.3 S
435.gromacs      7140        322       22.2 S     7140        306       23.3 S
435.gromacs      7140        322       22.2 S     7140        306       23.3 *
436.cactusADM   11950       71.5        167 S    11950       68.0        176 S
436.cactusADM   11950       71.3        168 *    11950       68.1        176 *
436.cactusADM   11950       71.2        168 S    11950       68.4        175 S
437.leslie3d     9400        370       25.4 S     9400        342       27.5 *
437.leslie3d     9400        371       25.3 *     9400        341       27.5 S
437.leslie3d     9400        371       25.3 S     9400        343       27.4 S
444.namd         8020        442       18.2 S     8020        430       18.6 S
444.namd         8020        442       18.2 S     8020        430       18.6 *
444.namd         8020        442       18.2 *     8020        430       18.6 S
447.dealII      11440        273       42.0 S    11440        246       46.5 S
447.dealII      11440        271       42.2 *    11440        246       46.5 *
447.dealII      11440        271       42.2 S    11440        246       46.5 S
450.soplex       8340        347       24.1 S     8340        321       26.0 S
450.soplex       8340        346       24.1 S     8340        321       26.0 *
450.soplex       8340        347       24.1 *     8340        322       25.9 S
453.povray       5320        231       23.1 *     5320        214       24.8 S
453.povray       5320        231       23.1 S     5320        215       24.8 S
453.povray       5320        231       23.0 S     5320        215       24.8 *
454.calculix     8250        276       29.9 S     8250        243       34.0 S
454.calculix     8250        276       29.9 *     8250        243       34.0 *
454.calculix     8250        276       29.9 S     8250        243       34.0 S
459.GemsFDTD    10610        256       41.4 S    10610        219       48.4 *
459.GemsFDTD    10610        257       41.3 *    10610        219       48.4 S
459.GemsFDTD    10610        257       41.3 S    10610        219       48.5 S
465.tonto        9840        457       21.5 S     9840        378       26.0 S
465.tonto        9840        457       21.6 S     9840        378       26.0 S
465.tonto        9840        457       21.5 *     9840        378       26.0 *
470.lbm         13740        125        110 *    13740       66.0        208 *
470.lbm         13740        125        110 S    13740       66.2        208 S
470.lbm         13740        125        110 S    13740       65.7        209 S
481.wrf         11170        263       42.5 *    11170        253       44.2 S
481.wrf         11170        263       42.5 S    11170        253       44.1 S
481.wrf         11170        262       42.7 S    11170        253       44.2 *
482.sphinx3     19490        810       24.1 S    19490        595       32.7 S
482.sphinx3     19490        810       24.1 *    19490        595       32.7 S
482.sphinx3     19490        810       24.1 S    19490        595       32.7 *
==============================================================================
410.bwaves      13590       95.9        142 *    13590       85.0        160 *
416.gamess      19580        970       20.2 *    19580        913       21.4 *
433.milc         9180        273       33.6 *     9180        228       40.3 *
434.zeusmp       9100        137       66.5 *     9100        135       67.6 *
435.gromacs      7140        322       22.2 *     7140        306       23.3 *
436.cactusADM   11950       71.3        168 *    11950       68.1        176 *
437.leslie3d     9400        371       25.3 *     9400        342       27.5 *
444.namd         8020        442       18.2 *     8020        430       18.6 *
447.dealII      11440        271       42.2 *    11440        246       46.5 *
450.soplex       8340        347       24.1 *     8340        321       26.0 *
453.povray       5320        231       23.1 *     5320        215       24.8 *
454.calculix     8250        276       29.9 *     8250        243       34.0 *
459.GemsFDTD    10610        257       41.3 *    10610        219       48.4 *
465.tonto        9840        457       21.5 *     9840        378       26.0 *
470.lbm         13740        125        110 *    13740       66.0        208 *
481.wrf         11170        263       42.5 *    11170        253       44.2 *
482.sphinx3     19490        810       24.1 *    19490        595       32.7 *
 SPECfp(R)_base2006                    38.2
 SPECfp2006                                                             43.6


                                   HARDWARE
                                   --------
            CPU Name: AMD Opteron 4228 HE
 CPU Characteristics: AMD Turbo CORE technology up to 3.60 GHz
             CPU MHz: 2800
                 FPU: Integrated
      CPU(s) enabled: 12 cores, 2 chips, 6 cores/chip
    CPU(s) orderable: 1,2 chips
       Primary Cache: 192 KB I on chip per chip, 64 KB I shared / 2 cores;
                      16 KB D on chip per core
     Secondary Cache: 6 MB I+D on chip per chip, 2 MB shared / 2 cores
            L3 Cache: 8 MB I+D on chip per chip
         Other Cache: None
              Memory: 32 GB (4 x 8 GB 2Rx4 PC3-12800R-11, ECC)
      Disk Subsystem: 1 x 128 GB SATA, 7200 RPM
      Other Hardware: None


                                   SOFTWARE
                                   --------
    Operating System: Red Hat Enterprise Linux Server release 6.1,
                      Kernel 2.6.32-131.0.15.el6.x86_64
            Compiler: C/C++/Fortran: Version 4.2.5.2 of x86 Open64
                      Compiler Suite (from AMD)
       Auto Parallel: Yes
         File System: ext3
        System State: Run level 3 (Full multiuser with network)
       Base Pointers: 64-bit
       Peak Pointers: 32/64-bit
      Other Software: None


                                 Submit Notes
                                 ------------
    The config file option 'submit' was used.
    'numactl' was used to bind copies to the cores.
    See the configuration file for details.

                            Operating System Notes
                            ----------------------
    'ulimit -s unlimited' was used to set environment stack size
    'ulimit -l 2097152' was used to set environment locked pages in memory limit

    Set transparent_hugepage=never as a boot parameter in /boot/grub/menu.lst
    Set kernel/randomize_va_space=0 in /etc/sysctl.conf
    cpuspeed stop was used to set the CPU frequency to its maximum.

    Set vm/nr_hugepages=2000 in /etc/sysctl.conf
    mount -t hugetlbfs nodev /mnt/hugepages

                                General Notes
                                -------------
    Environment variables set by runspec before the start of the run:
    LD_LIBRARY_PATH = "/root/work/cpu2006v1.2/amd1104-speed-libs-revA/32:/root/work/cpu2006v1.2/amd1104-speed-libs-revA/64"
    O64_OMP_AFFINITY_MAP = "0,1,2,3,4,5,6,7,8,9,10,11"
    O64_OMP_SPIN_COUNT = "800000"
    O64_OMP_SPIN_USER_LOCK = "true"

    The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
    http://developer.amd.com/cpu/open64

    Binaries were compiled on a system with 2x AMD Opteron 6220 chips + 64GB Memory using RHEL 6.1

                           Base Compiler Invocation
                           ------------------------
C benchmarks:
     opencc

C++ benchmarks:
     openCC

Fortran benchmarks:
     openf95

Benchmarks using both Fortran and C:
     opencc openf95


                            Base Portability Flags
                            ----------------------
    410.bwaves: -DSPEC_CPU_LP64
    416.gamess: -DSPEC_CPU_LP64
      433.milc: -DSPEC_CPU_LP64
    434.zeusmp: -DSPEC_CPU_LP64
   435.gromacs: -DSPEC_CPU_LP64
 436.cactusADM:
                -DSPEC_CPU_LP64 -fno-second-underscore
  437.leslie3d: -DSPEC_CPU_LP64
      444.namd: -DSPEC_CPU_LP64
    447.dealII: -DSPEC_CPU_LP64
    450.soplex: -DSPEC_CPU_LP64
    453.povray: -DSPEC_CPU_LP64
  454.calculix: -DSPEC_CPU_LP64
  459.GemsFDTD: -DSPEC_CPU_LP64
     465.tonto: -DSPEC_CPU_LP64
       470.lbm: -DSPEC_CPU_LP64
       481.wrf: -DSPEC_CPU_LP64 -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG
                -fno-second-underscore
   482.sphinx3: -DSPEC_CPU_LP64


                           Base Optimization Flags
                           -----------------------
C benchmarks:
     -march=bdver1 -Ofast -HP:bdt=2m:heap=2m -apo -mso -OPT:alias=restricted
     -OPT:malloc_alg=2 -LNO:parallel_overhead=10000

C++ benchmarks:
     -march=bdver1 -Ofast -static -CG:load_exe=0 -CG:p2align=0
     -INLINE:aggressive=on -HP:bdt=2m:heap=2m -D__OPEN64_FAST_SET

Fortran benchmarks:
     -march=bdver1 -Ofast -LNO:blocking=off -LNO:fusion_peeling_limit=0
     -LNO:parallel_overhead=10000 -OPT:rsqrt=2 -OPT:unroll_size=256
     -HP:bdt=2m:heap=2m -apo

Benchmarks using both Fortran and C:
     -march=bdver1 -Ofast -HP:bdt=2m:heap=2m -apo -mso -OPT:alias=restricted
     -OPT:malloc_alg=2 -LNO:parallel_overhead=10000 -LNO:blocking=off
     -LNO:fusion_peeling_limit=0 -OPT:rsqrt=2 -OPT:unroll_size=256


                           Peak Compiler Invocation
                           ------------------------
C benchmarks:
     opencc

C++ benchmarks:
     openCC

Fortran benchmarks:
     openf95

Benchmarks using both Fortran and C:
     opencc openf95


                            Peak Portability Flags
                            ----------------------
    410.bwaves: -DSPEC_CPU_LP64
    416.gamess: -DSPEC_CPU_LP64
      433.milc: -DSPEC_CPU_LP64
    434.zeusmp: -DSPEC_CPU_LP64
   435.gromacs: -DSPEC_CPU_LP64
 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore
  437.leslie3d: -DSPEC_CPU_LP64
      444.namd: -DSPEC_CPU_LP64
    447.dealII: -DSPEC_CPU_LP64
    453.povray: -DSPEC_CPU_LP64
  454.calculix: -DSPEC_CPU_LP64
  459.GemsFDTD: -DSPEC_CPU_LP64
     465.tonto: -DSPEC_CPU_LP64
       470.lbm: -DSPEC_CPU_LP64
       481.wrf: -DSPEC_CPU_LP64 -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG
                -fno-second-underscore
   482.sphinx3: -DSPEC_CPU_LP64


                           Peak Optimization Flags
                           -----------------------
C benchmarks:

      433.milc: -march=bdver1 -Ofast -CG:movnti=1
                -CG:locs_best=on -HP:bdt=2m:heap=2m -IPA:plimit=7000
                -IPA:callee_limit=1200 -OPT:struct_array_copy=2
                -OPT:alias=field_sensitive

       470.lbm: -march=bdver1 -Ofast -mso -apo -CG:sse_cse_regs=0
                -LNO:prefetch_ahead=4 -CG:locs_shallow_depth=1
                -CG:cmp_peep=on -CG:compute_to=on -OPT:unroll_times_max=8
                -OPT:unroll_size=256 -OPT:unroll_level=2 -OPT:keep_ext=on
                -OPT:alias=restricted -m3dnow -IPA:inline=off

   482.sphinx3: -march=bdver1 -fb_create fbdata(pass 1)
                -fb_opt fbdata(pass 2) -Ofast -LNO:loop_model_simd=on
                -LNO:simd_rm_unity_remainder=on -OPT:malloc_alg=2
                -CG:cmp_peep=on -CG:local_sched_alg=2 -CG:use_incdec=off
                -INLINE:aggressive=on -WOPT:sib=on -HP

C++ benchmarks:

      444.namd: -march=bdver1 -fb_create fbdata(pass 1)
                -fb_opt fbdata(pass 2) -Ofast -LNO:ignore_feedback=off
                -CG:local_sched_alg=2 -CG:load_exe=0 -OPT:unroll_size=256
                -fno-exceptions -HP:bdt=2m:heap=2m

    447.dealII: -march=bdver1 -Ofast -LNO:simd=0 -D__OPEN64_FAST_SET -static
                -INLINE:aggressive=on -OPT:alias=disjoint
                -OPT:unroll_times_max=8 -OPT:unroll_size=256
                -OPT:unroll_level=2 -HP:bdt=2m:heap=2m

    450.soplex: -march=bdver1 -fb_create fbdata(pass 1)
                -fb_opt fbdata(pass 2) -O3 -INLINE:aggressive=on -OPT:RO=1
                -OPT:IEEE_arith=3 -OPT:IEEE_NaN_Inf=off
                -OPT:fold_unsigned_relops=on -fno-exceptions -CG:p2align=0
                -m32 -HP:bdt=2m:heap=2m -WOPT:sib=on

    453.povray: -march=bdver1 -fb_create fbdata(pass 1)
                -fb_opt fbdata(pass 2) -Ofast -CG:pre_local_sched=off
                -INLINE:aggressive=on -HP:bdt=2m:heap=2m -OPT:transform=2
                -OPT:alias=disjoint -WOPT:aggcm=0

Fortran benchmarks:

    410.bwaves: -march=bdver1 -fb_create fbdata(pass 1)
                -fb_opt fbdata(pass 2) -Ofast -apo -OPT:Ofast
                -OPT:treeheight=on -LNO:blocking=off -LNO:prefetch=2
                -LNO:pf2=0 -LNO:prefetch_ahead=3 -LNO:ignore_feedback=off
                -LNO:fu=4 -LNO:loop_model_simd=on
                -LNO:simd_rm_unity_remainder=on -WOPT:aggstr=0
                -HP:bdt=2m:heap=2m -CG:cmp_peep=on -CG:p2align=0

    416.gamess: -march=bdver1 -fb_create fbdata(pass 1)
                -fb_opt fbdata(pass 2) -O3 -LNO:fu=6 -LNO:blocking=0
                -LNO:simd=0 -OPT:Ofast
                -OPT:ro=3 -OPT:unroll_size=256 -OPT:unroll_times_max=2
                -CG:local_sched_alg=1 -HP:bdt=2m:heap=2m -WOPT:sib=on

    434.zeusmp: -march=bdver1 -Ofast -apo -LNO:blocking=off
                -LNO:interchange=off -LNO:fusion_peeling_limit=0
                -OPT:treeheight=on -OPT:unroll_size=256 -CG:cmp_peep=on
                -CG:compute_to=on -GRA:prioritize_by_density=on
                -HP:bdt=2m:heap=2m

  437.leslie3d: -march=bdver1 -Ofast -LNO:prefetch=2 -LNO:blocking=off
                -CG:interior_ptrs=on -OPT:unroll_size=256
                -GRA:prioritize_by_density=on -HP:bdt=2m:heap=2m

  459.GemsFDTD: -march=bdver1 -Ofast -OPT:unroll_size=0 -LNO:fission=2
                -CG:load_exe=0 -CG:local_sched_alg=2 -HP -apo

     465.tonto: -march=bdver1 -Ofast -OPT:alias=no_f90_pointer_alias
                -LNO:blocking=off -CG:load_exe=1 -CG:local_sched_alg=1
                -IPA:plimit=525 -HP

Benchmarks using both Fortran and C:

   435.gromacs: -march=bdver1 -fb_create fbdata(pass 1)
                -fb_opt fbdata(pass 2) -Ofast -OPT:rsqrt=2
                -HP:bdt=2m:heap=2m

 436.cactusADM: -march=bdver1 -fb_create fbdata(pass 1)
                -fb_opt fbdata(pass 2) -Ofast -LNO:blocking=off
                -LNO:prefetch=2 -HP:bdt=2m:heap=2m
                -CG:locs_shallow_depth=1 -CG:load_exe=0 -WOPT:sib=on -apo

  454.calculix: -march=bdver1 -Ofast -OPT:unroll_size=256
                -GRA:optimize_boundary=on -HP:bdt=2m:heap=2m

       481.wrf: -march=bdver1 -Ofast -OPT:unroll_size=256 -LNO:blocking=off
                -LANG:copyinout=off -IPA:callee_limit=5000
                -GRA:prioritize_by_density=on -CG:load_exe=1 -HP
                -WOPT:sib=on -apo


The flags file that was used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-425-flags-speed-revA.html

You can also download the XML flags source by saving the following link:
http://www.spec.org/cpu2006/flags/x86-open64-425-flags-speed-revA.xml


SPEC and SPECfp are registered trademarks of the Standard Performance
Evaluation Corporation.  All other brand and product names appearing
in this result are trademarks or registered trademarks of their
respective holders.
-----------------------------------------------------------------------------
For questions about this result, please contact the tester.
For other inquiries, please contact webmaster@spec.org.
Copyright 2006-2014 Standard Performance Evaluation Corporation
Tested with SPEC CPU2006 v1.2.
Report generated on Thu Jul 24 02:00:05 2014 by CPU2006 ASCII formatter v6932.
Originally published on 24 February 2012.
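
Reader's cross-check (not part of the SPEC-generated report). The sketch below recomputes the two summary metrics from the selected ratios in the results table, the rows flagged with `*`. It assumes that each ratio is the reference time divided by the measured run time and that SPECfp(R)_base2006 and SPECfp2006 are geometric means of those selected ratios. The function and variable names are illustrative only. Because the inputs are the rounded ratios printed in the report, agreement with the published 38.2 and 43.6 should only be expected to roughly one decimal place.

```python
import math

# Selected ("*") base and peak ratios, copied from the summary rows of the report.
BASE_RATIOS = {
    "410.bwaves": 142, "416.gamess": 20.2, "433.milc": 33.6,
    "434.zeusmp": 66.5, "435.gromacs": 22.2, "436.cactusADM": 168,
    "437.leslie3d": 25.3, "444.namd": 18.2, "447.dealII": 42.2,
    "450.soplex": 24.1, "453.povray": 23.1, "454.calculix": 29.9,
    "459.GemsFDTD": 41.3, "465.tonto": 21.5, "470.lbm": 110,
    "481.wrf": 42.5, "482.sphinx3": 24.1,
}
PEAK_RATIOS = {
    "410.bwaves": 160, "416.gamess": 21.4, "433.milc": 40.3,
    "434.zeusmp": 67.6, "435.gromacs": 23.3, "436.cactusADM": 176,
    "437.leslie3d": 27.5, "444.namd": 18.6, "447.dealII": 46.5,
    "450.soplex": 26.0, "453.povray": 24.8, "454.calculix": 34.0,
    "459.GemsFDTD": 48.4, "465.tonto": 26.0, "470.lbm": 208,
    "481.wrf": 44.2, "482.sphinx3": 32.7,
}


def ratio(ref_seconds, run_seconds):
    """Assumed definition: reference time divided by measured run time."""
    return ref_seconds / run_seconds


def geometric_mean(values):
    """Geometric mean computed through logarithms to avoid a huge product."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


if __name__ == "__main__":
    # Example row: 410.bwaves base, reference 13590 s, run time 95.9 s.
    print(f"410.bwaves base ratio: {ratio(13590, 95.9):.1f}")
    print(f"base geometric mean:   {geometric_mean(BASE_RATIOS.values()):.1f}")
    print(f"peak geometric mean:   {geometric_mean(PEAK_RATIOS.values()):.1f}")
```

A geometric mean is used rather than an arithmetic mean so that benchmarks with large ratios (such as 470.lbm or 436.cactusADM here) do not dominate the overall figure.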