SPEC(R) CFP2006 Summary Tyan Tyan YR190B8228, AMD Opteron 4176 HE Test Sponsor: Advanced Micro Devices Sat Nov 6 05:46:21 2010 CPU2006 License: 49 Test date: Nov-2010 Test sponsor: Advanced Micro Devices Hardware availability: Aug-2010 Tested by: Advanced Micro Devices Software availability: May-2010 Base Base Base Peak Peak Peak Benchmarks Ref. Run Time Ratio Ref. Run Time Ratio -------------- ------ --------- --------- ------ --------- --------- 410.bwaves 13590 191 71.2 S 13590 117 116 S 410.bwaves 13590 191 71.1 * 13590 117 116 * 410.bwaves 13590 191 71.0 S 13590 117 116 S 416.gamess 19580 1263 15.5 S 19580 1156 16.9 * 416.gamess 19580 1268 15.4 S 19580 1158 16.9 S 416.gamess 19580 1268 15.4 * 19580 1154 17.0 S 433.milc 9180 498 18.4 S 9180 363 25.3 * 433.milc 9180 499 18.4 * 9180 362 25.4 S 433.milc 9180 501 18.3 S 9180 363 25.3 S 434.zeusmp 9100 240 38.0 S 9100 231 39.3 S 434.zeusmp 9100 240 38.0 * 9100 232 39.3 S 434.zeusmp 9100 240 37.9 S 9100 231 39.3 * 435.gromacs 7140 552 12.9 S 7140 428 16.7 S 435.gromacs 7140 552 12.9 S 7140 428 16.7 * 435.gromacs 7140 552 12.9 * 7140 429 16.7 S 436.cactusADM 11950 142 84.2 S 11950 93.5 128 * 436.cactusADM 11950 143 83.7 * 11950 93.5 128 S 436.cactusADM 11950 143 83.4 S 11950 93.8 127 S 437.leslie3d 9400 511 18.4 * 9400 476 19.7 S 437.leslie3d 9400 511 18.4 S 9400 470 20.0 S 437.leslie3d 9400 507 18.5 S 9400 475 19.8 * 444.namd 8020 647 12.4 S 8020 590 13.6 S 444.namd 8020 646 12.4 S 8020 589 13.6 * 444.namd 8020 646 12.4 * 8020 588 13.6 S 447.dealII 11440 475 24.1 S 11440 416 27.5 * 447.dealII 11440 475 24.1 S 11440 416 27.5 S 447.dealII 11440 475 24.1 * 11440 415 27.6 S 450.soplex 8340 577 14.5 S 8340 504 16.5 * 450.soplex 8340 579 14.4 * 8340 506 16.5 S 450.soplex 8340 579 14.4 S 8340 504 16.6 S 453.povray 5320 296 17.9 S 5320 285 18.6 * 453.povray 5320 295 18.0 S 5320 285 18.7 S 453.povray 5320 296 17.9 * 5320 285 18.6 S 454.calculix 8250 380 21.7 S 8250 357 23.1 * 454.calculix 8250 380 21.7 * 8250 357 23.1 S 454.calculix 8250 380 21.7 S 8250 361 22.8 S 459.GemsFDTD 10610 316 33.6 S 10610 301 35.3 S 459.GemsFDTD 10610 316 33.6 * 10610 301 35.2 S 459.GemsFDTD 10610 315 33.7 S 10610 301 35.3 * 465.tonto 9840 520 18.9 S 9840 492 20.0 S 465.tonto 9840 516 19.1 * 9840 492 20.0 S 465.tonto 9840 514 19.1 S 9840 492 20.0 * 470.lbm 13740 552 24.9 S 13740 71.7 192 S 470.lbm 13740 548 25.1 * 13740 71.7 192 S 470.lbm 13740 546 25.2 S 13740 71.7 192 * 481.wrf 11170 323 34.6 * 11170 323 34.6 * 481.wrf 11170 323 34.5 S 11170 323 34.5 S 481.wrf 11170 323 34.6 S 11170 323 34.6 S 482.sphinx3 19490 799 24.4 S 19490 754 25.9 S 482.sphinx3 19490 793 24.6 * 19490 760 25.6 S 482.sphinx3 19490 792 24.6 S 19490 757 25.7 * ============================================================================== 410.bwaves 13590 191 71.1 * 13590 117 116 * 416.gamess 19580 1268 15.4 * 19580 1156 16.9 * 433.milc 9180 499 18.4 * 9180 363 25.3 * 434.zeusmp 9100 240 38.0 * 9100 231 39.3 * 435.gromacs 7140 552 12.9 * 7140 428 16.7 * 436.cactusADM 11950 143 83.7 * 11950 93.5 128 * 437.leslie3d 9400 511 18.4 * 9400 475 19.8 * 444.namd 8020 646 12.4 * 8020 589 13.6 * 447.dealII 11440 475 24.1 * 11440 416 27.5 * 450.soplex 8340 579 14.4 * 8340 504 16.5 * 453.povray 5320 296 17.9 * 5320 285 18.6 * 454.calculix 8250 380 21.7 * 8250 357 23.1 * 459.GemsFDTD 10610 316 33.6 * 10610 301 35.3 * 465.tonto 9840 516 19.1 * 9840 492 20.0 * 470.lbm 13740 548 25.1 * 13740 71.7 192 * 481.wrf 11170 323 34.6 * 11170 323 34.6 * 482.sphinx3 19490 793 24.6 * 19490 757 25.7 * SPECfp(R)_base2006 24.3 SPECfp2006 31.3 HARDWARE -------- CPU Name: AMD Opteron 4176 HE CPU Characteristics: CPU MHz: 2400 FPU: Integrated CPU(s) enabled: 12 cores, 2 chips, 6 cores/chip CPU(s) orderable: 1,2 chips Primary Cache: 64 KB I + 64 KB D on chip per core Secondary Cache: 512 KB I+D on chip per core L3 Cache: 6 MB I+D on chip per chip Other Cache: None Memory: 32 GB (4 x 8 GB 2Rx4 PC3-10600R-9, ECC) Disk Subsystem: 1 x 128 GB SATA SSD Crucial RealSSD C300 CTFDDAC128MAG-1G1 Other Hardware: None SOFTWARE -------- Operating System: SUSE Linux Enterprise Server 11 (x86_64), Kernel 2.6.27.19-5-default Compiler: x86 Open64 4.2.3.2 Compiler Suite (from AMD) Auto Parallel: Yes File System: ext3 System State: Run level 3 (Full multiuser with network) Base Pointers: 64-bit Peak Pointers: 32/64-bit Other Software: None Submit Notes ------------ The config file option 'submit' was used. 'numactl' was used to bind copies to the cores. See the configuration file for details. Operating System Notes ---------------------- 'ulimit -s unlimited' was used to set environment stack size 'ulimit -l 2097152' was used to set environment locked pages in memory limit Set vm/nr_hugepages=2000 in /etc/sysctl.conf mount -t hugetlbfs nodev /mnt/hugepages powersave -f was used to set the CPU frequency to its maximum. Binaries were compiled on SLES10 SP2 with binutils 2.18 General Notes ------------- Environment variables set by runspec before the start of the run: LD_LIBRARY_PATH = "/root/work/cpu2006/amd1002-speed-libs-revA/64:/root/work/cpu2006/amd1002-speed-libs-revA/32" O64_OMP_AFFINITY_MAP = "0,1,2,3,4,5,6,7,8,9,10,11" O64_OMP_SPIN_USER_LOCK = "true" The x86 Open64 Compiler Suite is only available from (and supported by) AMD at http://developer.amd.com/cpu/open64 Base Compiler Invocation ------------------------ C benchmarks: opencc C++ benchmarks: openCC Fortran benchmarks: openf95 Benchmarks using both Fortran and C: opencc openf95 Base Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 447.dealII: -DSPEC_CPU_LP64 450.soplex: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LP64 -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -fno-second-underscore 482.sphinx3: -DSPEC_CPU_LP64 Base Optimization Flags ----------------------- C benchmarks: -march=barcelona -Ofast -HP:bdt=2m:heap=2m C++ benchmarks: -march=barcelona -Ofast -static -INLINE:aggressive=on -HP:bdt=2m:heap=2m Fortran benchmarks: -march=barcelona -Ofast -apo -LNO:parallel_overhead=10000 -LNO:fusion_peeling_limit=0 -HP:bdt=2m:heap=2m Benchmarks using both Fortran and C: -march=barcelona -Ofast -HP:bdt=2m:heap=2m -apo -LNO:parallel_overhead=10000 -LNO:fusion_peeling_limit=0 Peak Compiler Invocation ------------------------ C benchmarks: opencc C++ benchmarks: openCC Fortran benchmarks: openf95 Benchmarks using both Fortran and C: opencc openf95 Peak Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LP64 -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -fno-second-underscore 482.sphinx3: -DSPEC_CPU_LP64 Peak Optimization Flags ----------------------- C benchmarks: 433.milc: -march=barcelona -Ofast -apo -CG:movnti=1 -CG:local_sched_alg=1 -CG:locs_shallow_depth=1 -CG:compute_to=on -HP:bdt=2m:heap=2m -LNO:prefetch=3 470.lbm: -march=barcelona -Ofast -mso -apo -CG:sse_cse_regs=0 -LNO:prefetch_ahead=4 -CG:locs_shallow_depth=1 -CG:cmp_peep=on -CG:compute_to=on -OPT:unroll_times_max=8 -OPT:unroll_size=256 -OPT:unroll_level=2 -OPT:keep_ext=on -OPT:alias=restricted -m3dnow -IPA:inline=off 482.sphinx3: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -OPT:malloc_alg=2 -CG:sse_cse_regs=0 -CG:locs_shallow_depth=1 -CG:cmp_peep=on -CG:local_sched_alg=1 -INLINE:aggressive=on C++ benchmarks: 444.namd: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:ignore_feedback=off -CG:local_sched_alg=2 -CG:load_exe=0 -CG:compute_to=on -OPT:unroll_size=256 -fno-exceptions -HP:bdt=2m:heap=2m 447.dealII: -march=barcelona -Ofast -static -INLINE:aggressive=on -LNO:opt=0 -fno-emit-exceptions -m32 -OPT:unroll_times_max=8 -OPT:unroll_size=256 -OPT:unroll_level=2 -HP:bdt=2m:heap=2m -GRA:unspill=on -CG:cmp_peep=on -TENV:frame_pointer=off 450.soplex: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O3 -INLINE:aggressive=on -OPT:IEEE_arith=3 -OPT:IEEE_NaN_Inf=off -OPT:fold_unsigned_relops=on -CG:load_exe=0 -fno-exceptions -m32 -HP:bdt=2m:heap=2m 453.povray: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -INLINE:aggressive=on -HP:bdt=2m:heap=2m Fortran benchmarks: 410.bwaves: -march=barcelona -Ofast -apo -OPT:malloc_alg=2 -CG:use_prefetchnta=on -CG:cmp_peep=on -LNO:blocking=off -LNO:prefetch=3 -LNO:prefetch_ahead=5 -LNO:ignore_feedback=off -LNO:apo_use_feedback=on -WOPT:aggstr=0 416.gamess: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O3 -LNO:fu=6 -LNO:blocking=0 -LNO:prefetch=0 -OPT:Ofast -OPT:ro=3 -OPT:unroll_size=256 -HP:bdt=2m:heap=2m 434.zeusmp: -march=barcelona -Ofast -apo -LNO:blocking=off -LNO:interchange=off -LNO:fusion_peeling_limit=0 -OPT:treeheight=on -OPT:unroll_size=256 -CG:cmp_peep=on -CG:compute_to=on -GRA:prioritize_by_density=on -HP:bdt=2m:heap=2m 437.leslie3d: -march=barcelona -Ofast -apo -OPT:unroll_size=256 -LNO:prefetch_ahead=4 -LNO:parallel_overhead=32768 -GRA:prioritize_by_density=on -m3dnow -HP:bdt=2m:heap=2m 459.GemsFDTD: -march=barcelona -Ofast -apo -LNO:fission=2 -LNO:prefetch_ahead=1 -CG:load_exe=0 -CG:local_sched_alg=1 -HP 465.tonto: -march=barcelona -Ofast -apo -OPT:alias=no_f90_pointer_alias -LNO:blocking=off -CG:load_exe=1 -IPA:plimit=525 -HP Benchmarks using both Fortran and C: 435.gromacs: -march=barcelona -Ofast -apo -OPT:rsqrt=2 -HP:bdt=2m:heap=2m 436.cactusADM: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -apo -LANG:heap_allocation_threshold=1000 -LNO:prefetch_ahead=1 -HP:bdt=2m:heap=2m 454.calculix: -march=barcelona -Ofast -LNO:prefetch_ahead=30 -CG:load_exe=0 -CG:ptr_load_use=0 -CG:local_sched_alg=2 -CG:compute_to=on -WOPT:unroll=2 -GRA:optimize_boundary=on -HP:bdt=2m:heap=2m -apo 481.wrf: basepeak = yes The flags files that were used to format this result can be browsed at http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.html http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.html You can also download the XML flags sources by saving the following links: http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.xml http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.xml SPEC and SPECfp are registered trademarks of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 2006-2014 Standard Performance Evaluation Corporation Tested with SPEC CPU2006 v1.1. Report generated on Wed Jul 23 15:23:08 2014 by CPU2006 ASCII formatter v6932. Originally published on 3 February 2011.