SPEC(R) CFP2006 Summary Tyan Tyan YR190B8228, AMD Opteron 4170 HE Test Sponsor: Advanced Micro Devices Tue Nov 30 08:05:24 2010 CPU2006 License: 49 Test date: Nov-2010 Test sponsor: Advanced Micro Devices Hardware availability: Aug-2010 Tested by: Advanced Micro Devices Software availability: May-2010 Base Base Base Peak Peak Peak Benchmarks Ref. Run Time Ratio Ref. Run Time Ratio -------------- ------ --------- --------- ------ --------- --------- 410.bwaves 13590 208 65.5 S 13590 125 109 S 410.bwaves 13590 208 65.2 S 13590 126 108 S 410.bwaves 13590 208 65.4 * 13590 125 109 * 416.gamess 19580 1448 13.5 * 19580 1322 14.8 * 416.gamess 19580 1451 13.5 S 19580 1322 14.8 S 416.gamess 19580 1448 13.5 S 19580 1320 14.8 S 433.milc 9180 539 17.0 S 9180 393 23.4 S 433.milc 9180 534 17.2 S 9180 393 23.4 * 433.milc 9180 537 17.1 * 9180 394 23.3 S 434.zeusmp 9100 264 34.5 * 9100 255 35.7 * 434.zeusmp 9100 264 34.5 S 9100 254 35.9 S 434.zeusmp 9100 263 34.6 S 9100 255 35.7 S 435.gromacs 7140 630 11.3 * 7140 488 14.6 * 435.gromacs 7140 629 11.4 S 7140 488 14.6 S 435.gromacs 7140 631 11.3 S 7140 487 14.7 S 436.cactusADM 11950 153 77.9 S 11950 103 116 S 436.cactusADM 11950 152 78.7 S 11950 103 117 * 436.cactusADM 11950 153 78.0 * 11950 102 117 S 437.leslie3d 9400 560 16.8 * 9400 519 18.1 * 437.leslie3d 9400 558 16.8 S 9400 519 18.1 S 437.leslie3d 9400 561 16.8 S 9400 521 18.1 S 444.namd 8020 740 10.8 S 8020 672 11.9 S 444.namd 8020 738 10.9 S 8020 672 11.9 * 444.namd 8020 738 10.9 * 8020 671 11.9 S 447.dealII 11440 532 21.5 S 11440 470 24.3 * 447.dealII 11440 534 21.4 S 11440 470 24.3 S 447.dealII 11440 534 21.4 * 11440 470 24.3 S 450.soplex 8340 627 13.3 S 8340 551 15.1 S 450.soplex 8340 630 13.2 S 8340 554 15.1 * 450.soplex 8340 628 13.3 * 8340 554 15.0 S 453.povray 5320 337 15.8 S 5320 325 16.4 S 453.povray 5320 339 15.7 S 5320 327 16.3 S 453.povray 5320 337 15.8 * 5320 326 16.3 * 454.calculix 8250 432 19.1 S 8250 406 20.3 S 454.calculix 8250 429 19.2 S 8250 405 20.4 * 454.calculix 8250 431 19.1 * 8250 404 20.4 S 459.GemsFDTD 10610 335 31.6 S 10610 320 33.1 S 459.GemsFDTD 10610 336 31.6 * 10610 320 33.1 S 459.GemsFDTD 10610 336 31.6 S 10610 320 33.1 * 465.tonto 9840 585 16.8 S 9840 555 17.7 S 465.tonto 9840 580 17.0 * 9840 552 17.8 * 465.tonto 9840 580 17.0 S 9840 550 17.9 S 470.lbm 13740 599 22.9 S 13740 75.9 181 S 470.lbm 13740 604 22.7 S 13740 75.7 181 S 470.lbm 13740 600 22.9 * 13740 75.8 181 * 481.wrf 11170 360 31.0 S 11170 360 31.0 S 481.wrf 11170 359 31.1 S 11170 359 31.1 S 481.wrf 11170 359 31.1 * 11170 359 31.1 * 482.sphinx3 19490 891 21.9 * 19490 846 23.0 S 482.sphinx3 19490 891 21.9 S 19490 847 23.0 * 482.sphinx3 19490 891 21.9 S 19490 850 22.9 S ============================================================================== 410.bwaves 13590 208 65.4 * 13590 125 109 * 416.gamess 19580 1448 13.5 * 19580 1322 14.8 * 433.milc 9180 537 17.1 * 9180 393 23.4 * 434.zeusmp 9100 264 34.5 * 9100 255 35.7 * 435.gromacs 7140 630 11.3 * 7140 488 14.6 * 436.cactusADM 11950 153 78.0 * 11950 103 117 * 437.leslie3d 9400 560 16.8 * 9400 519 18.1 * 444.namd 8020 738 10.9 * 8020 672 11.9 * 447.dealII 11440 534 21.4 * 11440 470 24.3 * 450.soplex 8340 628 13.3 * 8340 554 15.1 * 453.povray 5320 337 15.8 * 5320 326 16.3 * 454.calculix 8250 431 19.1 * 8250 405 20.4 * 459.GemsFDTD 10610 336 31.6 * 10610 320 33.1 * 465.tonto 9840 580 17.0 * 9840 552 17.8 * 470.lbm 13740 600 22.9 * 13740 75.8 181 * 481.wrf 11170 359 31.1 * 11170 359 31.1 * 482.sphinx3 19490 891 21.9 * 19490 847 23.0 * SPECfp(R)_base2006 21.9 SPECfp2006 28.2 HARDWARE -------- CPU Name: AMD Opteron 4170 HE CPU Characteristics: CPU MHz: 2100 FPU: Integrated CPU(s) enabled: 12 cores, 2 chips, 6 cores/chip CPU(s) orderable: 1,2 chips Primary Cache: 64 KB I + 64 KB D on chip per core Secondary Cache: 512 KB I+D on chip per core L3 Cache: 6 MB I+D on chip per chip Other Cache: None Memory: 32 GB (4 x 8 GB 2Rx4 PC3-10600R-9, ECC) Disk Subsystem: 1 x 128 GB SATA SSD Crucial RealSSD C300 CTFDDAC128MAG-1G1 Other Hardware: None SOFTWARE -------- Operating System: SUSE Linux Enterprise Server 11 (x86_64), Kernel 2.6.27.19-5-default Compiler: x86 Open64 4.2.3.2 Compiler Suite (from AMD) Auto Parallel: Yes File System: ext3 System State: Run level 3 (Full multiuser with network) Base Pointers: 64-bit Peak Pointers: 32/64-bit Other Software: None Submit Notes ------------ The config file option 'submit' was used. 'numactl' was used to bind copies to the cores. See the configuration file for details. Operating System Notes ---------------------- 'ulimit -s unlimited' was used to set environment stack size 'ulimit -l 2097152' was used to set environment locked pages in memory limit Set vm/nr_hugepages=2000 in /etc/sysctl.conf mount -t hugetlbfs nodev /mnt/hugepages powersave -f was used to set the CPU frequency to its maximum. Binaries were compiled on SLES10 SP2 with binutils 2.18 General Notes ------------- Environment variables set by runspec before the start of the run: LD_LIBRARY_PATH = "/root/work/cpu2006/amd1002-speed-libs-revA/64:/root/work/cpu2006/amd1002-speed-libs-revA/32" O64_OMP_AFFINITY_MAP = "0,1,2,3,4,5,6,7,8,9,10,11" O64_OMP_SPIN_USER_LOCK = "true" The x86 Open64 Compiler Suite is only available from (and supported by) AMD at http://developer.amd.com/cpu/open64 Base Compiler Invocation ------------------------ C benchmarks: opencc C++ benchmarks: openCC Fortran benchmarks: openf95 Benchmarks using both Fortran and C: opencc openf95 Base Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 447.dealII: -DSPEC_CPU_LP64 450.soplex: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LP64 -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -fno-second-underscore 482.sphinx3: -DSPEC_CPU_LP64 Base Optimization Flags ----------------------- C benchmarks: -march=barcelona -Ofast -HP:bdt=2m:heap=2m C++ benchmarks: -march=barcelona -Ofast -static -INLINE:aggressive=on -HP:bdt=2m:heap=2m Fortran benchmarks: -march=barcelona -Ofast -apo -LNO:parallel_overhead=10000 -LNO:fusion_peeling_limit=0 -HP:bdt=2m:heap=2m Benchmarks using both Fortran and C: -march=barcelona -Ofast -HP:bdt=2m:heap=2m -apo -LNO:parallel_overhead=10000 -LNO:fusion_peeling_limit=0 Peak Compiler Invocation ------------------------ C benchmarks: opencc C++ benchmarks: openCC Fortran benchmarks: openf95 Benchmarks using both Fortran and C: opencc openf95 Peak Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LP64 -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -fno-second-underscore 482.sphinx3: -DSPEC_CPU_LP64 Peak Optimization Flags ----------------------- C benchmarks: 433.milc: -march=barcelona -Ofast -apo -CG:movnti=1 -CG:local_sched_alg=1 -CG:locs_shallow_depth=1 -CG:compute_to=on -HP:bdt=2m:heap=2m -LNO:prefetch=3 470.lbm: -march=barcelona -Ofast -mso -apo -CG:sse_cse_regs=0 -LNO:prefetch_ahead=4 -CG:locs_shallow_depth=1 -CG:cmp_peep=on -CG:compute_to=on -OPT:unroll_times_max=8 -OPT:unroll_size=256 -OPT:unroll_level=2 -OPT:keep_ext=on -OPT:alias=restricted -m3dnow -IPA:inline=off 482.sphinx3: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -OPT:malloc_alg=2 -CG:sse_cse_regs=0 -CG:locs_shallow_depth=1 -CG:cmp_peep=on -CG:local_sched_alg=1 -INLINE:aggressive=on C++ benchmarks: 444.namd: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:ignore_feedback=off -CG:local_sched_alg=2 -CG:load_exe=0 -CG:compute_to=on -OPT:unroll_size=256 -fno-exceptions -HP:bdt=2m:heap=2m 447.dealII: -march=barcelona -Ofast -static -INLINE:aggressive=on -LNO:opt=0 -fno-emit-exceptions -m32 -OPT:unroll_times_max=8 -OPT:unroll_size=256 -OPT:unroll_level=2 -HP:bdt=2m:heap=2m -GRA:unspill=on -CG:cmp_peep=on -TENV:frame_pointer=off 450.soplex: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O3 -INLINE:aggressive=on -OPT:IEEE_arith=3 -OPT:IEEE_NaN_Inf=off -OPT:fold_unsigned_relops=on -CG:load_exe=0 -fno-exceptions -m32 -HP:bdt=2m:heap=2m 453.povray: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -INLINE:aggressive=on -HP:bdt=2m:heap=2m Fortran benchmarks: 410.bwaves: -march=barcelona -Ofast -apo -OPT:malloc_alg=2 -CG:use_prefetchnta=on -CG:cmp_peep=on -LNO:blocking=off -LNO:prefetch=3 -LNO:prefetch_ahead=5 -LNO:ignore_feedback=off -LNO:apo_use_feedback=on -WOPT:aggstr=0 416.gamess: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O3 -LNO:fu=6 -LNO:blocking=0 -LNO:prefetch=0 -OPT:Ofast -OPT:ro=3 -OPT:unroll_size=256 -HP:bdt=2m:heap=2m 434.zeusmp: -march=barcelona -Ofast -apo -LNO:blocking=off -LNO:interchange=off -LNO:fusion_peeling_limit=0 -OPT:treeheight=on -OPT:unroll_size=256 -CG:cmp_peep=on -CG:compute_to=on -GRA:prioritize_by_density=on -HP:bdt=2m:heap=2m 437.leslie3d: -march=barcelona -Ofast -apo -OPT:unroll_size=256 -LNO:prefetch_ahead=4 -LNO:parallel_overhead=32768 -GRA:prioritize_by_density=on -m3dnow -HP:bdt=2m:heap=2m 459.GemsFDTD: -march=barcelona -Ofast -apo -LNO:fission=2 -LNO:prefetch_ahead=1 -CG:load_exe=0 -CG:local_sched_alg=1 -HP 465.tonto: -march=barcelona -Ofast -apo -OPT:alias=no_f90_pointer_alias -LNO:blocking=off -CG:load_exe=1 -IPA:plimit=525 -HP Benchmarks using both Fortran and C: 435.gromacs: -march=barcelona -Ofast -apo -OPT:rsqrt=2 -HP:bdt=2m:heap=2m 436.cactusADM: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -apo -LANG:heap_allocation_threshold=1000 -LNO:prefetch_ahead=1 -HP:bdt=2m:heap=2m 454.calculix: -march=barcelona -Ofast -LNO:prefetch_ahead=30 -CG:load_exe=0 -CG:ptr_load_use=0 -CG:local_sched_alg=2 -CG:compute_to=on -WOPT:unroll=2 -GRA:optimize_boundary=on -HP:bdt=2m:heap=2m -apo 481.wrf: basepeak = yes The flags files that were used to format this result can be browsed at http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.html http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.html You can also download the XML flags sources by saving the following links: http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.xml http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.xml SPEC and SPECfp are registered trademarks of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 2006-2014 Standard Performance Evaluation Corporation Tested with SPEC CPU2006 v1.1. Report generated on Wed Jul 23 15:22:14 2014 by CPU2006 ASCII formatter v6932. Originally published on 3 February 2011.