SPEC(R) CFP2006 Summary Tyan Tyan YR190-B8228, AMD Opteron 4332 HE Test Sponsor: Advanced Micro Devices Mon Oct 1 16:14:42 2012 CPU2006 License: 49 Test date: Oct-2012 Test sponsor: Advanced Micro Devices Hardware availability: Dec-2012 Tested by: Advanced Micro Devices Software availability: Aug-2012 Base Base Base Peak Peak Peak Benchmarks Copies Run Time Rate Copies Run Time Rate -------------- ------ --------- --------- ------ --------- --------- 410.bwaves 12 1012 161 S 12 992 164 S 410.bwaves 12 1013 161 * 12 993 164 * 410.bwaves 12 1013 161 S 12 994 164 S 416.gamess 12 1518 155 S 12 1398 168 * 416.gamess 12 1523 154 S 12 1401 168 S 416.gamess 12 1522 154 * 12 1394 169 S 433.milc 12 814 135 S 12 701 157 S 433.milc 12 813 135 S 12 701 157 * 433.milc 12 814 135 * 12 701 157 S 434.zeusmp 12 579 189 S 12 558 196 * 434.zeusmp 12 574 190 S 12 567 193 S 434.zeusmp 12 575 190 * 12 558 196 S 435.gromacs 12 455 188 * 12 366 234 S 435.gromacs 12 454 189 S 12 368 233 * 435.gromacs 12 456 188 S 12 369 232 S 436.cactusADM 12 633 226 S 12 558 257 S 436.cactusADM 12 636 226 * 12 562 255 S 436.cactusADM 12 636 225 S 12 560 256 * 437.leslie3d 12 1014 111 S 12 822 137 S 437.leslie3d 12 1015 111 S 12 821 137 S 437.leslie3d 12 1014 111 * 12 821 137 * 444.namd 12 627 154 * 12 533 181 * 444.namd 12 632 152 S 12 534 180 S 444.namd 12 621 155 S 12 532 181 S 447.dealII 12 407 337 S 12 368 374 S 447.dealII 12 405 339 * 12 371 370 * 447.dealII 12 401 342 S 12 383 358 S 450.soplex 12 787 127 S 12 717 140 * 450.soplex 12 760 132 * 12 733 136 S 450.soplex 12 759 132 S 12 700 143 S 453.povray 12 308 207 S 12 269 237 S 453.povray 12 308 207 * 12 270 236 * 453.povray 12 308 207 S 12 270 236 S 454.calculix 12 334 297 S 12 318 311 S 454.calculix 12 329 300 S 12 318 311 * 454.calculix 12 330 300 * 12 318 312 S 459.GemsFDTD 12 1224 104 S 12 1089 117 S 459.GemsFDTD 12 1231 103 S 12 1093 117 S 459.GemsFDTD 12 1226 104 * 12 1090 117 * 465.tonto 12 657 180 S 12 590 200 S 465.tonto 12 649 182 * 12 591 200 * 465.tonto 12 646 183 S 12 592 199 S 470.lbm 12 769 214 S 12 769 214 * 470.lbm 12 772 213 S 12 768 215 S 470.lbm 12 770 214 * 12 773 213 S 481.wrf 12 702 191 S 12 699 192 S 481.wrf 12 704 190 S 12 699 192 * 481.wrf 12 703 191 * 12 700 192 S 482.sphinx3 12 1391 168 S 12 1094 214 S 482.sphinx3 12 1390 168 * 12 1082 216 S 482.sphinx3 12 1382 169 S 12 1092 214 * ============================================================================== 410.bwaves 12 1013 161 * 12 993 164 * 416.gamess 12 1522 154 * 12 1398 168 * 433.milc 12 814 135 * 12 701 157 * 434.zeusmp 12 575 190 * 12 558 196 * 435.gromacs 12 455 188 * 12 368 233 * 436.cactusADM 12 636 226 * 12 560 256 * 437.leslie3d 12 1014 111 * 12 821 137 * 444.namd 12 627 154 * 12 533 181 * 447.dealII 12 405 339 * 12 371 370 * 450.soplex 12 760 132 * 12 717 140 * 453.povray 12 308 207 * 12 270 236 * 454.calculix 12 330 300 * 12 318 311 * 459.GemsFDTD 12 1226 104 * 12 1090 117 * 465.tonto 12 649 182 * 12 591 200 * 470.lbm 12 770 214 * 12 769 214 * 481.wrf 12 703 191 * 12 699 192 * 482.sphinx3 12 1390 168 * 12 1092 214 * SPECfp(R)_rate_base2006 177 SPECfp_rate2006 197 HARDWARE -------- CPU Name: AMD Opteron 4332 HE CPU Characteristics: AMD Turbo CORE technology up to 3.70 GHz CPU MHz: 3000 FPU: Integrated CPU(s) enabled: 12 cores, 2 chips, 6 cores/chip CPU(s) orderable: 1,2 chips Primary Cache: 192 KB I on chip per chip, 64 KB I shared / 2 cores; 16 KB D on chip per core Secondary Cache: 6 MB I+D on chip per chip, 2 MB shared / 2 cores L3 Cache: 8 MB I+D on chip per chip Other Cache: None Memory: 32 GB (4 x 8 GB 2Rx4 PC3-12800R-11, ECC) Disk Subsystem: 1 x 128 GB SSD Other Hardware: None SOFTWARE -------- Operating System: Red Hat Enterprise Linux Server release 6.3, Kernel 2.6.32-279.el6.x86_64 Compiler: C/C++/Fortran: Version 4.5.2 of x86 Open64 Compiler Suite (from AMD) Auto Parallel: No File System: ext3 System State: Run level 3 (Full multiuser with network) Base Pointers: 64-bit Peak Pointers: 32/64-bit Other Software: None Submit Notes ------------ The config file option 'submit' was used. 'numactl' was used to bind copies to the cores. See the configuration file for details. Operating System Notes ---------------------- 'ulimit -s unlimited' was used to set environment stack size 'ulimit -l 2097152' was used to set environment locked pages in memory limit Set transparent_hugepage=never as a boot parameter in /boot/grub/menu.lst Set vm/nr_hugepages=5760 in /etc/sysctl.conf mount -t hugetlbfs nodev /mnt/hugepages General Notes ------------- Environment variables set by runspec before the start of the run: HUGETLB_LIMIT = "480" LD_LIBRARY_PATH = "/root/work/cpu2006v1.2/amd1206-rate-libs-revA/32:/root/work/cpu2006v1.2/amd1206-rate-libs-revA/64" The x86 Open64 Compiler Suite is only available from (and supported by) AMD at http://developer.amd.com/cpu/open64 Binaries were compiled on a system with 2x AMD Opteron 6386SE chips + 128GB Memory using RHEL 6.3 Base Compiler Invocation ------------------------ C benchmarks: opencc C++ benchmarks: openCC Fortran benchmarks: openf95 Benchmarks using both Fortran and C: opencc openf95 Base Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 447.dealII: -DSPEC_CPU_LP64 450.soplex: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LP64 -fno-second-underscore 482.sphinx3: -DSPEC_CPU_LP64 Base Optimization Flags ----------------------- C benchmarks: -Ofast -OPT:malloc_alg=1 -HP:bd=2m:heap=2m -IPA:plimit=8000 -IPA:small_pu=100 -mso -march=bdver1 C++ benchmarks: -Ofast -static -CG:load_exe=0 -OPT:malloc_alg=1 -INLINE:aggressive=on -HP:bd=2m:heap=2m -D__OPEN64_FAST_SET -march=bdver1 Fortran benchmarks: -Ofast -LNO:blocking=off -LNO:simd_peel_align=on -OPT:rsqrt=2 -OPT:unroll_size=256 -HP:bd=2m:heap=2m -mso -march=bdver1 Benchmarks using both Fortran and C: -Ofast -OPT:malloc_alg=1 -HP:bd=2m:heap=2m -IPA:plimit=8000 -IPA:small_pu=100 -mso -march=bdver1 -LNO:blocking=off -LNO:simd_peel_align=on -OPT:rsqrt=2 -OPT:unroll_size=256 Peak Compiler Invocation ------------------------ C benchmarks: opencc C++ benchmarks: openCC Fortran benchmarks: openf95 Benchmarks using both Fortran and C: opencc openf95 Peak Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LP64 -fno-second-underscore Peak Optimization Flags ----------------------- C benchmarks: 433.milc: -Ofast -CG:movnti=1 -CG:locs_best=on -HP:bdt=2m:heap=2m -IPA:plimit=7000 -IPA:callee_limit=1200 -OPT:struct_array_copy=2 -OPT:alias=field_sensitive -mso -march=bdver1 470.lbm: -Ofast -CG:cmp_peep=on -OPT:keep_ext=on -HP:bdt=2m:heap=2m -IPA:plimit=8000 -IPA:small_pu=100 -march=bdver1 -mso 482.sphinx3: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -m32 -IPA:plimit=1000 -OPT:malloc_alg=2 -CG:cmp_peep=on -CG:p2align=0 -CG:load_exe=1 -CG:dsched=on -INLINE:aggressive=on -LNO:prefetch=2 -LNO:prefetch_ahead=4 -mso -march=bdver2 C++ benchmarks: 444.namd: -Ofast -IPA:plimit=3000 -LNO:ignore_feedback=off -CG:local_sched_alg=0 -CG:load_exe=0 -OPT:unroll_size=256 -fno-exceptions -HP:bdt=2m:heap=2m -LNO:if_select_conv=1 -OPT:alias=disjoint -LNO:psimd_iso_unroll=ON -march=bdver1 447.dealII: -Ofast -D__OPEN64_FAST_SET -static -INLINE:aggressive=on -LNO:opt=1 -LNO:simd=2 -fno-emit-exceptions -m32 -OPT:unroll_times_max=8 -OPT:unroll_size=256 -OPT:unroll_level=2 -HP:bdt=2m:heap=2m -GRA:unspill=on -CG:cmp_peep=on -CG:movext_icmp=off -TENV:frame_pointer=off -march=bdver1 450.soplex: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O3 -LNO:ignore_feedback=off -INLINE:aggressive=on -OPT:RO=1 -OPT:IEEE_arith=3 -OPT:IEEE_NaN_Inf=off -OPT:fold_unsigned_relops=on -fno-exceptions -CG:p2align=0 -m32 -mno-fma4 -HP:bdt=2m:heap=2m -WOPT:sib=on -march=bdver1 453.povray: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -CG:pre_local_sched=off -CG:p2align=0 -CG:p2align_split=on -CG:dsched=on -INLINE:aggressive=on -HP:bd=2m:heap=2m -OPT:transform=2 -OPT:alias=disjoint -WOPT:aggcm=0 -march=bdver2 Fortran benchmarks: 410.bwaves: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -OPT:Ofast -OPT:treeheight=on -LNO:blocking=off -LNO:ignore_feedback=off -LNO:fu=4 -LNO:loop_model_simd=on -LNO:simd_rm_unity_remainder=on -WOPT:aggstr=0 -HP:bdt=2m:heap=2m -CG:cmp_peep=on -march=bdver1 416.gamess: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:fu=6 -LNO:blocking=0 -LNO:simd=2 -OPT:ro=3 -OPT:recip=on -CG:local_sched_alg=1 -HP:bdt=2m:heap=2m -WOPT:sib=on -march=bdver1 434.zeusmp: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:blocking=off -LNO:interchange=off -IPA:plimit=1500 -HP:bdt=2m:heap=2m -march=bdver1 437.leslie3d: -Ofast -CG:pre_minreg_level=2 -LNO:simd=0 -LNO:fusion=2 -HP:bdt=2m:heap=2m -mso -march=bdver1 459.GemsFDTD: -Ofast -IPA:plimit=1500 -OPT:unroll_size=1024 -OPT:unroll_times_max=16 -LNO:fission=2 -CG:local_sched_alg=2 -HP -march=bdver1 465.tonto: -Ofast -OPT:alias=no_f90_pointer_alias -LNO:blocking=off -CG:load_exe=1 -CG:local_sched_alg=3 -IPA:plimit=525 -HP:bdt=2m:heap=2m -march=bdver1 Benchmarks using both Fortran and C: 435.gromacs: -Ofast -OPT:rsqrt=2 -HP:bdt=2m:heap=2m -CG:local_sched_alg=2 -CG:load_exe=3 -GRA:unspill=on -march=bdver1 -LNO:simd=3 436.cactusADM: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:blocking=off -LNO:prefetch=2 -LNO:pf2=0 -LNO:prefetch_ahead=4 -HP -CG:locs_shallow_depth=1 -CG:load_exe=0 -CG:dsched=on -WOPT:sib=on -march=bdver1 454.calculix: -Ofast -OPT:unroll_size=256 -OPT:alias=disjoint -GRA:optimize_boundary=on -CG:dsched=on -HP:bdt=2m:heap=2m -march=bdver1 481.wrf: -Ofast -LNO:blocking=off -LANG:copyinout=off -IPA:callee_limit=5000 -GRA:prioritize_by_density=on -HP -WOPT:sib=on -march=bdver1 The flags file that was used to format this result can be browsed at http://www.spec.org/cpu2006/flags/x86-open64-452-flags-rate-revA-II.html You can also download the XML flags source by saving the following link: http://www.spec.org/cpu2006/flags/x86-open64-452-flags-rate-revA-II.xml SPEC and SPECfp are registered trademarks of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 2006-2014 Standard Performance Evaluation Corporation Tested with SPEC CPU2006 v1.2. Report generated on Thu Jul 24 13:24:09 2014 by CPU2006 ASCII formatter v6932. Originally published on 4 December 2012.