SPEC(R) CFP2006 Summary Supermicro Supermicro A+ Server 1022G-NTF, AMD Opteron 6344 Test Sponsor: Advanced Micro Devices Sat Sep 22 03:53:11 2012 CPU2006 License: 49 Test date: Sep-2012 Test sponsor: Advanced Micro Devices Hardware availability: Nov-2012 Tested by: Advanced Micro Devices Software availability: Aug-2012 Base Base Base Peak Peak Peak Benchmarks Copies Run Time Rate Copies Run Time Rate -------------- ------ --------- --------- ------ --------- --------- 410.bwaves 24 1011 323 S 24 990 330 S 410.bwaves 24 1010 323 S 24 990 329 * 410.bwaves 24 1010 323 * 24 991 329 S 416.gamess 24 1739 270 * 24 1593 295 S 416.gamess 24 1740 270 S 24 1590 296 * 416.gamess 24 1732 271 S 24 1585 297 S 433.milc 24 816 270 S 24 696 317 * 433.milc 24 815 270 * 24 696 317 S 433.milc 24 815 270 S 24 695 317 S 434.zeusmp 24 609 359 * 24 592 369 S 434.zeusmp 24 608 359 S 24 600 364 S 434.zeusmp 24 614 356 S 24 596 366 * 435.gromacs 24 522 328 * 24 421 407 * 435.gromacs 24 521 329 S 24 420 408 S 435.gromacs 24 522 328 S 24 421 407 S 436.cactusADM 24 656 437 S 24 580 495 * 436.cactusADM 24 652 440 S 24 582 493 S 436.cactusADM 24 655 438 * 24 578 496 S 437.leslie3d 24 1015 222 * 24 834 271 S 437.leslie3d 24 1015 222 S 24 836 270 S 437.leslie3d 24 1016 222 S 24 835 270 * 444.namd 24 709 271 S 24 609 316 S 444.namd 24 709 272 * 24 609 316 S 444.namd 24 707 272 S 24 609 316 * 447.dealII 24 448 613 S 24 424 648 * 447.dealII 24 445 617 * 24 424 648 S 447.dealII 24 442 622 S 24 428 641 S 450.soplex 24 759 264 * 24 701 286 * 450.soplex 24 758 264 S 24 701 286 S 450.soplex 24 760 263 S 24 701 286 S 453.povray 24 352 363 S 24 310 412 S 453.povray 24 352 363 * 24 307 415 S 453.povray 24 352 363 S 24 308 414 * 454.calculix 24 377 525 * 24 362 548 S 454.calculix 24 375 528 S 24 361 548 * 454.calculix 24 379 523 S 24 361 548 S 459.GemsFDTD 24 1222 208 S 24 1095 232 S 459.GemsFDTD 24 1228 207 S 24 1094 233 S 459.GemsFDTD 24 1223 208 * 24 1095 233 * 465.tonto 24 730 323 * 24 663 356 * 465.tonto 24 734 322 S 24 659 358 S 465.tonto 24 728 325 S 24 701 337 S 470.lbm 24 777 424 S 24 777 424 S 470.lbm 24 781 422 S 24 781 422 S 470.lbm 24 780 423 * 24 780 423 * 481.wrf 24 716 375 * 24 710 378 * 481.wrf 24 734 365 S 24 711 377 S 481.wrf 24 714 375 S 24 709 378 S 482.sphinx3 24 1424 328 S 24 1182 396 * 482.sphinx3 24 1430 327 * 24 1224 382 S 482.sphinx3 24 1430 327 S 24 1179 397 S ============================================================================== 410.bwaves 24 1010 323 * 24 990 329 * 416.gamess 24 1739 270 * 24 1590 296 * 433.milc 24 815 270 * 24 696 317 * 434.zeusmp 24 609 359 * 24 596 366 * 435.gromacs 24 522 328 * 24 421 407 * 436.cactusADM 24 655 438 * 24 580 495 * 437.leslie3d 24 1015 222 * 24 835 270 * 444.namd 24 709 272 * 24 609 316 * 447.dealII 24 445 617 * 24 424 648 * 450.soplex 24 759 264 * 24 701 286 * 453.povray 24 352 363 * 24 308 414 * 454.calculix 24 377 525 * 24 361 548 * 459.GemsFDTD 24 1223 208 * 24 1095 233 * 465.tonto 24 730 323 * 24 663 356 * 470.lbm 24 780 423 * 24 780 423 * 481.wrf 24 716 375 * 24 710 378 * 482.sphinx3 24 1430 327 * 24 1182 396 * SPECfp(R)_rate_base2006 334 SPECfp_rate2006 369 HARDWARE -------- CPU Name: AMD Opteron 6344 CPU Characteristics: AMD Turbo CORE technology up to 3.20 GHz CPU MHz: 2600 FPU: Integrated CPU(s) enabled: 24 cores, 2 chips, 12 cores/chip CPU(s) orderable: 1,2 chips Primary Cache: 384 KB I on chip per chip, 64 KB I shared / 2 cores; 16 KB D on chip per core Secondary Cache: 12 MB I+D on chip per chip, 2 MB shared / 2 cores L3 Cache: 16 MB I+D on chip per chip, 8 MB shared / 6 cores Other Cache: None Memory: 128 GB (16 x 8 GB 2Rx4 PC3-12800R-11, ECC) Disk Subsystem: 1 x 250 GB SATA, 7200 RPM Other Hardware: None SOFTWARE -------- Operating System: Red Hat Enterprise Linux Server release 6.3, Kernel 2.6.32-279.el6.x86_64 Compiler: C/C++/Fortran: Version 4.5.2 of x86 Open64 Compiler Suite (from AMD) Auto Parallel: No File System: ext3 System State: Run level 3 (Full multiuser with network) Base Pointers: 64-bit Peak Pointers: 32/64-bit Other Software: None Submit Notes ------------ The config file option 'submit' was used. 'numactl' was used to bind copies to the cores. See the configuration file for details. Operating System Notes ---------------------- 'ulimit -s unlimited' was used to set environment stack size 'ulimit -l 2097152' was used to set environment locked pages in memory limit Set transparent_hugepage=never as a boot parameter in /boot/grub/menu.lst Set vm/nr_hugepages=21504 in /etc/sysctl.conf mount -t hugetlbfs nodev /mnt/hugepages General Notes ------------- Environment variables set by runspec before the start of the run: HUGETLB_LIMIT = "896" LD_LIBRARY_PATH = "/root/work/cpu2006v1.2/amd1206-rate-libs-revA/32:/root/work/cpu2006v1.2/amd1206-rate-libs-revA/64" The x86 Open64 Compiler Suite is only available from (and supported by) AMD at http://developer.amd.com/cpu/open64 Binaries were compiled on a system with 2x AMD Opteron 6386SE chips + 128GB Memory using RHEL 6.3 Base Compiler Invocation ------------------------ C benchmarks: opencc C++ benchmarks: openCC Fortran benchmarks: openf95 Benchmarks using both Fortran and C: opencc openf95 Base Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 447.dealII: -DSPEC_CPU_LP64 450.soplex: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LP64 -fno-second-underscore 482.sphinx3: -DSPEC_CPU_LP64 Base Optimization Flags ----------------------- C benchmarks: -Ofast -OPT:malloc_alg=1 -HP:bd=2m:heap=2m -IPA:plimit=8000 -IPA:small_pu=100 -mso -march=bdver1 C++ benchmarks: -Ofast -static -CG:load_exe=0 -OPT:malloc_alg=1 -INLINE:aggressive=on -HP:bd=2m:heap=2m -D__OPEN64_FAST_SET -march=bdver1 Fortran benchmarks: -Ofast -LNO:blocking=off -LNO:simd_peel_align=on -OPT:rsqrt=2 -OPT:unroll_size=256 -HP:bd=2m:heap=2m -mso -march=bdver1 Benchmarks using both Fortran and C: -Ofast -OPT:malloc_alg=1 -HP:bd=2m:heap=2m -IPA:plimit=8000 -IPA:small_pu=100 -mso -march=bdver1 -LNO:blocking=off -LNO:simd_peel_align=on -OPT:rsqrt=2 -OPT:unroll_size=256 Peak Compiler Invocation ------------------------ C benchmarks: opencc C++ benchmarks: openCC Fortran benchmarks: openf95 Benchmarks using both Fortran and C: opencc openf95 Peak Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LP64 -fno-second-underscore Peak Optimization Flags ----------------------- C benchmarks: 433.milc: -Ofast -CG:movnti=1 -CG:locs_best=on -HP:bdt=2m:heap=2m -IPA:plimit=7000 -IPA:callee_limit=1200 -OPT:struct_array_copy=2 -OPT:alias=field_sensitive -mso -march=bdver1 470.lbm: basepeak = yes 482.sphinx3: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -m32 -IPA:plimit=1000 -OPT:malloc_alg=2 -CG:cmp_peep=on -CG:p2align=0 -CG:load_exe=1 -CG:dsched=on -INLINE:aggressive=on -LNO:prefetch=2 -LNO:prefetch_ahead=4 -mso -march=bdver2 C++ benchmarks: 444.namd: -Ofast -IPA:plimit=3000 -LNO:ignore_feedback=off -CG:local_sched_alg=0 -CG:load_exe=0 -OPT:unroll_size=256 -fno-exceptions -HP:bdt=2m:heap=2m -LNO:if_select_conv=1 -OPT:alias=disjoint -LNO:psimd_iso_unroll=ON -march=bdver1 447.dealII: -Ofast -D__OPEN64_FAST_SET -static -INLINE:aggressive=on -LNO:opt=1 -LNO:simd=2 -fno-emit-exceptions -m32 -OPT:unroll_times_max=8 -OPT:unroll_size=256 -OPT:unroll_level=2 -HP:bdt=2m:heap=2m -GRA:unspill=on -CG:cmp_peep=on -CG:movext_icmp=off -TENV:frame_pointer=off -march=bdver1 450.soplex: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O3 -LNO:ignore_feedback=off -INLINE:aggressive=on -OPT:RO=1 -OPT:IEEE_arith=3 -OPT:IEEE_NaN_Inf=off -OPT:fold_unsigned_relops=on -fno-exceptions -CG:p2align=0 -m32 -mno-fma4 -HP:bdt=2m:heap=2m -WOPT:sib=on -march=bdver1 453.povray: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -CG:pre_local_sched=off -CG:p2align=0 -CG:p2align_split=on -CG:dsched=on -INLINE:aggressive=on -HP:bd=2m:heap=2m -OPT:transform=2 -OPT:alias=disjoint -WOPT:aggcm=0 -march=bdver2 Fortran benchmarks: 410.bwaves: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -OPT:Ofast -OPT:treeheight=on -LNO:blocking=off -LNO:ignore_feedback=off -LNO:fu=4 -LNO:loop_model_simd=on -LNO:simd_rm_unity_remainder=on -WOPT:aggstr=0 -HP:bdt=2m:heap=2m -CG:cmp_peep=on -march=bdver1 416.gamess: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:fu=6 -LNO:blocking=0 -LNO:simd=2 -OPT:ro=3 -OPT:recip=on -CG:local_sched_alg=1 -HP:bdt=2m:heap=2m -WOPT:sib=on -march=bdver1 434.zeusmp: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:blocking=off -LNO:interchange=off -IPA:plimit=1500 -HP:bdt=2m:heap=2m -march=bdver1 437.leslie3d: -Ofast -CG:pre_minreg_level=2 -LNO:simd=0 -LNO:fusion=2 -HP:bdt=2m:heap=2m -mso -march=bdver1 459.GemsFDTD: -Ofast -IPA:plimit=1500 -OPT:unroll_size=1024 -OPT:unroll_times_max=16 -LNO:fission=2 -CG:local_sched_alg=2 -HP -march=bdver1 465.tonto: -Ofast -OPT:alias=no_f90_pointer_alias -LNO:blocking=off -CG:load_exe=1 -CG:local_sched_alg=3 -IPA:plimit=525 -HP:bdt=2m:heap=2m -march=bdver1 Benchmarks using both Fortran and C: 435.gromacs: -Ofast -OPT:rsqrt=2 -HP:bdt=2m:heap=2m -CG:local_sched_alg=2 -CG:load_exe=3 -GRA:unspill=on -march=bdver1 -LNO:simd=3 436.cactusADM: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:blocking=off -LNO:prefetch=2 -LNO:pf2=0 -LNO:prefetch_ahead=4 -HP -CG:locs_shallow_depth=1 -CG:load_exe=0 -CG:dsched=on -WOPT:sib=on -march=bdver1 454.calculix: -Ofast -OPT:unroll_size=256 -OPT:alias=disjoint -GRA:optimize_boundary=on -CG:dsched=on -HP:bdt=2m:heap=2m -march=bdver1 481.wrf: -Ofast -LNO:blocking=off -LANG:copyinout=off -IPA:callee_limit=5000 -GRA:prioritize_by_density=on -HP -WOPT:sib=on -march=bdver1 The flags file that was used to format this result can be browsed at http://www.spec.org/cpu2006/flags/x86-open64-452-flags-rate-revA-II.html You can also download the XML flags source by saving the following link: http://www.spec.org/cpu2006/flags/x86-open64-452-flags-rate-revA-II.xml SPEC and SPECfp are registered trademarks of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 2006-2014 Standard Performance Evaluation Corporation Tested with SPEC CPU2006 v1.2. Report generated on Thu Jul 24 13:01:35 2014 by CPU2006 ASCII formatter v6932. Originally published on 5 November 2012.