                           SPEC(R) CFP2006 Summary
         Supermicro Supermicro A+ Server 1022G-NTF, AMD Opteron 6378
                     Test Sponsor: Advanced Micro Devices
                           Tue Sep 18 10:22:38 2012

CPU2006 License: 49                                      Test date: Sep-2012
Test sponsor: Advanced Micro Devices         Hardware availability: Nov-2012
Tested by:    Advanced Micro Devices         Software availability: Aug-2012

                 Base       Base       Base       Peak       Peak       Peak
Benchmarks     Copies   Run Time       Rate     Copies   Run Time       Rate
-------------- ------  ---------  ---------     ------  ---------  ---------
410.bwaves         32       1363        319 S       32       1341        324 S
410.bwaves         32       1365        319 S       32       1340        324 S
410.bwaves         32       1364        319 *       32       1341        324 *
416.gamess         32       1883        333 S       32       1738        360 S
416.gamess         32       1900        330 S       32       1734        361 S
416.gamess         32       1900        330 *       32       1734        361 *
433.milc           32       1097        268 S       32        939        313 *
433.milc           32       1096        268 *       32        940        313 S
433.milc           32       1096        268 S       32        939        313 S
434.zeusmp         32        679        429 S       32        659        442 S
434.zeusmp         32        679        429 *       32        664        439 S
434.zeusmp         32        684        426 S       32        662        440 *
435.gromacs        32        568        402 *       32        456        501 S
435.gromacs        32        568        402 S       32        455        502 S
435.gromacs        32        568        402 S       32        455        502 *
436.cactusADM      32        765        500 S       32        681        561 S
436.cactusADM      32        748        511 *       32        678        564 S
436.cactusADM      32        747        512 S       32        679        563 *
437.leslie3d       32       1364        221 *       32       1055        285 S
437.leslie3d       32       1364        220 S       32       1055        285 *
437.leslie3d       32       1363        221 S       32       1056        285 S
444.namd           32        778        330 S       32        669        384 S
444.namd           32        787        326 S       32        665        386 S
444.namd           32        784        327 *       32        666        385 *
447.dealII         32        496        739 S       32        454        806 S
447.dealII         32        504        726 S       32        459        798 S
447.dealII         32        503        727 *       32        457        801 *
450.soplex         32       1012        264 S       32        920        290 *
450.soplex         32       1004        266 *       32        925        289 S
450.soplex         32       1003        266 S       32        920        290 S
453.povray         32        383        445 *       32        338        503 S
453.povray         32        382        445 S       32        338        504 *
453.povray         32        383        445 S       32        337        505 S
454.calculix       32        409        646 *       32        396        667 S
454.calculix       32        410        644 S       32        393        671 *
454.calculix       32        408        647 S       32        392        673 S
459.GemsFDTD       32       1663        204 S       32       1459        233 S
459.GemsFDTD       32       1666        204 *       32       1458        233 S
459.GemsFDTD       32       1667        204 S       32       1458        233 *
465.tonto          32        803        392 S       32        734        429 S
465.tonto          32        837        376 S       32        740        425 *
465.tonto          32        818        385 *       32        765        411 S
470.lbm            32       1013        434 S       32       1013        434 S
470.lbm            32       1010        435 S       32       1010        435 S
470.lbm            32       1011        435 *       32       1011        435 *
481.wrf            32        913        391 *       32        913        392 *
481.wrf            32        910        393 S       32        911        392 S
481.wrf            32        918        390 S       32        913        391 S
482.sphinx3        32       1831        341 S       32       1406        444 S
482.sphinx3        32       1829        341 *       32       1444        432 S
482.sphinx3        32       1827        341 S       32       1407        443 *
==============================================================================
410.bwaves         32       1364        319 *       32       1341        324 *
416.gamess         32       1900        330 *       32       1734        361 *
433.milc           32       1096        268 *       32        939        313 *
434.zeusmp         32        679        429 *       32        662        440 *
435.gromacs        32        568        402 *       32        455        502 *
436.cactusADM      32        748        511 *       32        679        563 *
437.leslie3d       32       1364        221 *       32       1055        285 *
444.namd           32        784        327 *       32        666        385 *
447.dealII         32        503        727 *       32        457        801 *
450.soplex         32       1004        266 *       32        920        290 *
453.povray         32        383        445 *       32        338        504 *
454.calculix       32        409        646 *       32        393        671 *
459.GemsFDTD       32       1666        204 *       32       1458        233 *
465.tonto          32        818        385 *       32        740        425 *
470.lbm            32       1011        435 *       32       1011        435 *
481.wrf            32        913        391 *       32        913        392 *
482.sphinx3        32       1829        341 *       32       1407        443 *
SPECfp(R)_rate_base2006                 370
SPECfp_rate2006                                                          413


                                   HARDWARE
                                   --------
            CPU Name: AMD Opteron 6378
 CPU Characteristics: AMD Turbo CORE technology up to 3.30 GHz
             CPU MHz: 2400
                 FPU: Integrated
      CPU(s) enabled: 32 cores, 2 chips, 16 cores/chip
    CPU(s) orderable: 1,2 chips
       Primary Cache: 512 KB I on chip per chip, 64 KB I shared / 2 cores;
                      16 KB D on chip per core
     Secondary Cache: 16 MB I+D on chip per chip, 2 MB shared / 2 cores
            L3 Cache: 16 MB I+D on chip per chip, 8 MB shared / 8 cores
         Other Cache: None
              Memory: 128 GB (16 x 8 GB 2Rx4 PC3-12800R-11, ECC)
      Disk Subsystem: 1 x 250 GB SATA, 7200 RPM
      Other Hardware: None


                                   SOFTWARE
                                   --------
    Operating System: Red Hat Enterprise Linux Server release 6.3,
                      Kernel 2.6.32-279.el6.x86_64
            Compiler: C/C++/Fortran: Version 4.5.2 of x86 Open64
                      Compiler Suite (from AMD)
       Auto Parallel: No
         File System: ext3
        System State: Run level 3 (Full
                      multiuser with network)
       Base Pointers: 64-bit
       Peak Pointers: 32/64-bit
      Other Software: None


                                 Submit Notes
                                 ------------
    The config file option 'submit' was used.
    'numactl' was used to bind copies to the cores.
    See the configuration file for details.

                            Operating System Notes
                            ----------------------
    'ulimit -s unlimited' was used to set environment stack size
    'ulimit -l 2097152' was used to set environment locked pages in memory limit
    Set transparent_hugepage=never as a boot parameter in /boot/grub/menu.lst
    Set vm/nr_hugepages=28672 in /etc/sysctl.conf
    mount -t hugetlbfs nodev /mnt/hugepages

                                General Notes
                                -------------
    Environment variables set by runspec before the start of the run:
    HUGETLB_LIMIT = "896"
    LD_LIBRARY_PATH = "/root/work/cpu2006v1.2/amd1206-rate-libs-revA/32:/root/work/cpu2006v1.2/amd1206-rate-libs-revA/64"

    The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
    http://developer.amd.com/cpu/open64
    Binaries were compiled on a system with 2x AMD Opteron 6386SE chips + 128GB Memory using RHEL 6.3

                           Base Compiler Invocation
                           ------------------------
C benchmarks:
     opencc

C++ benchmarks:
     openCC

Fortran benchmarks:
     openf95

Benchmarks using both Fortran and C:
     opencc openf95


                            Base Portability Flags
                            ----------------------
     410.bwaves: -DSPEC_CPU_LP64
     416.gamess: -DSPEC_CPU_LP64
       433.milc: -DSPEC_CPU_LP64
     434.zeusmp: -DSPEC_CPU_LP64
    435.gromacs: -DSPEC_CPU_LP64
  436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore
   437.leslie3d: -DSPEC_CPU_LP64
       444.namd: -DSPEC_CPU_LP64
     447.dealII: -DSPEC_CPU_LP64
     450.soplex: -DSPEC_CPU_LP64
     453.povray: -DSPEC_CPU_LP64
   454.calculix: -DSPEC_CPU_LP64
   459.GemsFDTD: -DSPEC_CPU_LP64
      465.tonto: -DSPEC_CPU_LP64
        470.lbm: -DSPEC_CPU_LP64
        481.wrf: -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LP64
                 -fno-second-underscore
    482.sphinx3: -DSPEC_CPU_LP64


                           Base Optimization Flags
                           -----------------------
C benchmarks:
     -Ofast -OPT:malloc_alg=1 -HP:bd=2m:heap=2m -IPA:plimit=8000
     -IPA:small_pu=100 -mso -march=bdver1

C++ benchmarks:
     -Ofast -static -CG:load_exe=0 -OPT:malloc_alg=1 -INLINE:aggressive=on
     -HP:bd=2m:heap=2m -D__OPEN64_FAST_SET -march=bdver1

Fortran benchmarks:
     -Ofast -LNO:blocking=off -LNO:simd_peel_align=on -OPT:rsqrt=2
     -OPT:unroll_size=256 -HP:bd=2m:heap=2m -mso -march=bdver1

Benchmarks using both Fortran and C:
     -Ofast -OPT:malloc_alg=1 -HP:bd=2m:heap=2m -IPA:plimit=8000
     -IPA:small_pu=100 -mso -march=bdver1 -LNO:blocking=off
     -LNO:simd_peel_align=on -OPT:rsqrt=2 -OPT:unroll_size=256


                           Peak Compiler Invocation
                           ------------------------
C benchmarks:
     opencc

C++ benchmarks:
     openCC

Fortran benchmarks:
     openf95

Benchmarks using both Fortran and C:
     opencc openf95


                            Peak Portability Flags
                            ----------------------
     410.bwaves: -DSPEC_CPU_LP64
     416.gamess: -DSPEC_CPU_LP64
       433.milc: -DSPEC_CPU_LP64
     434.zeusmp: -DSPEC_CPU_LP64
    435.gromacs: -DSPEC_CPU_LP64
  436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore
   437.leslie3d: -DSPEC_CPU_LP64
       444.namd: -DSPEC_CPU_LP64
     453.povray: -DSPEC_CPU_LP64
   454.calculix: -DSPEC_CPU_LP64
   459.GemsFDTD: -DSPEC_CPU_LP64
      465.tonto: -DSPEC_CPU_LP64
        470.lbm: -DSPEC_CPU_LP64
        481.wrf: -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LP64
                 -fno-second-underscore


                           Peak Optimization Flags
                           -----------------------
C benchmarks:

       433.milc: -Ofast -CG:movnti=1 -CG:locs_best=on -HP:bdt=2m:heap=2m
                 -IPA:plimit=7000 -IPA:callee_limit=1200
                 -OPT:struct_array_copy=2 -OPT:alias=field_sensitive -mso
                 -march=bdver1

        470.lbm: basepeak = yes

    482.sphinx3: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast
                 -m32 -IPA:plimit=1000 -OPT:malloc_alg=2 -CG:cmp_peep=on
                 -CG:p2align=0 -CG:load_exe=1 -CG:dsched=on
                 -INLINE:aggressive=on -LNO:prefetch=2 -LNO:prefetch_ahead=4
                 -mso -march=bdver2

C++ benchmarks:

       444.namd: -Ofast -IPA:plimit=3000 -LNO:ignore_feedback=off
                 -CG:local_sched_alg=0 -CG:load_exe=0 -OPT:unroll_size=256
                 -fno-exceptions -HP:bdt=2m:heap=2m -LNO:if_select_conv=1
                 -OPT:alias=disjoint -LNO:psimd_iso_unroll=ON -march=bdver1

     447.dealII: -Ofast -D__OPEN64_FAST_SET -static -INLINE:aggressive=on
                 -LNO:opt=1 -LNO:simd=2 -fno-emit-exceptions -m32
                 -OPT:unroll_times_max=8 -OPT:unroll_size=256
                 -OPT:unroll_level=2 -HP:bdt=2m:heap=2m -GRA:unspill=on
                 -CG:cmp_peep=on -CG:movext_icmp=off
                 -TENV:frame_pointer=off -march=bdver1

     450.soplex: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O3
                 -LNO:ignore_feedback=off -INLINE:aggressive=on -OPT:RO=1
                 -OPT:IEEE_arith=3 -OPT:IEEE_NaN_Inf=off
                 -OPT:fold_unsigned_relops=on -fno-exceptions -CG:p2align=0
                 -m32 -mno-fma4 -HP:bdt=2m:heap=2m -WOPT:sib=on -march=bdver1

     453.povray: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast
                 -CG:pre_local_sched=off -CG:p2align=0 -CG:p2align_split=on
                 -CG:dsched=on -INLINE:aggressive=on -HP:bd=2m:heap=2m
                 -OPT:transform=2 -OPT:alias=disjoint -WOPT:aggcm=0
                 -march=bdver2

Fortran benchmarks:

     410.bwaves: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast
                 -OPT:Ofast -OPT:treeheight=on -LNO:blocking=off
                 -LNO:ignore_feedback=off -LNO:fu=4 -LNO:loop_model_simd=on
                 -LNO:simd_rm_unity_remainder=on -WOPT:aggstr=0
                 -HP:bdt=2m:heap=2m -CG:cmp_peep=on -march=bdver1

     416.gamess: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast
                 -LNO:fu=6 -LNO:blocking=0 -LNO:simd=2 -OPT:ro=3
                 -OPT:recip=on -CG:local_sched_alg=1 -HP:bdt=2m:heap=2m
                 -WOPT:sib=on -march=bdver1

     434.zeusmp: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast
                 -LNO:blocking=off -LNO:interchange=off -IPA:plimit=1500
                 -HP:bdt=2m:heap=2m -march=bdver1

   437.leslie3d: -Ofast -CG:pre_minreg_level=2 -LNO:simd=0 -LNO:fusion=2
                 -HP:bdt=2m:heap=2m -mso -march=bdver1

   459.GemsFDTD: -Ofast -IPA:plimit=1500 -OPT:unroll_size=1024
                 -OPT:unroll_times_max=16 -LNO:fission=2
                 -CG:local_sched_alg=2 -HP -march=bdver1

      465.tonto: -Ofast -OPT:alias=no_f90_pointer_alias -LNO:blocking=off
                 -CG:load_exe=1 -CG:local_sched_alg=3 -IPA:plimit=525
                 -HP:bdt=2m:heap=2m -march=bdver1

Benchmarks using both Fortran and C:

    435.gromacs: -Ofast -OPT:rsqrt=2 -HP:bdt=2m:heap=2m
                 -CG:local_sched_alg=2 -CG:load_exe=3 -GRA:unspill=on
                 -march=bdver1 -LNO:simd=3

  436.cactusADM: -fb_create
                 fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast
                 -LNO:blocking=off -LNO:prefetch=2 -LNO:pf2=0
                 -LNO:prefetch_ahead=4 -HP -CG:locs_shallow_depth=1
                 -CG:load_exe=0 -CG:dsched=on -WOPT:sib=on -march=bdver1

   454.calculix: -Ofast -OPT:unroll_size=256 -OPT:alias=disjoint
                 -GRA:optimize_boundary=on -CG:dsched=on -HP:bdt=2m:heap=2m
                 -march=bdver1

        481.wrf: -Ofast -LNO:blocking=off -LANG:copyinout=off
                 -IPA:callee_limit=5000 -GRA:prioritize_by_density=on -HP
                 -WOPT:sib=on -march=bdver1


The flags file that was used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-452-flags-rate-revA-II.html

You can also download the XML flags source by saving the following link:
http://www.spec.org/cpu2006/flags/x86-open64-452-flags-rate-revA-II.xml


SPEC and SPECfp are registered trademarks of the Standard Performance
Evaluation Corporation.  All other brand and product names appearing
in this result are trademarks or registered trademarks of their
respective holders.
-----------------------------------------------------------------------------
For questions about this result, please contact the tester.
For other inquiries, please contact webmaster@spec.org.
Copyright 2006-2014 Standard Performance Evaluation Corporation
Tested with SPEC CPU2006 v1.2.
Report generated on Thu Jul 24 12:59:36 2014 by CPU2006 ASCII formatter v6932.
Originally published on 5 November 2012.
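
Cross-check of the overall metrics (illustrative sketch, not part of the SPEC-generated report): SPEC CPU2006 rate metrics are the geometric mean of the per-benchmark median rates, the rows marked "*" in the results table. The sketch below recomputes both overall figures from the rounded median rates printed in this report. The list names and the helper function are hypothetical, chosen only for this illustration. Because the inputs are already rounded, the result should land close to, but not necessarily exactly on, the published 370 (base) and 413 (peak).

```python
from math import exp, log

# Median ("*") rates copied from the summary rows of the results table,
# in benchmark order from 410.bwaves through 482.sphinx3.
BASE_MEDIAN_RATES = [319, 330, 268, 429, 402, 511, 221, 327, 727,
                     266, 445, 646, 204, 385, 435, 391, 341]
PEAK_MEDIAN_RATES = [324, 361, 313, 440, 502, 563, 285, 385, 801,
                     290, 504, 671, 233, 425, 435, 392, 443]


def geometric_mean(values):
    """Return the geometric mean of a non-empty list of positive numbers."""
    return exp(sum(log(v) for v in values) / len(values))


base_score = geometric_mean(BASE_MEDIAN_RATES)
peak_score = geometric_mean(PEAK_MEDIAN_RATES)

# Compare against the SPECfp_rate_base2006 and SPECfp_rate2006 lines.
print(f"recomputed base: {base_score:.1f} (published 370)")
print(f"recomputed peak: {peak_score:.1f} (published 413)")
```

The geometric mean is used rather than the arithmetic mean so that no single benchmark with a large rate (447.dealII here) dominates the overall figure.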