                           SPEC(R) CFP2006 Summary
                    Sugon Sugon A320-G30 (AMD EPYC 7351P)
                           Wed Dec 6 20:11:57 2017

CPU2006 License: 9046                                    Test date: Dec-2017
Test sponsor: Sugon                          Hardware availability: Dec-2017
Tested by:    Sugon                          Software availability: Oct-2017

                   Base       Base       Base      Peak       Peak       Peak
Benchmarks       Copies   Run Time       Rate    Copies   Run Time       Rate
--------------   ------  ---------  ---------    ------  ---------  ---------
410.bwaves           32        637        683 S      16        297        731 S
410.bwaves           32        638        682 *      16        298        730 *
410.bwaves           32        638        681 S      16        298        730 S
416.gamess           32       1020        614 S      32        911        688 S
416.gamess           32       1021        614 S      32        909        689 *
416.gamess           32       1020        614 *      32        908        690 S
433.milc             32        556        529 S      16        217        677 *
433.milc             32        557        528 S      16        217        676 S
433.milc             32        556        528 *      16        217        677 S
434.zeusmp           32        313        932 S      32        298        978 *
434.zeusmp           32        312        933 *      32        298        977 S
434.zeusmp           32        312        933 S      32        297        981 S
435.gromacs          32        379        603 S      32        280        815 *
435.gromacs          32        381        600 *      32        281        814 S
435.gromacs          32        383        597 S      32        280        815 S
436.cactusADM        32        410        933 *      16        199        962 S
436.cactusADM        32        414        924 S      16        189       1010 *
436.cactusADM        32        410        933 S      16        187       1020 S
437.leslie3d         32        606        496 S      16        263        573 S
437.leslie3d         32        608        495 S      16        261        577 *
437.leslie3d         32        608        495 *      16        261        577 S
444.namd             32        479        535 *      32        396        648 S
444.namd             32        479        535 S      32        392        654 S
444.namd             32        480        535 S      32        395        650 *
447.dealII           32        313       1170 S      32        296       1240 *
447.dealII           32        306       1200 *      32        298       1230 S
447.dealII           32        301       1220 S      32        291       1260 S
450.soplex           32        531        503 *      16        274        488 S
450.soplex           32        531        503 S      16        272        491 S
450.soplex           32        530        503 S      16        273        488 *
453.povray           32        224        759 S      32        176        965 S
453.povray           32        222        766 *      32        174        977 S
453.povray           32        222        768 S      32        175        972 *
454.calculix         32        307        861 S      32        326        809 *
454.calculix         32        308        857 S      32        326        809 S
454.calculix         32        307        859 *      32        328        805 S
459.GemsFDTD         32        761        446 *      16        375        453 *
459.GemsFDTD         32        760        447 S      16        374        454 S
459.GemsFDTD         32        762        446 S      16        375        453 S
465.tonto            32        440        716 S      16        242        651 *
465.tonto            32        441        714 S      16        244        646 S
465.tonto            32        441        714 *      16        240        656 S
470.lbm              32        564        779 S      16        261        841 *
470.lbm              32        565        779 *      16        261        843 S
470.lbm              32        565        778 S      16        261        841 S
481.wrf              32        478        748 S      16        250        715 S
481.wrf              32        481        742 *      16        251        712 S
481.wrf              32        482        742 S      16        250        714 *
482.sphinx3          32        985        633 *      16        433        720 *
482.sphinx3          32        987        632 S      16        433        720 S
482.sphinx3          32        985        633 S      16        434        718 S
==============================================================================
410.bwaves           32        638        682 *      16        298        730 *
416.gamess           32       1020        614 *      32        909        689 *
433.milc             32        556        528 *      16        217        677 *
434.zeusmp           32        312        933 *      32        298        978 *
435.gromacs          32        381        600 *      32        280        815 *
436.cactusADM        32        410        933 *      16        189       1010 *
437.leslie3d         32        608        495 *      16        261        577 *
444.namd             32        479        535 *      32        395        650 *
447.dealII           32        306       1200 *      32        296       1240 *
450.soplex           32        531        503 *      16        273        488 *
453.povray           32        222        766 *      32        175        972 *
454.calculix         32        307        859 *      32        326        809 *
459.GemsFDTD         32        761        446 *      16        375        453 *
465.tonto            32        441        714 *      16        242        651 *
470.lbm              32        565        779 *      16        261        841 *
481.wrf              32        481        742 *      16        250        714 *
482.sphinx3          32        985        633 *      16        433        720 *
 SPECfp(R)_rate_base2006                  680
 SPECfp_rate2006                                                          742


                                   HARDWARE
                                   --------
            CPU Name: AMD EPYC 7351P
 CPU Characteristics: AMD Turbo CORE technology up to 2.90 GHz
             CPU MHz: 2400
                 FPU: Integrated
      CPU(s) enabled: 16 cores, 1 chip, 16 cores/chip, 2 threads/core
    CPU(s) orderable: 1 chip
       Primary Cache: 64 KB I + 32 KB D on chip per core
     Secondary Cache: 512 KB I+D on chip per core
            L3 Cache: 64 MB I+D on chip per chip, 8 MB shared / 2 cores
         Other Cache: None
              Memory: 512 GB (8 x 64 GB 4Rx4 PC4-2667V-L)
      Disk Subsystem: 1 x 2000 GB SATA, 7200 RPM
      Other Hardware: None


                                   SOFTWARE
                                   --------
    Operating System: Red Hat Enterprise Linux Server 7.4
                      Kernel 3.10.0-693.2.2
            Compiler: C/C++/Fortran: Version 4.5.2.1 of x86 Open64
                      Compiler Suite (from AMD)
       Auto Parallel: No
         File System: xfs
        System State: Run level 3 (Multi User)
       Base Pointers: 64-bit
       Peak Pointers: 32/64-bit
      Other Software: None


                                 Submit Notes
                                 ------------
    The config file option 'submit' was used.
    'numactl' was used to bind copies to the cores.
    See the configuration file for details.

                            Operating System Notes
                            ----------------------
    'ulimit -s unlimited' was used to set environment stack size
    'ulimit -l 2097152' was used to set environment locked pages in memory limit

    runspec command invoked through numactl i.e.:
    numactl --interleave=all runspec

    Set dirty_ratio=8 to limit dirty cache to 8% of memory
    Set swappiness=1 to swap only if necessary
    Set zone_reclaim_mode=1 to free local node memory and avoid remote memory
    sync then drop_caches=3 to reset caches before invoking runcpu

    Transparent huge pages were enabled for this run (OS default)

    Set vm/nr_hugepages=14336 in /etc/sysctl.conf
    mount -t hugetlbfs nodev /mnt/hugepages

                                Platform Notes
                                --------------
    BIOS settings:
     Determinism Slider = Power
     cTDP Control = Manual
     cTDP = 200

                                General Notes
                                -------------
    Environment variables set by runspec before the start of the run:
    HUGETLB_LIMIT = "896"
    LD_LIBRARY_PATH = "/home/cpu2006/amd1603-rate-libs-revB/32:/home/cpu2006/amd1603-rate-libs-revB/64"

    The binaries were built with the AMD supported x86 Open64 Compiler Suite,
    which is only available from AMD at
    http://developer.amd.com/tools-and-sdks/cpu-development/x86-open64-compiler-suite/

    Binaries were compiled on a system with 2 x AMD Opteron 6378 chips +
    128 GB Memory using RHEL 6.3

                           Base Compiler Invocation
                           ------------------------
C benchmarks:
     opencc

C++ benchmarks:
     openCC

Fortran benchmarks:
     openf95

Benchmarks using both Fortran and C:
     opencc openf95


                            Base Portability Flags
                            ----------------------
     410.bwaves: -DSPEC_CPU_LP64
     416.gamess: -DSPEC_CPU_LP64
       433.milc: -DSPEC_CPU_LP64
     434.zeusmp: -DSPEC_CPU_LP64
    435.gromacs: -DSPEC_CPU_LP64
  436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore
   437.leslie3d: -DSPEC_CPU_LP64
       444.namd: -DSPEC_CPU_LP64
     447.dealII: -DSPEC_CPU_LP64
     450.soplex: -DSPEC_CPU_LP64
     453.povray: -DSPEC_CPU_LP64
   454.calculix: -DSPEC_CPU_LP64
   459.GemsFDTD: -DSPEC_CPU_LP64
      465.tonto: -DSPEC_CPU_LP64
        470.lbm: -DSPEC_CPU_LP64
        481.wrf: -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LP64
                 -fno-second-underscore
    482.sphinx3: -DSPEC_CPU_LP64


                           Base Optimization Flags
                           -----------------------
C benchmarks:
     -Ofast -OPT:malloc_alg=1 -HP:bd=2m:heap=2m -IPA:plimit=8000
     -IPA:small_pu=100 -mso -march=bdver1 -mno-fma4 -mno-xop -mno-tbm -WB,
     -Wl, -z,muldefs

C++ benchmarks:
     -Ofast -static -CG:load_exe=0 -OPT:malloc_alg=1 -INLINE:aggressive=on
     -HP:bd=2m:heap=2m -D__OPEN64_FAST_SET -march=bdver2 -mno-fma4 -mno-xop
     -mno-tbm -WB, -Wl, -z,muldefs

Fortran benchmarks:
     -Ofast -LNO:blocking=off -LNO:simd_peel_align=on -OPT:rsqrt=2
     -OPT:unroll_size=256 -HP:bd=2m:heap=2m -mso -march=bdver1 -mno-fma4
     -mno-xop -mno-tbm -WB, -Wl, -z,muldefs

Benchmarks using both Fortran and C:
     -Ofast -OPT:malloc_alg=1 -HP:bd=2m:heap=2m -IPA:plimit=8000
     -IPA:small_pu=100 -mso -march=bdver1 -mno-fma4 -mno-xop -mno-tbm -WB,
     -Wl, -z,muldefs -LNO:blocking=off -LNO:simd_peel_align=on -OPT:rsqrt=2
     -OPT:unroll_size=256


                           Peak Compiler Invocation
                           ------------------------
C benchmarks:
     opencc

C++ benchmarks:
     openCC

Fortran benchmarks:
     openf95

Benchmarks using both Fortran and C:
     opencc openf95


                            Peak Portability Flags
                            ----------------------
     410.bwaves: -DSPEC_CPU_LP64
     416.gamess: -DSPEC_CPU_LP64
       433.milc: -DSPEC_CPU_LP64
     434.zeusmp: -DSPEC_CPU_LP64
    435.gromacs: -DSPEC_CPU_LP64
  436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore
   437.leslie3d: -DSPEC_CPU_LP64
       444.namd: -DSPEC_CPU_LP64
     453.povray: -DSPEC_CPU_LP64
   454.calculix: -DSPEC_CPU_LP64
   459.GemsFDTD: -DSPEC_CPU_LP64
      465.tonto: -DSPEC_CPU_LP64
        470.lbm: -DSPEC_CPU_LP64
        481.wrf: -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LP64
                 -fno-second-underscore


                           Peak Optimization Flags
                           -----------------------
C benchmarks:

       433.milc: -Ofast -CG:movnti=1 -CG:locs_best=on -HP:bdt=2m:heap=2m
                 -IPA:plimit=7000 -IPA:callee_limit=1200
                 -OPT:struct_array_copy=2 -OPT:alias=field_sensitive -mso
                 -march=bdver1 -mno-fma4

        470.lbm: -Ofast -CG:cmp_peep=on -OPT:keep_ext=on -HP:bdt=2m:heap=2m
                 -IPA:plimit=8000 -IPA:small_pu=100 -march=bdver1 -mno-fma4
                 -mso

    482.sphinx3: -Ofast -m32 -IPA:plimit=1000 -OPT:malloc_alg=2
                 -CG:cmp_peep=on -CG:p2align=0 -CG:load_exe=1 -CG:dsched=on
                 -INLINE:aggressive=on -LNO:prefetch=2 -LNO:prefetch_ahead=4
                 -mso -march=bdver2 -WB, -mno-fma4 -mno-tbm -mno-xop

C++ benchmarks:

       444.namd: -Ofast -IPA:plimit=3000 -LNO:ignore_feedback=off
                 -CG:local_sched_alg=0 -CG:load_exe=0 -OPT:unroll_size=256
                 -fno-exceptions -HP:bdt=2m:heap=2m -LNO:if_select_conv=1
                 -OPT:alias=disjoint -LNO:psimd_iso_unroll=ON -march=bdver2
                 -mno-fma4 -WB, -mno-xop -mno-tbm

     447.dealII: -Ofast -D__OPEN64_FAST_SET -static -INLINE:aggressive=on
                 -LNO:opt=1 -LNO:simd=2 -fno-emit-exceptions -m32
                 -OPT:unroll_times_max=8 -OPT:unroll_size=256
                 -OPT:unroll_level=2 -HP:bdt=2m:heap=2m -GRA:unspill=on
                 -CG:cmp_peep=on -CG:movext_icmp=off
                 -TENV:frame_pointer=off -march=bdver1 -mno-fma4

     450.soplex: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O3
                 -LNO:ignore_feedback=off -INLINE:aggressive=on -OPT:RO=1
                 -OPT:IEEE_arith=3 -OPT:IEEE_NaN_Inf=off
                 -OPT:fold_unsigned_relops=on -fno-exceptions -CG:p2align=0
                 -m32 -mno-fma4 -HP:bdt=2m:heap=2m -WOPT:sib=on
                 -march=bdver1

     453.povray: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast
                 -CG:pre_local_sched=off -CG:p2align=0 -CG:p2align_split=on
                 -CG:dsched=on -INLINE:aggressive=on -HP:bd=2m:heap=2m
                 -OPT:transform=2 -OPT:alias=disjoint -WOPT:aggcm=0
                 -march=bdver2 -mno-fma4 -WB, -mno-xop -mno-tbm -Wl,
                 -z,muldefs

Fortran benchmarks:

     410.bwaves: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast
                 -OPT:Ofast -OPT:treeheight=on -LNO:blocking=off
                 -LNO:ignore_feedback=off -LNO:fu=4 -LNO:loop_model_simd=on
                 -LNO:simd_rm_unity_remainder=on -WOPT:aggstr=0
                 -HP:bdt=2m:heap=2m -CG:cmp_peep=on -march=bdver2 -mno-fma4

     416.gamess: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast
                 -LNO:fu=6 -LNO:blocking=0 -LNO:simd=2 -OPT:ro=3
                 -OPT:recip=on -CG:local_sched_alg=1 -HP:bdt=2m:heap=2m
                 -WOPT:sib=on -march=bdver1 -mno-fma4

     434.zeusmp: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast
                 -LNO:blocking=off -LNO:interchange=off -IPA:plimit=1500
                 -HP:bdt=2m:heap=2m -march=bdver2 -mno-fma4

   437.leslie3d: -Ofast -CG:pre_minreg_level=2 -LNO:simd=0 -LNO:fusion=2
                 -HP:bdt=2m:heap=2m -mso -march=bdver1 -mno-fma4

   459.GemsFDTD: -Ofast -IPA:plimit=1500 -OPT:unroll_size=1024
                 -OPT:unroll_times_max=16 -LNO:fission=2
                 -CG:local_sched_alg=2 -HP -march=bdver1 -mno-fma4

      465.tonto: -Ofast -OPT:alias=no_f90_pointer_alias -LNO:blocking=off
                 -CG:load_exe=1 -CG:local_sched_alg=3 -IPA:plimit=525
                 -HP:bdt=2m:heap=2m -march=bdver2 -WB, -mno-fma4 -mno-tbm
                 -mno-xop

Benchmarks using both Fortran and C:

    435.gromacs: -Ofast -OPT:rsqrt=2 -HP:bdt=2m:heap=2m
                 -CG:local_sched_alg=2 -CG:load_exe=3 -GRA:unspill=on
                 -march=bdver2 -mno-fma4 -LNO:simd=3

  436.cactusADM: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast
                 -LNO:blocking=off -LNO:prefetch=2 -LNO:pf2=0
                 -LNO:prefetch_ahead=4 -HP -CG:locs_shallow_depth=1
                 -CG:load_exe=0 -CG:dsched=on -WOPT:sib=on -march=bdver2
                 -mno-fma4

   454.calculix: -Ofast -OPT:unroll_size=256 -OPT:alias=disjoint
                 -GRA:optimize_boundary=on -CG:dsched=on -HP:bdt=2m:heap=2m
                 -march=bdver1 -mno-fma4

        481.wrf: -Ofast -LNO:blocking=off -LANG:copyinout=off
                 -IPA:callee_limit=5000 -GRA:prioritize_by_density=on -HP
                 -WOPT:sib=on -march=bdver1 -mno-fma4


The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-openflags-rate-revA-I.html
http://www.spec.org/cpu2006/flags/Sugon-Naples-Platform-Settings-revC-I.html

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/x86-openflags-rate-revA-I.xml
http://www.spec.org/cpu2006/flags/Sugon-Naples-Platform-Settings-revC-I.xml

SPEC and SPECfp are registered trademarks of the Standard Performance
Evaluation Corporation.  All other brand and product names appearing in
this result are trademarks or registered trademarks of their respective
holders.
-----------------------------------------------------------------------------
For questions about this result, please contact the tester.
For other inquiries, please contact webmaster@spec.org.
Copyright 2006-2017 Standard Performance Evaluation Corporation
Tested with SPEC CPU2006 v1.2.
Report generated on Wed Dec 27 12:04:48 2017 by CPU2006 ASCII formatter v6932.
Originally published on 26 December 2017.
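
Reader's note (not part of the SPEC-generated report): the two composite
figures in this result, SPECfp(R)_rate_base2006 = 680 and SPECfp_rate2006 =
742, follow the usual CPU2006 convention of taking the geometric mean of the
per-benchmark median ("*") rates. The sketch below recomputes them from the
rounded rates printed in the summary table. The names BASE_MEDIAN_RATES,
PEAK_MEDIAN_RATES and geometric_mean are illustrative only and do not come
from the SPEC tools. Because the inputs here are the rounded values shown in
the report, the output should be expected to agree with the published figures
only to within about a point.

```python
import math

# Median ("*") rates copied from the summary table of this report,
# in benchmark order 410.bwaves ... 482.sphinx3.
BASE_MEDIAN_RATES = [682, 614, 528, 933, 600, 933, 495, 535, 1200,
                     503, 766, 859, 446, 714, 779, 742, 633]
PEAK_MEDIAN_RATES = [730, 689, 677, 978, 815, 1010, 577, 650, 1240,
                     488, 972, 809, 453, 651, 841, 714, 720]


def geometric_mean(values):
    """Return the geometric mean, computed via logarithms."""
    return math.exp(math.fsum(math.log(v) for v in values) / len(values))


if __name__ == "__main__":
    # Compare against the reported 680 (base) and 742 (peak).
    print(f"base: {geometric_mean(BASE_MEDIAN_RATES):.1f}")
    print(f"peak: {geometric_mean(PEAK_MEDIAN_RATES):.1f}")
```

A geometric mean is used rather than an arithmetic one so that no single
benchmark with a large rate (447.dealII here) dominates the composite.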