                        SPEC(R) ACCEL(TM) OCL Summary
                            Cray NVIDIA Tesla K20
                                   Cray XK7
                       Test Sponsor: Indiana University
                           Fri Mar 10 15:56:26 2017

ACCEL License: 3440A                                    Test date: Mar-2017
Test sponsor: Indiana University            Hardware availability: Apr-2013
Tested by:    Indiana University            Software availability: Jan-2017

                 Base       Base       Base    Peak       Peak       Peak
Benchmarks       Ref.   Run Time      Ratio    Ref.   Run Time      Ratio
-------------- ------  ---------  ---------  ------  ---------  ---------
101.tpacf         107       77.0       1.39 *
101.tpacf         107       77.1       1.39 S
101.tpacf         107       77.0       1.39 S
103.stencil       125       65.2       1.92 S
103.stencil       125       65.3       1.91 S
103.stencil       125       65.3       1.91 *
104.lbm           112       47.5       2.36 S
104.lbm           112       47.5       2.36 *
104.lbm           112       47.5       2.36 S
110.fft           111       63.2       1.76 *
110.fft           111       63.3       1.75 S
110.fft           111       63.2       1.76 S
112.spmv          147       90.2       1.63 *
112.spmv          147       90.3       1.63 S
112.spmv          147       90.1       1.63 S
114.mriq          109       22.6       4.83 S
114.mriq          109       22.6       4.83 S
114.mriq          109       22.6       4.83 *
116.histo         114        111       1.02 *
116.histo         114        111       1.02 S
116.histo         114        111       1.02 S
117.bfs           117       69.7       1.68 S
117.bfs           117       69.7       1.68 *
117.bfs           117       70.1       1.67 S
118.cutcp          99       43.7       2.27 S
118.cutcp          99       43.7       2.26 *
118.cutcp          99       43.9       2.26 S
120.kmeans        100       94.5       1.06 *
120.kmeans        100       94.1       1.06 S
120.kmeans        100       94.6       1.06 S
121.lavamd        109       21.5       5.07 *
121.lavamd        109       20.9       5.22 S
121.lavamd        109       21.8       4.99 S
122.cfd           126       79.9       1.58 *
122.cfd           126       79.8       1.58 S
122.cfd           126       80.0       1.58 S
123.nw            115       81.8       1.41 S
123.nw            115       81.8       1.41 *
123.nw            115       82.1       1.40 S
124.hotspot       114       47.1       2.42 *
124.hotspot       114       47.0       2.43 S
124.hotspot       114       47.4       2.41 S
125.lud           119        111       1.07 *
125.lud           119        111       1.07 S
125.lud           119        111       1.07 S
126.ge            155       51.5       3.01 S
126.ge            155       51.6       3.01 *
126.ge            155       51.7       3.00 S
127.srad          114       76.1       1.50 S
127.srad          114       76.3       1.49 S
127.srad          114       76.3       1.49 *
128.heartwall     106        157      0.675 *
128.heartwall     106        157      0.675 S
128.heartwall     106        157      0.675 S
140.bplustree     108        113      0.952 S
140.bplustree     108        113      0.953 S
140.bplustree     108        113      0.953 *
==============================================================================
101.tpacf         107       77.0       1.39 *
103.stencil       125       65.3       1.91 *
104.lbm           112       47.5       2.36 *
110.fft           111       63.2       1.76 *
112.spmv          147       90.2       1.63 *
114.mriq          109       22.6       4.83 *
116.histo         114        111       1.02 *
117.bfs           117       69.7       1.68 *
118.cutcp          99       43.7       2.26 *
120.kmeans        100       94.5       1.06 *
121.lavamd        109       21.5       5.07 *
122.cfd           126       79.9       1.58 *
123.nw            115       81.8       1.41 *
124.hotspot       114       47.1       2.42 *
125.lud           119        111       1.07 *
126.ge            155       51.6       3.01 *
127.srad          114       76.3       1.49 *
128.heartwall     106        157      0.675 *
140.bplustree     108        113      0.953 *
 SPECaccel_ocl_base                    1.72
 SPECaccel_ocl_peak                 Not Run


                                   HARDWARE
                                   --------
            CPU Name: AMD Opteron 6276
 CPU Characteristics: AMD Turbo CORE Technology up to 3.2GHz, Turbo CORE off
             CPU MHz: 2300
     CPU MHz Maximum: 3200
                 FPU: Integrated
      CPU(s) enabled: 16 cores, 1 chip, 16 cores/chip
    CPU(s) orderable: 1 chip
       Primary Cache: 32 KB I + 16 KB D on chip per core
     Secondary Cache: 16 MB I+D on chip per chip, 2 MB shared / 2 cores
            L3 Cache: 16 MB I+D on chip per chip, 8 MB shared / 8 cores
         Other Cache: None
              Memory: 32 GB (4 x 8 GB 2Rx4 PC3L-12800R-11, ECC)
      Disk Subsystem: None
      Other Hardware: None


                                 ACCELERATOR
                                 -----------
    Accel Model Name: Tesla K20
        Accel Vendor: NVIDIA
          Accel Name: NVIDIA Tesla K20
       Type of Accel: GPU
    Accel Connection: PCIe 2.0 16x
  Does Accel Use ECC: yes
   Accel Description: NVIDIA Tesla K20m GPU, 2496 CUDA cores, 706MHz,
                      5 GB GDDR5 RAM
        Accel Driver: NVIDIA UNIX x86_64 Kernel Module 352.68


                                   SOFTWARE
                                   --------
    Operating System: SUSE Linux Enterprise Server 11 (x86_64),
                      Cray Linux Environment 5.2
                      3.0.101-0.46.1_1.0502.8871-cray_gem_c
            Compiler: PGI Accelerator Fortran/C/C++ Server, Release 17.1
         File System: NFSv3 (DDN SFA12KE) over 10GB Ethernet
        System State: Run level 3 (Multi-user)
      Other Software: NVIDIA CUDA 7.5.18


                                Platform Notes
                                --------------
     Sysinfo program /N/dc2/projects/hpc/lijunj/SPEC/accel-1.1-run/bigred2/Docs/sysinfo
     $Rev: 6965 $ $Date:: 2015-04-21 #$ c05a7f14b1b1765e3fe1df68447e8a35
     running on nid00221 Fri Mar 10 15:56:31 2017

     This section contains SUT (System Under Test) info as seen by some common utilities.
     To remove or add to this section, see:
       http://www.spec.org/accel/Docs/config.html#sysinfo

     From /proc/cpuinfo
        model name : AMD Opteron(TM) Processor 6276
           1 "physical id"s (chips)
           16 "processors"
        cores, siblings (Caution: counting these is hw and system dependent. The
        following excerpts from /proc/cpuinfo might not be reliable. Use with
        caution.)
           cpu cores : 8
           siblings : 16
           physical 0: cores 0 1 2 3 4 5 6 7
        cache size : 2048 KB

     From /proc/meminfo
        MemTotal:        33083764 kB
        HugePages_Total:        0
        Hugepagesize:        2048 kB

     /usr/bin/lsb_release -d
        SUSE Linux Enterprise Server 11 (x86_64)

     From /etc/*release* /etc/*version*
        SuSE-release:
           SUSE Linux Enterprise Server 11 (x86_64)
           VERSION = 11
           PATCHLEVEL = 3

     uname -a:
        Linux nid00221 3.0.101-0.46.1_1.0502.8871-cray_gem_c #1 SMP Sat Oct 22
        15:26:43 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux

     SPEC is set to: /N/dc2/projects/hpc/lijunj/SPEC/accel-1.1-run/bigred2
        Filesystem             Type    Size  Used  Avail  Use%  Mounted on
        10.10.0.171@o2ib:/dc2  lustre  5.3P  5.0P   222T   96%  /N/dc2

     Cannot run dmidecode; consider saying 'chmod +s /usr/sbin/dmidecode'

     (End of data from sysinfo program)


                           Base Runtime Environment
                           ------------------------
  C benchmarks:
     OpenCL Platform: NVIDIA CUDA, OpenCL 1.2 CUDA 7.5.23
     OpenCL Device #0: Tesla K20, v 352.68

C++ benchmarks:
     OpenCL Platform: NVIDIA CUDA, OpenCL 1.2 CUDA 7.5.23
     OpenCL Device #0: Tesla K20, v 352.68


                           Base Compiler Invocation
                           ------------------------
  C benchmarks:
     pgcc

C++ benchmarks:
     pgc++


                            Base Portability Flags
                            ----------------------
  118.cutcp: -D__GNUC__


                           Base Optimization Flags
                           -----------------------
  C benchmarks:
     -fast -ta=tesla:cc35 -ta=tesla:cuda7.5 -Mfprelaxed

C++ benchmarks:
     -fast -ta=tesla:cc35 -ta=tesla:cuda7.5 -Mfprelaxed


                               Base Other Flags
                               ----------------
  C benchmarks (except as noted below):
     -I/opt/nvidia/cudatoolkit7.5/7.5.18-1.0502.10743.2.1/include
     -L/opt/nvidia/cudatoolkit7.5/7.5.18-1.0502.10743.2.1/lib64 -lOpenCL

  116.histo: -DSPEC_LOCAL_MEMORY_HEADROOM=1
             -I/opt/nvidia/cudatoolkit7.5/7.5.18-1.0502.10743.2.1/include
             -L/opt/nvidia/cudatoolkit7.5/7.5.18-1.0502.10743.2.1/lib64 -lOpenCL

C++ benchmarks:
     -I/opt/nvidia/cudatoolkit7.5/7.5.18-1.0502.10743.2.1/include
     -L/opt/nvidia/cudatoolkit7.5/7.5.18-1.0502.10743.2.1/lib64 -lOpenCL


The flags file that was used to format this result can be browsed at
http://www.spec.org/accel/flags/pgi2017_flags.20170426.html

You can also download the XML flags source by saving the following link:
http://www.spec.org/accel/flags/pgi2017_flags.20170426.xml


SPEC and SPEC ACCEL are registered trademarks or trademarks of the Standard
Performance Evaluation Corporation. All other brand and product names
appearing in this result are trademarks or registered trademarks of their
respective holders.
---------------------------------------------------------------------------------
For questions about this result, please contact the tester.
For other inquiries, please contact webmaster@spec.org.
Copyright 2014-2017 Standard Performance Evaluation Corporation
Tested with SPEC ACCEL v1.1.
Report generated on Wed Apr 26 11:41:28 2017 by ACCEL ASCII formatter v1290.
Originally published on 26 April 2017.
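

Worked check of the summary figures. The sketch below is not part of the SPEC tooling; it only illustrates how the numbers in this report relate to one another. It assumes that each ratio is the reference time divided by the measured run time, and that SPECaccel_ocl_base is the geometric mean of the selected ("*") per-benchmark ratios. The names SELECTED, ratio, and geometric_mean are hypothetical helpers introduced for this illustration. Because the report prints run times and ratios rounded to three significant digits, recomputed values can differ slightly in the last digit.

```python
import math

# Selected ("*") base results copied from the summary table above:
# benchmark -> (reference time, run time in seconds, reported ratio)
SELECTED = {
    "101.tpacf":     (107, 77.0, 1.39),
    "103.stencil":   (125, 65.3, 1.91),
    "104.lbm":       (112, 47.5, 2.36),
    "110.fft":       (111, 63.2, 1.76),
    "112.spmv":      (147, 90.2, 1.63),
    "114.mriq":      (109, 22.6, 4.83),
    "116.histo":     (114, 111, 1.02),
    "117.bfs":       (117, 69.7, 1.68),
    "118.cutcp":     (99, 43.7, 2.26),
    "120.kmeans":    (100, 94.5, 1.06),
    "121.lavamd":    (109, 21.5, 5.07),
    "122.cfd":       (126, 79.9, 1.58),
    "123.nw":        (115, 81.8, 1.41),
    "124.hotspot":   (114, 47.1, 2.42),
    "125.lud":       (119, 111, 1.07),
    "126.ge":        (155, 51.6, 3.01),
    "127.srad":      (114, 76.3, 1.49),
    "128.heartwall": (106, 157, 0.675),
    "140.bplustree": (108, 113, 0.953),
}


def ratio(ref_time, run_time):
    """Assumed definition: reference time divided by measured run time."""
    return ref_time / run_time


def geometric_mean(values):
    """Geometric mean computed through logarithms for numerical stability."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


# Geometric mean of the reported (rounded) ratios.
overall = geometric_mean(r for _, _, r in SELECTED.values())

for name, (ref, run, reported) in SELECTED.items():
    print(f"{name:<14} reported {reported:<6} recomputed {ratio(ref, run):.3f}")
print(f"geometric mean of reported ratios: {overall:.2f}")
```

Under these assumptions the geometric mean of the nineteen reported ratios should land close to the SPECaccel_ocl_base value of 1.72 shown in the summary.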