ASUS (Test Sponsor: NVIDIA Corporation) NVIDIA Tesla K40c ASUS P9X79 Motherboard |
SPECaccel_acc_base = 2.59 SPECaccel_acc_energy_base = 3.01 |
SPECaccel_acc_peak = 2.73 SPECaccel_acc_energy_peak = 3.13 |
ACCEL license: | 019 | Test date: | Feb-2014 |
---|---|---|---|
Test sponsor: | NVIDIA Corporation | Hardware Availability: | Nov-2013 |
Tested by: | NVIDIA Corporation | Software Availability: | Feb-2014 |
Hardware | |
---|---|
CPU Name: | Intel Core i7-3930K |
CPU Characteristics: | |
CPU MHz: | 3200 |
CPU MHz Maximum: | 3800 |
FPU: | Integrated |
CPU(s) enabled: | 6 cores, 1 chip, 6 cores/chip, 2 threads/core |
CPU(s) orderable: | 1 chip |
Primary Cache: | 32 KB I + 32 KB D on chip per core |
Secondary Cache: | 256 KB I+D on chip per core |
L3 Cache: | 12 MB I+D on chip per chip |
Other Cache: | None |
Memory: | 8 GB (2 x 4 GB 2Rx4 PC3-14900R-9, running at 1600 MHz) |
Disk Subsystem: | 1000 GB Seagate ST1000DM003 7200 RPM SATA |
Other Hardware: | None |
Accelerator | |
---|---|
Accel Model Name: | Tesla K40c |
Accel Vendor: | NVIDIA |
Accel Name: | NVIDIA Tesla K40c |
Type of Accel: | GPU |
Accel Connection: | PCIe 3.0 16x |
Does Accel Use ECC: | Yes |
Accel Description: | GPU Boost set to maximum graphic clock frequency of 875 MHz. See notes below. |
Accel Driver: | NVIDIA UNIX x86_64 Kernel Module 319.60 |
Software | |
---|---|
Operating System: | Red Hat Enterprise Linux Server release 6.4 (Santiago) 2.6.32-358.el6.x86_64 |
Compiler: | PGI Accelerator Server Complete, Release 14.2 |
File System: | ext4 |
System State: | Run level 3 (multi-user) |
Other Software: | FFTW 3.3.3 |
Power | |
---|---|
Power Supply: | 1200 W |
Power Supply Details: | Thermaltake SMART M1200W |
Max. Power (W): | 398.64 |
Idle Power (W): | 94.87 |
Min. Temperature (C): | 26.56 |
Power Analyzer | |
---|---|
Power Analyzer: | Power Analyzer |
Hardware Vendor: | Xitron Technologies, Inc. |
Model: | 2801 |
Serial Number: | 28011109005 |
Input Connection: | RS232 via USB-adapter |
Metrology Institute: | NIST |
Calibration By: | Micro Precision Calibration, Inc. |
Calibration Label: | 220081222038459 |
Calibration Date: | 02.20.2014 |
PTDaemon Version: | 1.6.2 (372e138a; 2013-12-04) |
Setup Description: | connected to the single power supply that powers the system |
Current Ranges Used: | 2.0A |
Voltage Range Used: | 135V |
Temperature Meter | |
---|---|
Temperature Meter: | Temperature Meter |
Hardware Vendor: | Digi |
Model: | DigiWATCHPORT_H |
Serial Number: | WS34682143 |
Input Connection: | USB |
PTDaemon Version: | 1.6.2 (372e138a; 2013-12-04) |
Setup Description: | Position 5mm above intake fan |
Benchmark | Seconds | Ratio | Energy (kJ) |
Maximum Power |
Average Power |
Energy Ratio |
Seconds | Ratio | Energy (kJ) |
Maximum Power |
Average Power |
Energy Ratio |
Seconds | Ratio | Energy (kJ) |
Maximum Power |
Average Power |
Energy Ratio |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||||||
303.ostencil | 45.8 | 3.17 | 16.8 | 384 | 367 | 3.15 | 45.7 | 3.17 | 16.6 | 374 | 364 | 3.18 | 45.8 | 3.17 | 16.8 | 375 | 366 | 3.16 |
304.olbm | 194 | 2.35 | 56.1 | 296 | 290 | 2.78 | 194 | 2.35 | 56.3 | 298 | 291 | 2.77 | 194 | 2.35 | 56.1 | 299 | 290 | 2.78 |
314.omriq | 354 | 2.70 | 131 | 375 | 369 | 2.92 | 354 | 2.70 | 131 | 375 | 371 | 2.91 | 354 | 2.70 | 132 | 376 | 372 | 2.90 |
350.md | 90.6 | 2.78 | 31.6 | 354 | 349 | 2.97 | 90.7 | 2.78 | 31.7 | 367 | 349 | 2.97 | 90.7 | 2.78 | 31.6 | 363 | 349 | 2.97 |
351.palm | 184 | 2.01 | 42.0 | 238 | 228 | 2.54 | 189 | 1.96 | 43.1 | 237 | 228 | 2.48 | 184 | 2.01 | 42.0 | 241 | 228 | 2.55 |
352.ep | 330 | 1.60 | 83.8 | 275 | 254 | 1.96 | 330 | 1.61 | 83.9 | 256 | 254 | 1.96 | 330 | 1.61 | 83.5 | 255 | 253 | 1.97 |
353.clvrleaf | 145 | 3.07 | 46.0 | 322 | 317 | 3.42 | 145 | 3.07 | 45.9 | 322 | 317 | 3.43 | 145 | 3.07 | 45.8 | 325 | 316 | 3.44 |
354.cg | 144 | 2.84 | 41.5 | 328 | 289 | 3.37 | 144 | 2.84 | 41.5 | 328 | 289 | 3.37 | 144 | 2.84 | 41.4 | 326 | 288 | 3.38 |
355.seismic | 133 | 2.79 | 38.2 | 298 | 288 | 3.38 | 133 | 2.79 | 38.1 | 298 | 287 | 3.39 | 133 | 2.79 | 38.1 | 297 | 287 | 3.39 |
356.sp | 118 | 2.35 | 33.9 | 292 | 288 | 2.74 | 118 | 2.35 | 33.8 | 291 | 288 | 2.74 | 118 | 2.35 | 33.8 | 291 | 287 | 2.74 |
357.csp | 143 | 1.89 | 39.3 | 286 | 275 | 2.24 | 143 | 1.89 | 39.1 | 285 | 274 | 2.25 | 143 | 1.89 | 39.2 | 288 | 275 | 2.25 |
359.miniGhost | 132 | 2.80 | 37.0 | 304 | 281 | 3.21 | 132 | 2.80 | 36.8 | 304 | 280 | 3.23 | 132 | 2.80 | 36.8 | 302 | 280 | 3.23 |
360.ilbdc | 97.3 | 3.77 | 29.7 | 315 | 305 | 4.37 | 97.3 | 3.77 | 29.6 | 314 | 305 | 4.38 | 97.4 | 3.77 | 29.7 | 314 | 305 | 4.38 |
363.swim | 88.7 | 2.59 | 23.5 | 266 | 265 | 3.23 | 88.4 | 2.60 | 23.4 | 265 | 264 | 3.25 | 88.4 | 2.60 | 23.4 | 265 | 264 | 3.25 |
370.bt | 75.7 | 2.95 | 21.3 | 284 | 281 | 3.56 | 75.7 | 2.95 | 21.1 | 281 | 279 | 3.58 | 75.7 | 2.95 | 21.1 | 282 | 279 | 3.58 |
Benchmark | Seconds | Ratio | Energy (kJ) |
Maximum Power |
Average Power |
Energy Ratio |
Seconds | Ratio | Energy (kJ) |
Maximum Power |
Average Power |
Energy Ratio |
Seconds | Ratio | Energy (kJ) |
Maximum Power |
Average Power |
Energy Ratio |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||||||
303.ostencil | 45.7 | 3.17 | 16.8 | 373 | 366 | 3.16 | 45.7 | 3.17 | 16.8 | 373 | 368 | 3.15 | 45.7 | 3.17 | 16.7 | 374 | 364 | 3.18 |
304.olbm | 194 | 2.35 | 56.2 | 303 | 290 | 2.78 | 194 | 2.35 | 56.0 | 295 | 289 | 2.79 | 194 | 2.35 | 56.0 | 295 | 289 | 2.79 |
314.omriq | 354 | 2.70 | 131 | 375 | 370 | 2.92 | 354 | 2.70 | 131 | 399 | 371 | 2.92 | 354 | 2.70 | 131 | 391 | 370 | 2.92 |
350.md | 86.7 | 2.91 | 31.0 | 364 | 358 | 3.03 | 86.7 | 2.91 | 31.1 | 365 | 359 | 3.02 | 86.7 | 2.91 | 31.0 | 363 | 358 | 3.03 |
351.palm | 145 | 2.54 | 34.7 | 256 | 238 | 3.09 | 145 | 2.55 | 34.6 | 251 | 239 | 3.09 | 145 | 2.55 | 34.5 | 251 | 238 | 3.10 |
352.ep | 330 | 1.60 | 83.6 | 255 | 253 | 1.97 | 330 | 1.61 | 83.8 | 272 | 254 | 1.96 | 330 | 1.60 | 83.9 | 273 | 254 | 1.96 |
353.clvrleaf | 145 | 3.07 | 45.9 | 322 | 316 | 3.43 | 145 | 3.07 | 45.6 | 322 | 315 | 3.45 | 145 | 3.07 | 45.7 | 321 | 315 | 3.45 |
354.cg | 144 | 2.84 | 41.5 | 328 | 288 | 3.38 | 144 | 2.84 | 41.5 | 339 | 289 | 3.38 | 144 | 2.84 | 41.7 | 328 | 290 | 3.36 |
355.seismic | 133 | 2.79 | 38.1 | 297 | 287 | 3.39 | 133 | 2.79 | 38.2 | 297 | 288 | 3.38 | 133 | 2.79 | 38.2 | 304 | 288 | 3.38 |
356.sp | 118 | 2.35 | 33.7 | 291 | 287 | 2.75 | 118 | 2.35 | 33.8 | 291 | 288 | 2.74 | 118 | 2.35 | 33.7 | 292 | 287 | 2.75 |
357.csp | 143 | 1.89 | 39.2 | 278 | 274 | 2.25 | 143 | 1.89 | 39.1 | 278 | 274 | 2.25 | 143 | 1.89 | 39.2 | 278 | 275 | 2.24 |
359.miniGhost | 121 | 3.04 | 35.1 | 336 | 289 | 3.39 | 121 | 3.04 | 35.1 | 339 | 289 | 3.39 | 121 | 3.04 | 35.2 | 333 | 290 | 3.38 |
360.ilbdc | 97.3 | 3.77 | 29.6 | 312 | 304 | 4.40 | 97.3 | 3.77 | 29.6 | 326 | 304 | 4.40 | 97.3 | 3.77 | 29.6 | 312 | 305 | 4.38 |
363.swim | 57.2 | 4.02 | 17.2 | 302 | 300 | 4.42 | 57.2 | 4.02 | 17.1 | 300 | 298 | 4.45 | 57.3 | 4.02 | 17.1 | 300 | 299 | 4.43 |
370.bt | 75.7 | 2.95 | 21.2 | 283 | 280 | 3.56 | 75.7 | 2.95 | 21.2 | 283 | 280 | 3.56 | 75.8 | 2.94 | 21.5 | 291 | 283 | 3.52 |
Sysinfo program /local/home/SPECACCEL/Docs/sysinfo $Rev: 6874 $ $Date:: 2013-11-20 #$ 0953404ef7e75a5f9bbb534c6de3f831 running on sbe02 Mon Feb 24 17:06:03 2014 This section contains SUT (System Under Test) info as seen by some common utilities. To remove or add to this section, see: http://www.spec.org/accel/Docs/config.html#sysinfo From /proc/cpuinfo model name : Intel(R) Core(TM) i7-3930K CPU @ 3.20GHz 1 "physical id"s (chips) 12 "processors" cores, siblings (Caution: counting these is hw and system dependent. The following excerpts from /proc/cpuinfo might not be reliable. Use with caution.) cpu cores : 6 siblings : 12 physical 0: cores 0 1 2 3 4 5 cache size : 12288 KB From /proc/meminfo MemTotal: 8130700 kB HugePages_Total: 0 Hugepagesize: 2048 kB /usr/bin/lsb_release -d Red Hat Enterprise Linux Server release 6.4 (Santiago) From /etc/*release* /etc/*version* redhat-release: Red Hat Enterprise Linux Server release 6.4 (Santiago) system-release: Red Hat Enterprise Linux Server release 6.4 (Santiago) system-release-cpe: cpe:/o:redhat:enterprise_linux:6server:ga:server uname -a: Linux sbe02 2.6.32-358.el6.x86_64 #1 SMP Tue Jan 29 11:47:41 EST 2013 x86_64 x86_64 x86_64 GNU/Linux run-level 3 Feb 24 15:50 SPEC is set to: /local/home/SPECACCEL Filesystem Type Size Used Avail Use% Mounted on /dev/mapper/VolGroup-lv_home ext4 860G 52G 765G 7% /local Additional information from dmidecode: Warning: Use caution when you interpret this section. The 'dmidecode' program reads system data which is "intended to allow hardware to be accurately determined", but the intent may not be met, as there are frequent changes to hardware, firmware, and the "DMTF SMBIOS" standard. (End of data from sysinfo program) Information from pgaccelinfo CUDA Driver Version: 5050 NVRM version: NVIDIA UNIX x86_64 Kernel Module 319.60 Wed Sep 25 14:28:26 PDT 2013 Device Number: 0 Device Name: Tesla K40c Device Revision Number: 3.5 Global Memory Size: 12079136768 Number of Multiprocessors: 15 Number of SP Cores: 2880 Number of DP Cores: 960 Concurrent Copy and Execution: Yes Total Constant Memory: 65536 Total Shared Memory per Block: 49152 Registers per Block: 65536 Warp Size: 32 Maximum Threads per Block: 1024 Maximum Block Dimensions: 1024, 1024, 64 Maximum Grid Dimensions: 2147483647 x 65535 x 65535 Maximum Memory Pitch: 2147483647B Texture Alignment: 512B Clock Rate: 875 MHz Max. Clock Rate: 875 MHz Execution Timeout: No Integrated Device: No Can Map Host Memory: Yes Compute Mode: default Concurrent Kernels: Yes ECC Enabled: Yes Memory Clock Rate: 3004 MHz Memory Bus Width: 384 bits L2 Cache Size: 1572864 bytes Max Threads Per SMP: 2048 Async Engines: 2 Unified Addressing: Yes
GPU Boost mode enabled by setting the device to persistant mode: "nvidia-smi -pm 1" and then setting the memory and graphic clock using: "nvidia-smi -ac <MEM>,<GRH>". For this run, the memory clock was not changed from the default 3004 MHz. The graphic clock was set to the maximum frequency of 875 MHz. nvidai-smi -ac 3004,875 Kit built system using a CoolMaster HAF X case
pgcc |
pgfortran |
pgcc pgfortran |
-fast -Mfprelaxed -acc -ta=tesla:cc35 -ta=tesla:cuda5.5 |
-fast -Mfprelaxed -acc -ta=tesla:cc35 -ta=tesla:cuda5.5 |
353.clvrleaf: | -fast -Mfprelaxed -acc -ta=tesla:cc35 -ta=tesla:cuda5.5 |
359.miniGhost: | -fast -Mfprelaxed -acc -ta=tesla:cc35 -ta=tesla:cuda5.5 -Mnomain |
pgcc |
pgfortran |
pgcc pgfortran |
351.palm: | -DSPEC_HOST_FFTW3 |
-fast -Mfprelaxed -acc -ta=tesla:cc35 -ta=tesla:cuda5.5 |
350.md: | -fast -Mfprelaxed -acc -ta=tesla:cc35 -ta=tesla:cuda5.5 -ta=tesla:maxregcount:48 |
351.palm: | -fast -Mfprelaxed -acc=noautopar -ta=tesla:cc35 -ta=tesla:cuda5.5 -ta=tesla:fastmath -lfftw3 |
355.seismic: | -fast -Mfprelaxed -acc -ta=tesla:cc35 -ta=tesla:cuda5.5 |
356.sp: | Same as 355.seismic |
360.ilbdc: | Same as 355.seismic |
363.swim: | -fast -Mfprelaxed -acc -ta=tesla:cc35 -ta=tesla:cuda5.5 -ta=tesla:pin |
353.clvrleaf: | -fast -Mfprelaxed -acc -ta=tesla:cc35 -ta=tesla:cuda5.5 |
359.miniGhost: | -fast -Mfprelaxed -acc -ta=tesla:cc35 -ta=tesla:cuda5.5 -ta=tesla:maxregcount:32 -Mnomain |