| CPU2006 license: | 6 | Test date: | Jun-2008 |
|---|---|---|---|
| Test sponsor: | Sun Microsystems | Hardware Availability: | Jul-2008 |
| Tested by: | Sun Microsystems | Software Availability: | Jul-2008 |
| Hardware | |
|---|---|
| CPU Name: | SPARC64 VII |
| CPU Characteristics: | |
| CPU MHz: | 2400 |
| FPU: | Integrated |
| CPU(s) enabled: | 32 cores, 8 chips, 4 cores/chip, 2 threads/core |
| CPU(s) orderable: | 1 to 4 CMU; each CMU contains 2 CPU chips |
| Primary Cache: | 64 KB I + 64 KB D on chip per core |
| Secondary Cache: | 5 MB I+D on chip per chip |
| L3 Cache: | None |
| Other Cache: | None |
| Memory: | 128 GB (64 x 2 GB) |
| Disk Subsystem: | 158 GB RAID 0 Solaris Volume 3 x Seagate 73 GB 10000 RPM Stripe interlace 512 Kbytes |
| Other Hardware: | None |
| Software | |
|---|---|
| Operating System: | Solaris 10 5/08 with patch 137111-03 |
| Compiler: | Sun Studio 12 with patches 124867-06, 124861-07, 124863-05, 127000-05 (see patch information below) |
| Auto Parallel: | Yes |
| File System: | ufs |
| System State: | Default |
| Base Pointers: | 32-bit |
| Peak Pointers: | 32-bit |
| Other Software: | None |
| Benchmark | Base | Peak | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | |
| Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||
| 410.bwaves | 63 | 6040 | 142 | 6040 | 142 | 6040 | 142 | 32 | 3035 | 143 | 3036 | 143 | 3036 | 143 |
| 416.gamess | 63 | 3439 | 359 | 3434 | 359 | 3411 | 362 | 63 | 3211 | 384 | 3196 | 386 | 3208 | 384 |
| 433.milc | 63 | 6583 | 87.9 | 6585 | 87.8 | 6587 | 87.8 | 63 | 6563 | 88.1 | 6559 | 88.2 | 6558 | 88.2 |
| 434.zeusmp | 63 | 2315 | 248 | 2327 | 246 | 2332 | 246 | 63 | 2315 | 248 | 2327 | 246 | 2332 | 246 |
| 435.gromacs | 63 | 1149 | 391 | 1142 | 394 | 1147 | 392 | 63 | 1017 | 442 | 1037 | 434 | 1023 | 440 |
| 436.cactusADM | 63 | 2437 | 309 | 2444 | 308 | 2450 | 307 | 63 | 2437 | 309 | 2444 | 308 | 2450 | 307 |
| 437.leslie3d | 63 | 5017 | 118 | 5013 | 118 | 5022 | 118 | 32 | 2339 | 129 | 2339 | 129 | 2339 | 129 |
| 444.namd | 63 | 1143 | 442 | 1132 | 446 | 1141 | 443 | 63 | 1127 | 448 | 1127 | 448 | 1136 | 445 |
| 447.dealII | 63 | 1924 | 375 | 1889 | 382 | 1902 | 379 | 63 | 1858 | 388 | 1857 | 388 | 1852 | 389 |
| 450.soplex | 63 | 5667 | 92.7 | 5699 | 92.2 | 5662 | 92.8 | 32 | 2716 | 98.2 | 2717 | 98.2 | 2711 | 98.4 |
| 453.povray | 63 | 865 | 387 | 863 | 388 | 871 | 385 | 63 | 613 | 547 | 655 | 512 | 660 | 508 |
| 454.calculix | 63 | 1204 | 432 | 1195 | 435 | 1179 | 441 | 63 | 1204 | 432 | 1195 | 435 | 1179 | 441 |
| 459.GemsFDTD | 63 | 7208 | 92.7 | 7220 | 92.6 | 7215 | 92.6 | 32 | 3606 | 94.2 | 3598 | 94.4 | 3600 | 94.3 |
| 465.tonto | 63 | 2314 | 268 | 2311 | 268 | 2292 | 270 | 63 | 2220 | 279 | 2177 | 285 | 2198 | 282 |
| 470.lbm | 63 | 8800 | 98.4 | 8800 | 98.4 | 8800 | 98.4 | 1 | 97.1 | 142 | 97.1 | 142 | 97.1 | 141 |
| 481.wrf | 63 | 3641 | 193 | 3654 | 193 | 3672 | 192 | 32 | 1769 | 202 | 1774 | 202 | 1769 | 202 |
| 482.sphinx3 | 63 | 9138 | 134 | 9176 | 134 | 9337 | 132 | 63 | 8757 | 140 | 8752 | 140 | 8788 | 140 |
Sun Studio compiler patches are available at
http://developers.sun.com/sunstudio/downloads/patches/ss12_patches.jsp
Processes were assigned to specific processors using 'pbind' commands. The config file option 'submit' was used, along with a list of processors in the 'BIND' variable, to generate the pbind commands. (For details, please see the config file.)
Environment Variable Settings:
The maximum number of threads a program can create was set with:
OMP_NUM_THREADS=63
Program threads were bound to processors with:
SUNW_MP_PROCBIND="1-63"
Behavior of parallel threads was set with:
SUNW_MP_THR_IDLE=SPIN
SPIN specifies that an idle thread should spin while waiting at barrier
or waiting for new parallel regions to work on.
ulimit -s 131072 was used to limit the space consumed
by the stack (making more space available for the heap)
System Tunables (/etc/system parameters):
tune_t_fsflushr=10
Controls how many seconds elapse between runs of the
page flush daemon, fsflush.
autoup=600
Causes pages older than the listed number of seconds to
be written by fsflush.
bufhwm=3000
Memory byte limit for caching I/O buffers
segmap_percent=1
Set maximum percent memory for file system cache
lpg_alloc_prefer=1
Set lgroup page allocation to strongly prefer local pages
Other System Settings:
The webconsole service was turned off using
svcadm disable webconsole
Memory is 8-way interleaved by filling all slots with the same capacity DIMMs. This result is measured on a Sun SPARC Enterprise M5000 Server. Note that the Sun SPARC Enterprise M5000 and Fujitsu SPARC Enterprise M5000 are electrically equivalent.
| cc |
| CC |
| f90 |
| cc f90 |
| -fast -fma=fused -xipo=2 -xpagesize=4M -xprefetch_level=1 -xalias_level=std -xprefetch_auto_type=indirect_array_access |
| -xdepend -library=stlport4 -fast -fma=fused -xipo=2 -xpagesize=4M -xprefetch_level=1 -xalias_level=compatible |
| -fast -fma=fused -xipo=2 -xpagesize=4M -xprefetch_level=1 |
| -fast(cc) -fast(f90) -fma=fused -xipo=2 -xpagesize=4M -xprefetch_level=1 -xalias_level=std -xprefetch_auto_type=indirect_array_access |
| -xjobs=16 -V -# |
| -xjobs=16 -verbose=diags,version |
| -xjobs=16 -V -v |
| -xjobs=16 -V -# -v |
| cc |
| CC |
| f90 |
| cc f90 |
| 444.namd: | -xdepend -library=stlport4 -fast -xpagesize=4M -xalias_level=compatible -xprefetch_level=1 -fma=fused |
| 447.dealII: | -xdepend -library=stlport4 -xprofile=collect:./feedback(pass 1) -xprofile=use:./feedback(pass 2) -fast -xpagesize=4M -xalias_level=compatible -xipo=2 -xrestrict -fma=fused |
| 450.soplex: | -xdepend -library=stlport4 -xprofile=collect:./feedback(pass 1) -xprofile=use:./feedback(pass 2) -fast -xpagesize=4M -xalias_level=compatible -xipo=2 -xprefetch=no -fsimple=0 -xrestrict |
| 453.povray: | Same as 447.dealII |
| 410.bwaves: | -fast -xpagesize=4M -xipo=2 -xprefetch_level=2 -fma=fused |
| 416.gamess: | -xprofile=collect:./feedback(pass 1) -xprofile=use:./feedback(pass 2) -fast -xpagesize=4M -xipo=2 -xprefetch_level=2 -fma=fused |
| 434.zeusmp: | basepeak = yes |
| 437.leslie3d: | -fast -xpagesize=4M -xprefetch=no |
| 459.GemsFDTD: | -fast -xpagesize=4M -fsimple=1 -xprefetch=no -fma=fused |
| 465.tonto: | -fast -xpagesize=4M -xipo=2 -lfast -ll2amm |
| 435.gromacs: | -xprofile=collect:./feedback(pass 1) -xprofile=use:./feedback(pass 2) -fast(cc) -fast(f90) -xpagesize=4M -xipo=2 -xinline= -xchip=generic -fsimple=0 -fma=fused |
| 436.cactusADM: | basepeak = yes |
| 454.calculix: | basepeak = yes |
| 481.wrf: | -xprofile=collect:./feedback(pass 1) -xprofile=use:./feedback(pass 2) -fast(cc) -fast(f90) -xpagesize=4M -xipo=2 -xprefetch_level=2 |