CPU2006 license: | 11 | Test date: | Jun-2008 |
---|---|---|---|
Test sponsor: | IBM Corporation | Hardware Availability: | Jul-2008 |
Tested by: | Advanced Micro Devices | Software Availability: | Jun-2008 |
Hardware | |
---|---|
CPU Name: | AMD Opteron 2352 |
CPU Characteristics: | |
CPU MHz: | 2100 |
FPU: | Integrated |
CPU(s) enabled: | 8 cores, 2 chips, 4 cores/chip |
CPU(s) orderable: | 1,2 chips |
Primary Cache: | 64 KB I + 64 KB D on chip per core |
Secondary Cache: | 512 KB I+D on chip per core |
L3 Cache: | 2 MB I+D on chip per chip |
Other Cache: | None |
Memory: | 16 GB (8 x 2 GB, DDR2-667 CL5 Reg Dual Rank) |
Disk Subsystem: | 1 x 160 GB SATA, 7200 RPM |
Other Hardware: | None |
Software | |
---|---|
Operating System: | SuSE Linux Enterprise Server 10 (x86_64) SP1, Kernel 2.6.16.46-0.12-smp |
Compiler: | PGI Server Complete Version 7.2; PathScale Compiler Suite Version 3.2 |
Auto Parallel: | No |
File System: | ReiserFS |
System State: | Run level 3 (Full multiuser with network) |
Base Pointers: | 64-bit |
Peak Pointers: | 32/64-bit |
Other Software: | None |
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

Benchmark | Base Copies | Base Seconds (run 1) | Base Ratio (run 1) | Base Seconds (run 2) | Base Ratio (run 2) | Base Seconds (run 3) | Base Ratio (run 3) | Peak Copies | Peak Seconds (run 1) | Peak Ratio (run 1) | Peak Seconds (run 2) | Peak Ratio (run 2) | Peak Seconds (run 3) | Peak Ratio (run 3) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
410.bwaves | 8 | 1479 | 73.5 | 1472 | 73.9 | 1472 | 73.9 | 8 | 1354 | 80.3 | 1356 | 80.2 | 1358 | 80.0 |
416.gamess | 8 | 1520 | 103 | 1519 | 103 | 1521 | 103 | 8 | 1404 | 112 | 1409 | 111 | 1399 | 112 |
433.milc | 8 | 1220 | 60.2 | 1220 | 60.2 | 1222 | 60.1 | 8 | 1200 | 61.2 | 1201 | 61.2 | 1201 | 61.2 |
434.zeusmp | 8 | 869 | 83.8 | 878 | 82.9 | 872 | 83.4 | 8 | 869 | 83.8 | 878 | 82.9 | 872 | 83.4 |
435.gromacs | 8 | 688 | 83.0 | 689 | 83.0 | 688 | 83.0 | 8 | 565 | 101 | 566 | 101 | 565 | 101 |
436.cactusADM | 8 | 1082 | 88.3 | 1100 | 86.9 | 1088 | 87.9 | 8 | 972 | 98.4 | 963 | 99.3 | 976 | 98.0 |
437.leslie3d | 8 | 1488 | 50.5 | 1489 | 50.5 | 1489 | 50.5 | 8 | 1380 | 54.5 | 1379 | 54.5 | 1381 | 54.5 |
444.namd | 8 | 828 | 77.5 | 828 | 77.5 | 827 | 77.6 | 8 | 727 | 88.3 | 725 | 88.5 | 726 | 88.4 |
447.dealII | 8 | 857 | 107 | 862 | 106 | 860 | 106 | 8 | 617 | 148 | 609 | 150 | 607 | 151 |
450.soplex | 8 | 1344 | 49.6 | 1244 | 53.6 | 1244 | 53.6 | 8 | 1263 | 52.8 | 1233 | 54.1 | 1221 | 54.6 |
453.povray | 8 | 400 | 107 | 401 | 106 | 400 | 106 | 8 | 346 | 123 | 345 | 124 | 345 | 123 |
454.calculix | 8 | 620 | 106 | 616 | 107 | 616 | 107 | 8 | 524 | 126 | 525 | 126 | 527 | 125 |
459.GemsFDTD | 8 | 1849 | 45.9 | 1847 | 46.0 | 1847 | 45.9 | 8 | 1654 | 51.3 | 1654 | 51.3 | 1653 | 51.3 |
465.tonto | 8 | 892 | 88.2 | 891 | 88.3 | 891 | 88.3 | 8 | 763 | 103 | 759 | 104 | 756 | 104 |
470.lbm | 8 | 2238 | 49.1 | 2294 | 47.9 | 2236 | 49.2 | 8 | 2222 | 49.5 | 2222 | 49.5 | 2224 | 49.4 |
481.wrf | 8 | 1062 | 84.2 | 1072 | 83.4 | 1066 | 83.8 | 8 | 1003 | 89.1 | 1003 | 89.1 | 1013 | 88.2 |
482.sphinx3 | 8 | 1808 | 86.2 | 1799 | 86.7 | 1798 | 86.7 | 8 | 1704 | 91.5 | 1690 | 92.3 | 1688 | 92.4 |
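The results note says the median of the three runs is the reported measurement. As a minimal sketch, assuming the median is simply the middle of the three listed ratios, the following recomputes it for a few rows copied from the table and derives the base-to-peak gain. The helper names `median_ratio` and `peak_gain_percent` are illustrative and not part of any SPEC tooling.

```python
from statistics import median

# (seconds, ratio) for the three runs of a few benchmarks, copied from the table.
base_runs = {
    "410.bwaves": [(1479, 73.5), (1472, 73.9), (1472, 73.9)],
    "450.soplex": [(1344, 49.6), (1244, 53.6), (1244, 53.6)],
    "470.lbm": [(2238, 49.1), (2294, 47.9), (2236, 49.2)],
}
peak_runs = {
    "410.bwaves": [(1354, 80.3), (1356, 80.2), (1358, 80.0)],
    "450.soplex": [(1263, 52.8), (1233, 54.1), (1221, 54.6)],
    "470.lbm": [(2222, 49.5), (2222, 49.5), (2224, 49.4)],
}


def median_ratio(runs):
    """Return the middle ratio of the listed runs (hypothetical helper)."""
    return median(ratio for _seconds, ratio in runs)


def peak_gain_percent(name):
    """Percentage improvement of the median peak ratio over the median base ratio."""
    base = median_ratio(base_runs[name])
    peak = median_ratio(peak_runs[name])
    return 100.0 * (peak / base - 1.0)


for name in base_runs:
    print(name, median_ratio(base_runs[name]), median_ratio(peak_runs[name]),
          round(peak_gain_percent(name), 1))
```

Taking the median rather than the best run keeps a single outlier, such as the slow first base run of 450.soplex, from affecting the reported figure.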
'numactl' was used to bind copies to the cores
'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152' was used to set environment locked pages in memory limit
Environment variable PGI_HUGE_PAGES set to 150
Set vm/nr_hugepages=1200 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages
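The huge page numbers in these notes appear to be sized to the run. The sketch below checks that arithmetic under two assumptions that the report does not state: that PGI_HUGE_PAGES is a per-process limit (one process per benchmark copy), and that huge pages are 2 MiB, the usual x86_64 size.

```python
# Values taken from the notes and the results table.
COPIES = 8             # copies run per benchmark
PGI_HUGE_PAGES = 150   # environment variable setting
NR_HUGEPAGES = 1200    # vm/nr_hugepages in /etc/sysctl.conf

# Assumption: 2 MiB huge pages; the report does not give the page size.
HUGE_PAGE_MIB = 2


def pages_needed(copies, pages_per_copy):
    """Huge pages needed if every copy uses its full per-process allowance."""
    return copies * pages_per_copy


def pool_size_mib(nr_hugepages, page_mib):
    """Total memory reserved for the huge page pool, in MiB."""
    return nr_hugepages * page_mib


print(pages_needed(COPIES, PGI_HUGE_PAGES))        # 1200
print(pool_size_mib(NR_HUGEPAGES, HUGE_PAGE_MIB))  # 2400
```

Under those assumptions the pool of 1200 pages covers exactly 8 copies at 150 pages each, which is about 2.3 GiB of the 16 GB installed.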
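Base Compiler Invocation | |
---|---|
C benchmarks: | pgcc |
C++ benchmarks: | pgcpp |
Fortran benchmarks: | pgf95 |
Benchmarks using both Fortran and C: | pgcc pgf95 |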
Base Portability Flags | |
---|---|
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -Mnomain |
436.cactusADM: | -DSPEC_CPU_LP64 -Mnomain |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
447.dealII: | -DSPEC_CPU_LP64 |
450.soplex: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -Mnomain |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
Base Optimization Flags | |
---|---|
C benchmarks: | -fastsse -Msmartalloc=huge:150 -Mfprelaxed -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi |
C++ benchmarks: | -fastsse -Msmartalloc=huge:150 -Mfprelaxed --zc_eh -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi |
Fortran benchmarks: | -fastsse -Mfprelaxed -Msmartalloc=huge:150 -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi |
Benchmarks using both Fortran and C: | -fastsse -Msmartalloc=huge:150 -Mfprelaxed -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi |
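Base Other Flags | |
---|---|
C benchmarks: | -Mipa=jobs:4 |
C++ benchmarks: | -Mipa=jobs:4 |
Fortran benchmarks: | -Mipa=jobs:4 |
Benchmarks using both Fortran and C: | -Mipa=jobs:4 |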
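Peak Compiler Invocation | |
---|---|
C benchmarks (except as noted below): | pgcc |
470.lbm: | pathcc |
C++ benchmarks (except as noted below): | pathCC |
444.namd: | pgcpp |
Fortran benchmarks (except as noted below): | pgf95 |
416.gamess: | pathf95 |
459.GemsFDTD: | pathf95 |
465.tonto: | pathf95 |
Benchmarks using both Fortran and C (except as noted below): | pgcc pgf95 |
436.cactusADM: | pathcc pathf95 |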
Peak Portability Flags | |
---|---|
410.bwaves: | -DSPEC_CPU_LP64 |
416.gamess: | -DSPEC_CPU_LP64 |
433.milc: | -DSPEC_CPU_LP64 |
434.zeusmp: | -DSPEC_CPU_LP64 |
435.gromacs: | -DSPEC_CPU_LP64 -Mnomain |
436.cactusADM: | -DSPEC_CPU_LP64 -fno-second-underscore |
437.leslie3d: | -DSPEC_CPU_LP64 |
444.namd: | -DSPEC_CPU_LP64 |
453.povray: | -DSPEC_CPU_LP64 |
454.calculix: | -DSPEC_CPU_LP64 -Mnomain |
459.GemsFDTD: | -DSPEC_CPU_LP64 |
465.tonto: | -DSPEC_CPU_LP64 |
470.lbm: | -DSPEC_CPU_LP64 |
481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
482.sphinx3: | -DSPEC_CPU_LP64 |
Peak Optimization Flags | |
---|---|
433.milc: | -fastsse -Msmartalloc=huge:150 -Msafeptr -Mfprelaxed -Mipa=inline -Mipa=arg -Mipa=const -Mipa=ptr -Mipa=shape -tp barcelona-64 -Bstatic_pgi |
470.lbm: | -march=barcelona -Ofast -CG:sse_cse_regs=0 -CG:locs_shallow_depth=1 -m3dnow |
482.sphinx3: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Mfprelaxed -Msmartalloc -tp barcelona-64 -Bstatic_pgi |
444.namd: | -Mpfi(pass 1) -Mpfo(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Munroll=n:4 -Munroll=m:8 -Msmartalloc=huge:150 -Mnodepchk -Mfprelaxed --zc_eh -tp barcelona-64 -Bstatic_pgi |
447.dealII: | -march=barcelona -Ofast -static -INLINE:aggressive=on -fno-exceptions -m32 |
450.soplex: | -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O3 -TENV:frame_pointer=off -LNO:prefetch=1 -OPT:malloc_alg=1 -CG:load_exe=0 -m32 |
453.povray: | -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast |
410.bwaves: | -Mpfi(pass 1) -Mpfo(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Msmartalloc -Mprefetch=distance:12 -Mprefetch=nta -Mpre -Mfprelaxed -tp barcelona-64 -Bstatic_pgi |
416.gamess: | -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O2 -OPT:Ofast -OPT:ro=3 -OPT:unroll_size=256 |
434.zeusmp: | basepeak = yes |
437.leslie3d: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Mvect=fuse -Msmartalloc=huge:150 -Mprefetch=distance:8 -Mprefetch=t0 -Mfprelaxed -tp barcelona-64 -Bstatic_pgi |
459.GemsFDTD: | -march=barcelona -Ofast -LNO:fission=2 -LNO:simd=2 -LNO:prefetch_ahead=1 -CG:load_exe=0 |
465.tonto: | -march=barcelona -Ofast -OPT:alias=no_f90_pointer_alias -LNO:blocking=off -CG:load_exe=1 -IPA:plimit=525 |
435.gromacs: | -fastsse -Msmartalloc=huge:150 -Mfprelaxed -Mfpapprox=rsqrt -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi |
436.cactusADM: | -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:blocking=off |
454.calculix: | -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -fastsse -Msmartalloc=huge:150 -Mprefetch=t0 -Mpre -Mfprelaxed -tp barcelona-64 -Bstatic_pgi |
481.wrf: | -fastsse -Mvect=noaltcode -Msmartalloc -Mprefetch=distance:8 -Mfprelaxed -tp barcelona-64 -Bstatic_pgi |
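Peak Other Flags | |
---|---|
C benchmarks (except as noted below): | -Mipa=jobs:4(pass 2) |
470.lbm: | No flags used |
C++ benchmarks: | |
444.namd: | -Mipa=jobs:4(pass 2) |
Fortran benchmarks (except as noted below): | -Mipa=jobs:4(pass 2) |
416.gamess: | No flags used |
459.GemsFDTD: | No flags used |
465.tonto: | No flags used |
Benchmarks using both Fortran and C (except as noted below): | -Mipa=jobs:4(pass 2) |
436.cactusADM: | No flags used |
481.wrf: | No flags used |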