SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

IBM Corporation

IBM System x3755 (AMD Opteron 8382)

SPECfp®2006 = 21.6

CPU2006 license: 11 Test date: Dec-2008
Test sponsor: IBM Corporation Hardware Availability: Mar-2009
Tested by: Advanced Micro Devices Software Availability: May-2008
Benchmark results graph
Hardware
CPU Name: AMD Opteron 8382
CPU Characteristics:
CPU MHz: 2600
FPU: Integrated
CPU(s) enabled: 16 cores, 4 chips, 4 cores/chip
CPU(s) orderable: 1,2,3,4 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 6 MB I+D on chip per chip
Other Cache: None
Memory: 64 GB (16 x 4 GB, DDR2-667 CL5 Reg Dual Rank)
Disk Subsystem: 1 x 73.4 GB SAS, 15000 RPM
Other Hardware: None
Software
Operating System: SuSE Linux Enterprise Server 10 (x86_64) SP1,
Kernel 2.6.16.46-0.12-smp
Compiler: PGI Server Complete Version 7.2
Auto Parallel: Yes
File System: ReiserFS
System State: Run level 3 (Full multiuser with network)
Base Pointers: 32/64-bit
Peak Pointers: 64-bit
Other Software: binutils 2.18.50

Results Table

Benchmark Base Peak
Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 250   54.4 267   51.0 246   55.1 250   54.4 267   51.0 246   55.1
416.gamess 1240   15.8 1238   15.8 1246   15.7 1175   16.7 1177   16.6 1177   16.6
433.milc 597   15.4 603   15.2 596   15.4 585   15.7 589   15.6 584   15.7
434.zeusmp 655   13.9 657   13.9 658   13.8 623   14.6 578   15.7 586   15.5
435.gromacs 484   14.7 482   14.8 484   14.8 400   17.9 400   17.8 402   17.8
436.cactusADM 95.7 125   93.3 128   94.9 126   93.5 128   91.6 130   93.4 128  
437.leslie3d 640   14.7 641   14.7 808   11.6 589   16.0 725   13.0 675   13.9
444.namd 636   12.6 636   12.6 637   12.6 554   14.5 555   14.4 555   14.4
447.dealII 607   18.8 609   18.8 607   18.8 549   20.8 549   20.8 550   20.8
450.soplex 733   11.4 734   11.4 733   11.4 733   11.4 734   11.4 733   11.4
453.povray 324   16.4 323   16.4 323   16.5 302   17.6 303   17.6 302   17.6
454.calculix 500   16.5 499   16.5 503   16.4 405   20.4 405   20.4 405   20.4
459.GemsFDTD 365   29.1 363   29.3 368   28.8 365   29.1 363   29.3 368   28.8
465.tonto 637   15.4 639   15.4 637   15.5 563   17.5 564   17.4 567   17.4
470.lbm 504   27.2 505   27.2 507   27.1 504   27.2 505   27.2 507   27.1
481.wrf 522   21.4 521   21.4 521   21.4 582   19.2 584   19.1 585   19.1
482.sphinx3 1135   17.2 1070   18.2 1001   19.5 922   21.1 921   21.2 921   21.2

Submit Notes

The config file option 'submit' was used.
 'numactl' was used to bind copies to the cores.

Operating System Notes

 Environment stack size set to 'unlimited'.
 The powersaved was disabled, set the CPU frequency to its maximum.
 Total number of huge pages available is 14336.
 'ulimit -l 2097152' was used to set environment locked pages in memory quantity.
 Set vm/nr_hugepages=14336 in /etc/sysctl.conf
 mount -t hugetlbfs nodev /mnt/hugepages

General Notes

Environment variables set by runspec before the start of the run:
LD_LIBRARY_PATH = "/root/work/cpu2006v1.1/pgi72/linux_lib64:/root/work/cpu2006v1.1/pgi72/linux_lib32"
NCPUS = "16"

Base Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mconcur   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mconcur   --zc_eh   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Fortran benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Msmartalloc=huge   -Mconcur   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mconcur   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Base Other Flags

C benchmarks:

 -Mipa=jobs:8 

C++ benchmarks:

 -Mipa=jobs:8 

Fortran benchmarks:

 -Mipa=jobs:8 

Benchmarks using both Fortran and C:

 -Mipa=jobs:8 

Peak Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Msafeptr   -Mconcur   -Mfprelaxed   -Mipa=inline   -Mipa=arg   -Mipa=const   -Mipa=ptr   -Mipa=shape   -tp barcelona-64   -Bstatic_pgi 
470.lbm:  basepeak = yes 
482.sphinx3:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Msmartalloc   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

444.namd:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Munroll=n:4   -Munroll=m:8   -Msmartalloc=huge   -Mnodepchk   -Mfprelaxed   --zc_eh   -tp barcelona-64   -Bstatic_pgi 
447.dealII:  -Mvect=cachesize:6291456   -fastsse   -alias=ansi   -Msmartalloc=huge   -Mprefetch=t0   -Mnovect   -Mfprelaxed   --zc_eh   -Mipa=fast   -Mipa=inline   -tp barcelona-32   -Bstatic_pgi 
450.soplex:  basepeak = yes 
453.povray:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inlinenopfo:3(pass 2)   -Mipa=staticfunc(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mprefetch=t0   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 

Fortran benchmarks:

410.bwaves:  basepeak = yes 
416.gamess:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mvect=noaltcode   -Mprefetch=t0   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
434.zeusmp:  -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Mconcur   -Mprefetch=distance:8   -Mprefetch=t0   -Msmartalloc=huge   -Msmartalloc=hugebss   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
437.leslie3d:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mconcur=noaltcode(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Mvect=fuse   -Msmartalloc=huge   -Mprefetch=distance:8   -Mprefetch=t0   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
459.GemsFDTD:  basepeak = yes 
465.tonto:  -Mvect=cachesize:6291456   -fastsse   -O4   -Mvect=noaltcode   -Msmartalloc=huge   -Mprefetch=distance:8   -Mprefetch=t0   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

435.gromacs:  -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mconcur   -Mfpapprox=rsqrt   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
436.cactusADM:  -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mconcur   -Mdse   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
454.calculix:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mloop32   -Mprefetch=t0   -Mpre   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
481.wrf:  -Mvect=cachesize:6291456   -fastsse   -Mvect=noaltcode   -Msmartalloc=huge   -Mprefetch=distance:8   -Mconcur=noaltcode   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 

Peak Other Flags

C benchmarks:

 -Mipa=jobs:8(pass 2) 

C++ benchmarks:

 -Mipa=jobs:8(pass 2) 

Fortran benchmarks:

 -Mipa=jobs:8 

Benchmarks using both Fortran and C (except as noted below):

 -Mipa=jobs:8(pass 2) 
481.wrf:  No flags used 

The flags file that was used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/pgi72_linux_flags.20090713.html.

You can also download the XML flags source by saving the following link:
http://www.spec.org/cpu2006/flags/pgi72_linux_flags.20090713.xml.