SPEC(R) CFP2006 Summary Tyan Transport TX46, AMD Opteron 8376 HE Test Sponsor: Advanced Micro Devices Sat Nov 22 08:24:16 2008 CPU2006 License: 49 Test date: Nov-2008 Test sponsor: Advanced Micro Devices Hardware availability: Jan-2009 Tested by: Advanced Micro Devices Software availability: Jun-2008 Base Base Base Peak Peak Peak Benchmarks Copies Run Time Rate Copies Run Time Rate -------------- ------ --------- --------- ------ --------- --------- 410.bwaves 16 1619 134 S 16 1604 136 S 410.bwaves 16 1613 135 * 16 1613 135 S 410.bwaves 16 1606 135 S 16 1611 135 * 416.gamess 16 1369 229 S 16 1228 255 S 416.gamess 16 1367 229 * 16 1226 256 * 416.gamess 16 1367 229 S 16 1226 256 S 433.milc 16 1308 112 S 16 1308 112 S 433.milc 16 1313 112 S 16 1313 112 S 433.milc 16 1311 112 * 16 1311 112 * 434.zeusmp 16 801 182 S 16 739 197 S 434.zeusmp 16 798 182 S 16 740 197 * 434.zeusmp 16 798 182 * 16 743 196 S 435.gromacs 16 556 205 S 16 455 251 S 435.gromacs 16 556 206 S 16 454 251 * 435.gromacs 16 556 205 * 16 454 252 S 436.cactusADM 16 976 196 * 4 188 254 S 436.cactusADM 16 976 196 S 4 186 257 S 436.cactusADM 16 978 195 S 4 186 257 * 437.leslie3d 16 1557 96.6 S 16 1415 106 * 437.leslie3d 16 1557 96.6 * 16 1415 106 S 437.leslie3d 16 1558 96.5 S 16 1412 106 S 444.namd 16 723 177 * 16 631 204 S 444.namd 16 723 177 S 16 626 205 * 444.namd 16 722 178 S 16 625 205 S 447.dealII 16 752 243 S 16 576 318 S 447.dealII 16 754 243 * 16 586 312 S 447.dealII 16 758 241 S 16 586 312 * 450.soplex 16 1252 107 S 16 1137 117 S 450.soplex 16 1175 114 S 16 1074 124 * 450.soplex 16 1177 113 * 16 1068 125 S 453.povray 16 353 241 S 16 305 279 S 453.povray 16 352 242 S 16 306 278 * 453.povray 16 353 241 * 16 308 277 S 454.calculix 16 552 239 * 16 466 283 S 454.calculix 16 552 239 S 16 467 283 S 454.calculix 16 552 239 S 16 466 283 * 459.GemsFDTD 16 1593 107 * 16 1541 110 * 459.GemsFDTD 16 1591 107 S 16 1542 110 S 459.GemsFDTD 16 1596 106 S 16 1535 111 S 465.tonto 16 731 215 * 16 603 261 * 465.tonto 16 730 216 S 16 604 261 S 465.tonto 16 733 215 S 16 603 261 S 470.lbm 16 1899 116 S 16 1906 115 S 470.lbm 16 1891 116 S 16 1903 115 S 470.lbm 16 1894 116 * 16 1906 115 * 481.wrf 16 1024 175 * 16 999 179 * 481.wrf 16 1027 174 S 16 1000 179 S 481.wrf 16 1017 176 S 16 997 179 S 482.sphinx3 16 1650 189 S 16 1573 198 S 482.sphinx3 16 1653 189 * 16 1574 198 * 482.sphinx3 16 1659 188 S 16 1576 198 S ============================================================================== 410.bwaves 16 1613 135 * 16 1611 135 * 416.gamess 16 1367 229 * 16 1226 256 * 433.milc 16 1311 112 * 16 1311 112 * 434.zeusmp 16 798 182 * 16 740 197 * 435.gromacs 16 556 205 * 16 454 251 * 436.cactusADM 16 976 196 * 4 186 257 * 437.leslie3d 16 1557 96.6 * 16 1415 106 * 444.namd 16 723 177 * 16 626 205 * 447.dealII 16 754 243 * 16 586 312 * 450.soplex 16 1177 113 * 16 1074 124 * 453.povray 16 353 241 * 16 306 278 * 454.calculix 16 552 239 * 16 466 283 * 459.GemsFDTD 16 1593 107 * 16 1541 110 * 465.tonto 16 731 215 * 16 603 261 * 470.lbm 16 1894 116 * 16 1906 115 * 481.wrf 16 1024 175 * 16 999 179 * 482.sphinx3 16 1653 189 * 16 1574 198 * SPECfp(R)_rate_base2006 167 SPECfp_rate2006 186 HARDWARE -------- CPU Name: AMD Opteron 8376 HE CPU Characteristics: CPU MHz: 2300 FPU: Integrated CPU(s) enabled: 16 cores, 4 chips, 4 cores/chip CPU(s) orderable: 2,4 chips Primary Cache: 64 KB I + 64 KB D on chip per core Secondary Cache: 512 KB I+D on chip per core L3 Cache: 6 MB I+D on chip per chip Other Cache: None Memory: 64 GB (16x4 GB, DDR2-800, CL5, Reg, Dual Rank) Disk Subsystem: 1 x 250 GB SATA, 7200 RPM Other Hardware: None SOFTWARE -------- Operating System: Red Hat Enterprise Linux Server release 5.2, Advanced Platform, Kernel 2.6.18-92.el5 Compiler: PGI Server Complete Version 7.2 PathScale Compiler Suite Version 3.2 Auto Parallel: Yes File System: ext3 System State: Run level 3 (Full multiuser with network) Base Pointers: 64-bit Peak Pointers: 32/64-bit Other Software: binutils 2.18 32-bit and 64-bit libhugetlbfs libraries Submit Notes ------------ The config file option 'submit' was used. 'numactl' was used to bind copies to the cores Operating System Notes ---------------------- The libhugetlbfs libraries were installed using the installation rpms that came with the distribution. 'ulimit -s unlimited' was used to set environment stack size 'ulimit -l 2097152' was used to set environment locked pages in memory limit Set vm/nr_hugepages=14336 in /etc/sysctl.conf mount -t hugetlbfs nodev /mnt/hugepages General Notes ------------- Environment variables set by runspec before the start of the run: HUGETLB_MORECORE = "yes" LD_LIBRARY_PATH = "/root/work/cpu2006v1.1/amd909gh-libs/64:/root/work/cpu2006v1.1/amd909gh-libs/32" NCPUS = "4" Base Compiler Invocation ------------------------ C benchmarks: pgcc C++ benchmarks: pgcpp Fortran benchmarks: pgf95 Benchmarks using both Fortran and C: pgcc pgf95 Base Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 -Mnomain 436.cactusADM: -DSPEC_CPU_LP64 -Mnomain 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 447.dealII: -DSPEC_CPU_LP64 450.soplex: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 -Mnomain 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX 482.sphinx3: -DSPEC_CPU_LP64 Base Optimization Flags ----------------------- C benchmarks: -Mvect=cachesize:6291456 -fastsse -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi C++ benchmarks: -Mvect=cachesize:6291456 -fastsse -Msmartalloc=huge -Mfprelaxed --zc_eh -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi Fortran benchmarks: -Mvect=cachesize:6291456 -fastsse -Mfprelaxed -Msmartalloc=huge -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi Benchmarks using both Fortran and C: -Mvect=cachesize:6291456 -fastsse -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi Base Other Flags ---------------- C benchmarks: -Mipa=jobs:4 C++ benchmarks: -Mipa=jobs:4 Fortran benchmarks: -Mipa=jobs:4 Benchmarks using both Fortran and C: -Mipa=jobs:4 Peak Compiler Invocation ------------------------ C benchmarks: pgcc C++ benchmarks (except as noted below): pathCC 444.namd: pgcpp Fortran benchmarks (except as noted below): pathf95 410.bwaves: pgf95 434.zeusmp: pgf95 437.leslie3d: pgf95 Benchmarks using both Fortran and C (except as noted below): pgcc pgf95 435.gromacs: pathcc pathf95 481.wrf: pathcc pathf95 Peak Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -Mnomain 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 -Mnomain 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LP64 -DSPEC_CPU_LINUX -fno-second-underscore 482.sphinx3: -DSPEC_CPU_LP64 Peak Optimization Flags ----------------------- C benchmarks: 433.milc: basepeak = yes 470.lbm: -Mvect=cachesize:6291456 -fastsse -Msmartalloc=huge -Mprefetch=t0 -Mloop32 -Mfprelaxed -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi 482.sphinx3: -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -Mvect=cachesize:6291456 -fastsse -Mfprelaxed -Msmartalloc -tp barcelona-64 -Bstatic_pgi C++ benchmarks: 444.namd: -Mpfi(pass 1) -Mpfo(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -Mvect=cachesize:6291456 -fastsse -Munroll=n:4 -Munroll=m:8 -Msmartalloc=huge -Mnodepchk -Mfprelaxed --zc_eh -tp barcelona-64 -Bstatic_pgi 447.dealII: -march=barcelona -Ofast -static -INLINE:aggressive=on -fno-exceptions -m32 450.soplex: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -L/usr/lib -lhugetlbfs(pass 2) -O3 -INLINE:aggressive=on -OPT:IEEE_arith=3 -OPT:IEEE_NaN_Inf=off -OPT:fold_unsigned_relops=on -OPT:malloc_alg=1 -CG:load_exe=0 -fno-exceptions -m32 453.povray: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -INLINE:aggressive=on Fortran benchmarks: 410.bwaves: -Mvect=cachesize:6291456 -fastsse -Msmartalloc -Mprefetch=nta -Mfprelaxed -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi 416.gamess: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT(pass 2) -L/usr/lib64 -lhugetlbfs(pass 2) -O2 -OPT:Ofast -OPT:ro=3 -OPT:unroll_size=256 434.zeusmp: -Mvect=cachesize:6291456 -fastsse -Mfprelaxed -Mprefetch=distance:8 -Mprefetch=t0 -Msmartalloc=huge -Msmartalloc=hugebss -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi 437.leslie3d: -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -Mvect=cachesize:6291456 -fastsse -Mvect=fuse -Msmartalloc=huge -Mprefetch=distance:8 -Mprefetch=t0 -Mfprelaxed -tp barcelona-64 -Bstatic_pgi 459.GemsFDTD: -march=barcelona -Ofast -LNO:fission=2 -LNO:simd=2 -LNO:prefetch_ahead=1 -CG:load_exe=0 -CG:prefer_lru_reg=off -OPT:malloc_alg=1 -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT -L/usr/lib64 -lhugetlbfs 465.tonto: -march=barcelona -Ofast -OPT:alias=no_f90_pointer_alias -LNO:blocking=off -CG:load_exe=1 -IPA:plimit=525 -OPT:malloc_alg=1 -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT -L/usr/lib64 -lhugetlbfs Benchmarks using both Fortran and C: 435.gromacs: -march=barcelona -Ofast -OPT:rsqrt=2 -OPT:malloc_alg=1 -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT -L/usr/lib64 -lhugetlbfs 436.cactusADM: -Mvect=cachesize:6291456 -fastsse -Mconcur -Msmartalloc=huge -Mfprelaxed -Mipa=fast -Mipa=inline -tp barcelona-64 -Bstatic_pgi 454.calculix: -Mpfi=indirect(pass 1) -Mpfo=indirect(pass 2) -Mipa=fast(pass 2) -Mipa=inline(pass 2) -Mvect=cachesize:6291456 -fastsse -Msmartalloc=huge -Mprefetch=t0 -Mpre -Mfprelaxed -tp barcelona-64 -Bstatic_pgi 481.wrf: -march=barcelona -Ofast -LNO:blocking=off -LNO:prefetch_ahead=10 -LANG:copyinout=off -IPA:callee_limit=5000 -GRA:prioritize_by_density=on -OPT:malloc_alg=1 -m3dnow -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT -L/usr/lib64 -lhugetlbfs Peak Other Flags ---------------- C benchmarks: -Mipa=jobs:4(pass 2) C++ benchmarks: 444.namd: -Mipa=jobs:4(pass 2) Fortran benchmarks (except as noted below): -Mipa=jobs:4(pass 2) 416.gamess: No flags used 459.GemsFDTD: No flags used 465.tonto: No flags used Benchmarks using both Fortran and C (except as noted below): -Mipa=jobs:4(pass 2) 435.gromacs: No flags used 481.wrf: No flags used The flags files that were used to format this result can be browsed at http://www.spec.org/cpu2006/flags/pgi72_linux_flags.html http://www.spec.org/cpu2006/flags/CPU2006_flags.20090710.html http://www.spec.org/cpu2006/flags/amd-platform-amd909gh.html You can also download the XML flags sources by saving the following links: http://www.spec.org/cpu2006/flags/pgi72_linux_flags.xml http://www.spec.org/cpu2006/flags/CPU2006_flags.20090710.xml http://www.spec.org/cpu2006/flags/amd-platform-amd909gh.xml SPEC and SPECfp are registered trademarks of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 2006-2014 Standard Performance Evaluation Corporation Tested with SPEC CPU2006 v1.1. Report generated on Tue Jul 22 22:45:07 2014 by CPU2006 ASCII formatter v6932. Originally published on 13 January 2009.