SPEC CFP2000 Summary Hewlett-Packard Company AlphaServer GS1280 7/1150 Wed Dec 11 14:55:07 2002 SPEC License #2 Test date: Dec-2002 Hardware availability: Jan-2003 Tester: HP Software availability: Jan-2003 Base Base Base Peak Peak Peak Benchmarks Ref Time Run Time Ratio Ref Time Run Time Ratio ------------ -------- -------- -------- -------- -------- -------- 168.wupwise 1600 182 877 1600 75.3 2125 168.wupwise 1600 181 883 1600 75.5 2119 168.wupwise 1600 181 883* 1600 75.3 2124* 171.swim 3100 86.4 3590* 3100 86.4 3590* 171.swim 3100 86.4 3589 3100 86.4 3589 171.swim 3100 86.4 3590 3100 86.4 3590 172.mgrid 1800 254 708 1800 167 1079* 172.mgrid 1800 254 708* 1800 167 1079 172.mgrid 1800 254 707 1800 167 1079 173.applu 2100 138 1518* 2100 136 1547 173.applu 2100 138 1519 2100 136 1546* 173.applu 2100 139 1514 2100 136 1546 177.mesa 1400 151 929 1400 126 1109* 177.mesa 1400 151 928 1400 126 1109 177.mesa 1400 151 928* 1400 126 1109 178.galgel 2900 138 2104 2900 136 2129 178.galgel 2900 138 2105* 2900 136 2126 178.galgel 2900 138 2106 2900 136 2126* 179.art 2600 129 2014 2600 80.3 3237* 179.art 2600 129 2015 2600 80.3 3238 179.art 2600 129 2014* 2600 80.3 3237 183.equake 1300 250 520 1300 82.3 1580 183.equake 1300 250 519 1300 82.2 1581* 183.equake 1300 250 519* 1300 82.0 1585 187.facerec 1900 172 1105 1900 154 1233 187.facerec 1900 172 1106 1900 154 1234* 187.facerec 1900 172 1105* 1900 154 1234 188.ammp 2200 299 735 2200 260 847* 188.ammp 2200 299 735* 2200 260 847 188.ammp 2200 299 735 2200 260 846 189.lucas 2000 131 1523 2000 120 1664* 189.lucas 2000 131 1522* 2000 120 1668 189.lucas 2000 131 1521 2000 120 1663 191.fma3d 2100 206 1022 2100 154 1365 191.fma3d 2100 206 1019* 2100 154 1364 191.fma3d 2100 206 1018 2100 154 1364* 200.sixtrack 1100 234 469* 1100 216 510 200.sixtrack 1100 234 470 1100 216 510* 200.sixtrack 1100 234 469 1100 216 510 301.apsi 2600 209 1244 2600 195 1332 301.apsi 2600 209 1241 2600 195 1331* 301.apsi 2600 209 1242* 2600 195 1331 ======================================================================== 168.wupwise 1600 181 883* 1600 75.3 2124* 171.swim 3100 86.4 3590* 3100 86.4 3590* 172.mgrid 1800 254 708* 1800 167 1079* 173.applu 2100 138 1518* 2100 136 1546* 177.mesa 1400 151 928* 1400 126 1109* 178.galgel 2900 138 2105* 2900 136 2126* 179.art 2600 129 2014* 2600 80.3 3237* 183.equake 1300 250 519* 1300 82.2 1581* 187.facerec 1900 172 1105* 1900 154 1234* 188.ammp 2200 299 735* 2200 260 847* 189.lucas 2000 131 1522* 2000 120 1664* 191.fma3d 2100 206 1019* 2100 154 1364* 200.sixtrack 1100 234 469* 1100 216 510* 301.apsi 2600 209 1242* 2600 195 1331* SPECfp_base2000 1124 SPECfp2000 1482 HARDWARE -------- Hardware Vendor: Hewlett-Packard Company Model Name: AlphaServer GS1280 7/1150 CPU: Alpha 21364 CPU MHz: 1150 FPU: Integrated CPU(s) enabled: 1 core, 1 chip, 1 core/chip CPU(s) orderable: 2 to 16 Parallel: No Primary Cache: 64KB(I)+64KB(D) on chip Secondary Cache: 1.75MB on chip per CPU L3 Cache: None Other Cache: None Memory: 4GB Disk Subsystem: 36GB SCSI Other Hardware: None SOFTWARE -------- Operating System: Tru64 UNIX V5.1B (Rev. 2650) +IPK Compiler: Compaq C V6.5-011-48C5K Spike V5.2 (506A) Compaq Fortran V5.5-2602-48C8L Compaq Fortran 77 V5.5-2602-48C8L KAP Fortran V4.3 k3105171 000607 KAP Fortran 77 V4.1 k310440 980926 KAP C V4.1 k010726 000607 File System: ufs System State: Multi-user NOTES ----- Baseline C: cc -arch ev7 -fast -O4 ONESTEP Fortran: f90 -arch ev7 -fast -O5 ONESTEP Peak: All use: -arch ev7 -non_shared ONESTEP except these (which use only the tunings shown below): 173.applu 188.ammp 191.fma3d Individual benchmark tuning: 168.wupwise: kf77 -call_shared -inline all -tune ev67 -unroll 12 -automatic -align commons -arch ev67 -fkapargs=' -aggressive=c -fuse -fuselevel=1 -so=2 -r=1 -o=1 -interleave -ur=6 -ur2=060 ' +PFB 171.swim: same as base 172.mgrid: kf90 -call_shared -arch generic -O5 -inline manual -nopipeline -transform_loops -unroll 9 -automatic -fkapargs='-aggressive=a -fuse -interleave -ur=2 -ur3=5 -cachesize=128,16000 ' +PFB 173.applu: kf90 -O5 -transform_loops -fkapargs=' -o=0 -nointerleave -ur=14 -ur2=260 -ur3=18' +PFB 177.mesa: kcc -fast -O4 +CFB +IFB 178.galgel: f90 -O5 -fast -unroll 5 -automatic 179.art: kcc -assume whole_program -ldensemalloc -call_shared -assume restricted_pointers -unroll 16 -inline none -ckapargs=' -fuse -fuselevel=1 -ur=3' +PFB 183.equake: cc -call_shared -arch generic -fast -O4 -ldensemalloc -assume restricted_pointers -inline speed -unroll 13 -xtaso_short +PFB 187.facerec: f90 -O4 -nopipeline -inline all -non_shared -speculate all -unroll 7 -automatic -assume accuracy_sensitive -math_library fast +IFB 188.ammp: cc -arch host -O4 -ifo -assume nomath_errno -assume trusted_short_alignment -fp_reorder -readonly_strings -ldensemalloc -xtaso_short -assume restricted_pointers -unroll 9 -inline speed +CFB +IFB +PFB 189.lucas: kf90 -O5 -fkapargs='-ur=1' +PFB 191.fma3d: kf90 -arch ev6 -non_shared -O4 -transform_loops -fkapargs='-cachesize=128,16000 ' +PFB 200.sixtrack: f90 -fast -O5 -assume accuracy_sensitive -notransform_loops +PFB 301.apsi: kf90 -O5 -inline none -call_shared -speculate all -align commons -fkapargs=' -aggressive=ab -tune=ev5 -fuse -ur=1 -ur2=60 -ur3=20 -cachesize=128,16000' Most benchmarks are built using one or more types of profile-driven feedback. The types used are designated by abbreviations in the notes: +CFB: Code generation is optimized by the compiler, using feedback from a training run. These commands are done before the first compile (in phase "fdo_pre0"): mkdir /tmp/pp rm -f /tmp/pp/${baseexe}* and these flags are added to the first and second compiles: PASS1_CFLAGS = -prof_gen_noopt -prof_dir /tmp/pp PASS2_CFLAGS = -prof_use -prof_dir /tmp/pp (Peak builds use /tmp/pp above; base builds use /tmp/pb.) +IFB: Icache usage is improved by the post-link-time optimizer Spike, using feedback from a training run. These commands are used (in phase "fdo_postN"): mv ${baseexe} oldexe spike oldexe -feedback oldexe -o ${baseexe} +PFB: Prefetches are improved by the post-link-time optimizer Spike, using feedback from a training run. These commands are used (in phase "fdo_post_makeN"): rm -f *Counts* mv ${baseexe} oldexe pixie -stats dstride oldexe 1>pixie.out 2>pixie.err mv oldexe.pixie ${baseexe} A training run is carried out (in phase "fdo_runN"), and then this command (in phase "fdo_postN"): spike oldexe -fb oldexe -stride_prefetch -o ${baseexe} When Spike is used for both Icache and Prefetch improvements, only one spike command is actually issued, with the Icache options followed by the Prefetch options. vm: vm_bigpg_enabled = 1 vm_bigpg_thresh=16 vm_swap_eager = 0 proc: max_per_proc_address_space = 0x40000000000 max_per_proc_data_size = 0x40000000000 max_per_proc_stack_size = 0x40000000000 max_proc_per_user = 2048 max_threads_per_user = 0 maxusers = 16384 per_proc_address_space = 0x40000000000 per_proc_data_size = 0x40000000000 per_proc_stack_size = 0x40000000000 Portability: galgel: -fixed Information on UNIX V5.1B Patches can be found at http://ftp1.service.digital.com/public/unix/v5.1b/ Processes were bound to CPUs using 'runon'. In the GS1280, there are two CPUs per shelf. Each CPU has its own 4GB of memory. Neither of the CPUs can be physically removed. For 1 CPU result measurements, one CPU was turned off at boot time using the /etc/sysconfigtab setting "cpu_enabled_mask=0". The second CPU's 4GB of memory was also physically removed. ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 1999-2004 Standard Performance Evaluation Corporation Generated on Wed Apr 13 13:13:45 2005 by SPEC CPU2000 ASCII formatter v2.1