Base Results
Optimized Results
Base and Optimized
Manufacturer/Processor Type, Speed, Count, Threads, Processes
Includes the manufacturer/processor type, processor speed, number of processors, number of threads, and number of processes.
Hovering over this column in any row displays additional information: manufacturer, system name, interconnect, MPI, affiliation, and submission date.

Run Type

Indicates whether the benchmark was a base run or an optimized run.

Processors

The number of processors used in the benchmark, as entered in the form by the benchmark submitter.

G-HPL ( system performance )
HPL solves a randomly generated dense linear system of equations in double-precision (IEEE 64-bit) floating-point arithmetic using MPI. The linear system matrix is stored in a two-dimensional block-cyclic fashion, and multiple variants of code are provided for computational kernels and communication patterns. The solution method is LU factorization through Gaussian elimination with partial row pivoting, followed by a backward substitution. Unit: Tera Flops per Second
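The solution method named above can be sketched serially in a few lines. This is only an illustration of Gaussian elimination with partial row pivoting followed by backward substitution; it is not the HPL code, it uses no MPI or block-cyclic storage, and the function name `lu_solve` is an assumption made for this example.

```python
def lu_solve(a, b):
    """Solve a x = b by Gaussian elimination with partial row pivoting.

    `a` is a square matrix given as a list of rows, `b` the right-hand side.
    Serial illustration only (hypothetical helper, not part of HPL).
    """
    n = len(a)
    # Work on an augmented copy so the caller's data is left untouched.
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for k in range(n):
        # Partial pivoting: choose the row with the largest |entry| in column k.
        p = max(range(k, n), key=lambda i: abs(m[i][k]))
        if m[p][k] == 0.0:
            raise ValueError("matrix is singular")
        m[k], m[p] = m[p], m[k]
        # Eliminate column k from all rows below the pivot row.
        for i in range(k + 1, n):
            f = m[i][k] / m[k][k]
            for j in range(k, n + 1):
                m[i][j] -= f * m[k][j]
    # Backward substitution on the resulting upper-triangular system.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x
```

For example, `lu_solve([[0.0, 1.0], [1.0, 0.0]], [2.0, 3.0])` needs a row swap before it can eliminate, which is the case partial pivoting exists to handle.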
G-PTRANS (A=A+B^T, MPI) ( system performance )
PTRANS (A=A+B^T, MPI) implements a parallel matrix transpose for two-dimensional block-cyclic storage. It is an important benchmark because it heavily exercises the communications of the computer on a realistic problem in which pairs of processors communicate with each other simultaneously. It is a useful test of the total communications capacity of the network. Unit: Giga Bytes per Second
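A small serial sketch may help show why this operation is communication-heavy. It assumes a P x Q process grid with the usual block-cyclic rule (block row `ib` lives on process row `ib mod P`, block column `jb` on process column `jb mod Q`); the helper names are hypothetical and nothing here is taken from the benchmark source.

```python
def ptrans_update(a, b):
    """Return A + B^T for square matrices given as lists of rows."""
    n = len(a)
    return [[a[i][j] + b[j][i] for j in range(n)] for i in range(n)]


def block_owner(ib, jb, p, q):
    """Process-grid coordinates owning global block (ib, jb).

    Assumes a plain block-cyclic layout on a p x q grid starting at
    process (0, 0).
    """
    return (ib % p, jb % q)


def transpose_partner(ib, jb, p, q):
    """Owner of the B block that block (ib, jb) of A needs, i.e. B's block (jb, ib)."""
    return block_owner(jb, ib, p, q)
```

On a 2 x 2 grid, block (1, 2) of A lives on process (1, 0) while the matching block of B lives on process (0, 1), so the update cannot proceed without moving data between the two.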
G-RandomAccess ( system performance )
Global RandomAccess, also called GUPS, measures the rate at which the computer can update pseudo-random locations of its memory. This rate is expressed in billions (giga) of updates per second (GUP/s). Unit: Giga Updates per Second
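The kind of update being counted can be sketched as follows. This is a simplified serial illustration: it draws indices from Python's `random` module rather than the benchmark's own pseudo-random stream, and the names `apply_updates` and `gups` are assumptions made for this example.

```python
import random


def apply_updates(table, n_updates, seed=0):
    """XOR pseudo-random values into pseudo-random slots of `table` in place.

    The table length must be a power of two so that masking selects a slot.
    Python's generator stands in for the benchmark's random stream here.
    """
    size = len(table)
    if size == 0 or size & (size - 1):
        raise ValueError("table length must be a power of two")
    rng = random.Random(seed)
    for _ in range(n_updates):
        r = rng.getrandbits(64)
        table[r & (size - 1)] ^= r


def gups(n_updates, seconds):
    """Convert an update count and elapsed time into giga-updates per second."""
    return n_updates / seconds / 1e9
```

Because XOR is its own inverse, replaying the same stream a second time restores the original table, which gives a cheap way to check that no update was lost.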
EP-STREAM Triad ( per process )
The Embarrassingly Parallel STREAM benchmark is a simple synthetic benchmark program that measures sustainable memory bandwidth and the corresponding computation rate for simple numerical vector kernels. It is run in an embarrassingly parallel manner: all computational nodes perform the benchmark at the same time, and the arithmetic average rate is reported. Unit: Giga Bytes per Second
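As a rough illustration of the Triad kernel named in the column heading, the sketch below computes `a[i] = b[i] + scalar * c[i]` and converts a timing into a bandwidth. It assumes the usual STREAM accounting of three 8-byte words moved per element (two reads and one write); the function names are hypothetical.

```python
def triad(b, c, scalar):
    """Triad kernel: a[i] = b[i] + scalar * c[i]."""
    return [bi + scalar * ci for bi, ci in zip(b, c)]


def triad_bandwidth_gb_s(n, seconds, bytes_per_word=8):
    """Bandwidth in GB/s for a triad over n elements.

    Assumes three words of traffic per element: read b, read c, write a.
    """
    return 3 * bytes_per_word * n / seconds / 1e9
```

Pure Python is far too slow to measure real memory bandwidth; the point is only to show what is being counted.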
EP-STREAM-sys ( system performance - derived )
The Embarrassingly Parallel STREAM benchmark is a simple synthetic benchmark program that measures sustainable memory bandwidth and the corresponding computation rate for simple numerical vector kernels. It is run in an embarrassingly parallel manner: all computational nodes perform the benchmark at the same time, and the arithmetic average rate is multiplied by the number of processes to obtain this derived value (EP-STREAM Triad * MPI Processes). Unit: Giga Bytes per Second
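The derived value is a single multiplication, sketched here with a hypothetical helper name. The example figures are the Atipa Conquest cluster entry from the results listing (Triad 1.6291 GB/s, 128 processes, reported EP-STREAM Sys 208.525 GB/s).

```python
def stream_sys(triad_gb_s, mpi_processes):
    """Derived system bandwidth: per-process Triad rate times process count."""
    return triad_gb_s * mpi_processes


# Atipa Conquest cluster: 1.6291 GB/s per process, 128 processes.
atipa_sys = stream_sys(1.6291, 128)
```

Recomputing from the rounded Triad column can differ from the published figure in the last digits, so comparisons are best made with a small tolerance.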
EP-DGEMM ( per process )
The Embarrassingly Parallel DGEMM benchmark measures the floating-point execution rate of double-precision real matrix-matrix multiplication performed by the DGEMM subroutine from the BLAS (Basic Linear Algebra Subprograms). It is run in an embarrassingly parallel manner: all computational nodes perform the benchmark at the same time, and the arithmetic average rate is reported. Unit: Giga Flops per Second
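For reference, DGEMM computes C = alpha * A * B + beta * C. The sketch below is a naive pure-Python version with the conventional operation count of 2 * m * n * k, which is what turns a timing into a flop rate. It is an illustration only and does not call BLAS; the helper names are assumptions.

```python
def dgemm(alpha, a, b, beta, c):
    """Return alpha * A * B + beta * C for matrices given as lists of rows."""
    m, k, n = len(a), len(b), len(b[0])
    return [
        [
            alpha * sum(a[i][p] * b[p][j] for p in range(k)) + beta * c[i][j]
            for j in range(n)
        ]
        for i in range(m)
    ]


def dgemm_flops(m, n, k):
    """Conventional operation count: one multiply and one add per inner step."""
    return 2 * m * n * k


def gflops(flops, seconds):
    """Convert an operation count and elapsed time into GFlop/s."""
    return flops / seconds / 1e9
```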
G-FFTE ( system performance )
Global FFTE performs the same test as FFTE, but across the entire system, by distributing the input vector in block fashion across all the nodes. Unit: Giga Flops per Second
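To make the measured quantity concrete, here is a minimal serial radix-2 FFT together with a flop estimate. The 5 * N * log2(N) count is a common convention for complex FFTs and is used here as an assumption; the code is not FFTE and does not distribute the vector across nodes.

```python
import cmath
import math


def fft(x):
    """Recursive radix-2 discrete Fourier transform; len(x) must be a power of two."""
    n = len(x)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    if n == 1:
        return [complex(x[0])]
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        # Twiddle factor for output index k.
        t = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def fft_flops(n):
    """Conventional flop estimate for a length-n complex FFT (assumed here)."""
    return 5.0 * n * math.log2(n)
```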
Randomly Ordered Ring Bandwidth ( per process )
Randomly Ordered Ring Bandwidth reports the bandwidth achieved in the ring communication pattern. The communicating nodes are ordered randomly in the ring (with respect to the natural ordering of the MPI default communicator). The result is averaged over various random assignments of processes in the ring. Unit: Giga Bytes per Second
Randomly-Ordered Ring Latency ( per process )
Randomly-Ordered Ring Latency reports the latency in the ring communication pattern. The communicating nodes are ordered randomly in the ring (with respect to the natural ordering of the MPI default communicator). The result is averaged over various random assignments of processes in the ring. Unit: microseconds
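The random ring used by both ring measurements can be pictured with a short sketch: shuffle the natural rank order, then give every rank a left and a right neighbor in the shuffled ring. The function name and the use of a seed are assumptions for illustration; the benchmark itself does this over MPI.

```python
import random


def random_ring(n_procs, seed):
    """Map each rank to its (left, right) neighbors in a randomly ordered ring."""
    order = list(range(n_procs))
    random.Random(seed).shuffle(order)
    neighbors = {}
    for pos, rank in enumerate(order):
        left = order[pos - 1]               # wraps around at position 0
        right = order[(pos + 1) % n_procs]  # wraps around at the last position
        neighbors[rank] = (left, right)
    return neighbors
```

Averaging over several seeds corresponds to the "various random assignments" mentioned in the descriptions above.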

Condensed Results - Base Runs Only - 192 Systems - Generated on Wed Aug 20 20:06:20 2008
System Information: System - Processor - Speed - Count - Threads - Processes
MA/PT/PS/PC/TH/PR/CM/CS/IC/IA/SD
G-HPL (TFlop/s) | G-PTRANS (GB/s) | G-RandomAccess (Gup/s) | G-FFTE (GFlop/s) | EP-STREAM Sys (GB/s) | EP-STREAM Triad (GB/s) | EP-DGEMM (GFlop/s) | RandomRing Bandwidth (GB/s) | RandomRing Latency (usec)
Manufacturer: Atipa
Processor Type: AMD Opteron
Processor Speed: 1.4GHz
Processor Count: 128
Threads: 1
Processes: 128
System Name: Conquest cluster
Interconnect: Myrinet 2000
MPI: Lam-GM 7.0.4
Affiliation: University of Tennessee
Submission Date: 06-01-04
Atipa Conquest cluster AMD Opteron   1.4GHz 128 1 128
0.2526110
3.2471


208.525
1.6291

0.03627
23.676
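Each system in this listing is followed by nine value lines in the column order of the table header, with an empty line where a result is missing (the record above has no G-RandomAccess, G-FFTE, or EP-DGEMM value). A minimal sketch of reading one such record, with a hypothetical helper name and column labels taken from the header:

```python
COLUMNS = [
    "G-HPL", "G-PTRANS", "G-RandomAccess", "G-FFTE",
    "EP-STREAM Sys", "EP-STREAM Triad", "EP-DGEMM",
    "RandomRing Bandwidth", "RandomRing Latency",
]


def parse_values(lines):
    """Turn the nine value lines of one record into a dict; blanks become None."""
    if len(lines) != len(COLUMNS):
        raise ValueError("expected one line per column")
    return {
        name: (float(text) if text.strip() else None)
        for name, text in zip(COLUMNS, lines)
    }


# The Atipa Conquest cluster record, copied from the listing.
atipa = parse_values(
    ["0.2526110", "3.2471", "", "", "208.525", "1.6291", "", "0.03627", "23.676"]
)
```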
Manufacturer: Clustervision BV
Processor Type: AMD Opteron
Processor Speed: 2.4GHz
Processor Count: 32
Threads: 1
Processes: 32
System Name: Beastie
Interconnect: Gigabit Ethernet, HP pro curve
MPI: MPICH 1.2.7
Affiliation: University of Glasgow
Submission Date: 08-29-05
Clustervision BV Beastie AMD Opteron   2.4GHz 32 1 32
0.1037640
0.8159
0.0002350
2.1470
106.951
3.3422
4.19493
0.02648
53.234
Manufacturer: ClusterVision/Dell/QLogic
Processor Type: Intel Xeon 5160
Processor Speed: 3GHz
Processor Count: 64
Threads: 1
Processes: 64
System Name: Darwin
Interconnect: InfiniBand DDR, QLogic TrueScale adapters
MPI: HPMPI-2.02.07.00
Affiliation: University of Cambridge
Submission Date: 08-04-08
ClusterVision/Dell/QLogic Darwin Intel Xeon 5160   3GHz 64 1 64
0.6327300
7.3008
0.2323370
14.3276
87.826
1.3723
10.75970
0.28258
1.248
Manufacturer: ClusterVision/Dell/QLogic
Processor Type: Intel Xeon 5160
Processor Speed: 3GHz
Processor Count: 128
Threads: 1
Processes: 128
System Name: Darwin
Interconnect: InfiniBand DDR, QLogic TrueScale adapters
MPI: HPMPI-2.02.07.00
Affiliation: University of Cambridge
Submission Date: 08-05-08
ClusterVision/Dell/QLogic Darwin Intel Xeon 5160   3GHz 128 1 128
1.2689700
15.8777
0.4199140
28.9749
176.827
1.3815
10.96020
0.26281
1.297
Manufacturer: ClusterVision/Dell/QLogic
Processor Type: Intel Xeon 5160
Processor Speed: 3GHz
Processor Count: 256
Threads: 1
Processes: 256
System Name: Darwin
Interconnect: InfiniBand DDR, QLogic TrueScale adapters
MPI: HPMPI-2.02.07.00
Affiliation: University of Cambridge
Submission Date: 08-05-08
ClusterVision/Dell/QLogic Darwin Intel Xeon 5160   3GHz 256 1 256
2.4601300
30.8088
0.7689970
54.3011
349.169
1.3639
10.85620
0.22469
1.326
Manufacturer: Cray Inc.
Processor Type: AMD Opteron
Processor Speed: 2.4GHz
Processor Count: 12960
Threads: 1
Processes: 25920
System Name: Red Storm/XT3
Interconnect: Cray custom
MPI: MPICH 2 v 1.0.2
Affiliation: NNSA/Sandia National Laboratories
Submission Date: 11-10-06
Cray Inc. Red Storm/XT3 AMD Opteron   2.4GHz 12960 1 25920
91.0350000
2356.9700
1.7401500
1554.0700
54840.499
2.1158
4.39939
0.05911
16.294
Manufacturer: Cray Inc.
Processor Type: Alpha 21164
Processor Speed: 0.6GHz
Processor Count: 1024
Threads: 1
Processes: 1024
System Name: T3E
Interconnect: Cray 3D torus
MPI: EPCC MPI
Affiliation: Army High Performance Computing Research Center
Submission Date: 02-03-04
Cray Inc. T3E Alpha 21164   0.6GHz 1024 1 1024
0.0481695
10.2765


529.242
0.5168

0.03174
12.093
Manufacturer: Cray Inc.
Processor Type: Alpha 21164
Processor Speed: 0.675GHz
Processor Count: 512
Threads: 1
Processes: 512
System Name: T3E
Interconnect: 3-D Torus
MPI: MPI 2.2.0.0
Affiliation: Engineer Research Development Center - Army Major Shared Resource Center
Submission Date: 11-02-04
Cray Inc. T3E Alpha 21164   0.675GHz 512 1 512
0.2231810
9.7741
0.0289464
15.4774
272.186
0.5316
0.66077
0.03571
8.141
Manufacturer: Cray Inc.
Processor Type: Cray X1 MSP
Processor Speed: 0.8GHz
Processor Count: 64
Threads: 1
Processes: 64
System Name: X1
Interconnect: Cray modified 2D torus
MPI: Cray MPT 2.2
Affiliation: Oak Ridge National Laboratory
Submission Date: 12-01-03
Cray Inc. X1 Cray MSP   0.8GHz 64 1 64
0.5215600
3.2288


959.334
14.9896

0.94074
20.345
Manufacturer: Cray Inc.
Processor Type: Cray X1 MSP
Processor Speed: 0.8GHz
Processor Count: 60
Threads: 1
Processes: 60
System Name: X1
Interconnect: Cray modified 2D torus
MPI: MPT 2.4
Affiliation: Engineer Research and Development Center Major Shared Resource Center
Submission Date: 04-26-04
Cray Inc. X1 Cray MSP   0.8GHz 60 1 60
0.5777790
30.4313


898.446
14.9741

1.03291
20.827
Manufacturer: Cray Inc.
Processor Type: Cray X1 MSP
Processor Speed: 0.8GHz
Processor Count: 120
Threads: 1
Processes: 120
System Name: X1
Interconnect: Cray modified 2D torus
MPI: Cray MPT 2.2
Affiliation: Army High Performance Computing Research Center
Submission Date: 02-03-04
Cray Inc. X1 Cray MSP   0.8GHz 120 1 120
1.0609700
2.4603


1019.519
8.4960

0.83014
20.115
Manufacturer: Cray Inc.
Processor Type: Cray X1 MSP
Processor Speed: 0.8GHz
Processor Count: 252
Threads: 1
Processes: 252
System Name: X1
Interconnect: X1
MPI: MPT 2.4
Affiliation: Oak Ridge National Laboratory
Submission Date: 04-26-04
Cray Inc. X1 Cray MSP   0.8GHz 252 1 252
2.3847300
97.4076


3758.404
14.9143

0.42899
22.271
Manufacturer: Cray Inc.
Processor Type: Cray X1 MSP
Processor Speed: 0.8GHz
Processor Count: 124
Threads: 1
Processes: 124
System Name: X1
Interconnect: Cray modified 2D torus
MPI: MPT 2.3.0.3
Affiliation: Army High Performance Computing Research Center (AHPCRC)
Submission Date: 05-03-04
Cray Inc. X1 Cray MSP   0.8GHz 124 1 124
1.2054200
39.5252


1856.664
14.9731

0.70857
20.152
Manufacturer: Cray Inc.
Processor Type: Cray X1 MSP
Processor Speed: 0.8GHz
Processor Count: 60
Threads: 1
Processes: 60
System Name: X1
Interconnect: Cray modified 2-D Torus
MPI: MPT 2.3.0.0
Affiliation: Engineer Research and Development Center Major Shared Resource Center
Submission Date: 11-02-04
Cray Inc. X1 Cray MSP   0.8GHz 60 1 60
0.5087430
1.6342
0.0030750
3.1444
894.114
14.9019
10.91520
1.16779
14.656
Manufacturer: Cray Inc.
Processor Type: Cray X1 MSP
Processor Speed: 0.8GHz
Processor Count: 32
Threads: 1
Processes: 32
System Name: X1
Interconnect: Cray modified 2-D Torus
MPI: MPT 2.4
Affiliation: Cray Inc.
Submission Date: 11-22-04
Cray Inc. X1 Cray MSP   0.8GHz 32 1 32
0.2767140
32.6606
0.0016620
2.9649
475.846
14.8702
8.25848
1.41269
14.940
Manufacturer: Cray Inc.
Processor Type: Cray X1E
Processor Speed: 1.13GHz
Processor Count: 1008
Threads: 1
Processes: 1008
System Name: X1
Interconnect: Cray Modified 2D torus
MPI: MPT
Affiliation: DOE/Office of Science/ORNL
Submission Date: 11-02-05
Cray Inc. X1 Cray E   1.13GHz 1008 1 1008
12.0263000
108.0190
0.0861199
82.3884
15522.091
15.3989
14.50000
0.15667
16.299
Manufacturer: Cray Inc.
Processor Type: Cray X1 MSP
Processor Speed: 1.13GHz
Processor Count: 252
Threads: 1
Processes: 252
System Name: X1E
Interconnect: Cray modified 2D torus
MPI: MPT2.4.0.3
Affiliation: Army High Performance Computing Research Center
Submission Date: 06-16-05
Cray Inc. X1E Cray X1 MSP   1.13GHz 252 1 252
3.1940900
85.2040
0.0148684
15.5352
2439.985
9.6825
14.18470
0.36024
14.934
Manufacturer: Cray Inc.
Processor Type: Cray X1E
Processor Speed: 1.13GHz
Processor Count: 32
Threads: 4
Processes: 32
System Name: X1E
Interconnect: Cray Interconnect
MPI: mpt.2.4.0.4.4
Affiliation: ORNL
Submission Date: 09-13-05
Cray Inc. X1E Cray   1.13GHz 32 4 32
0.3376360
18.9199
0.0089686
5.2027
307.565
9.6114
11.60560
1.40487
12.208
Manufacturer: Cray Inc.
Processor Type: AMD Opteron
Processor Speed: 2.2GHz
Processor Count: 64
Threads: 1
Processes: 64
System Name: XD1
Interconnect: RapidArray Interconnect System
MPI: MPI over Rapid Array
Affiliation: Cray Inc.
Submission Date: 11-22-04
Cray Inc. XD1 AMD Opteron   2.2GHz 64 1 64
0.2238980
10.5924
0.0223966
16.3611
169.955
2.6555
4.03375
0.22697
1.629
Manufacturer: Cray Inc.
Processor Type: AMD Opteron
Processor Speed: 2.4GHz
Processor Count: 128
Threads: 1
Processes: 128
System Name: XD1
Interconnect: Rapid Array Fat Tree
MPI: mpich/mpich-pgi601
Affiliation: Cray
Submission Date: 06-15-05
Cray Inc. XD1 AMD Opteron   2.4GHz 128 1 128
0.5020760
13.5155
0.0666722
35.5172
500.065
3.9068
4.33435
0.25919
2.062
Manufacturer: Cray Inc.
Processor Type: AMD Opteron
Processor Speed: 2.6GHz
Processor Count: 1100
Threads: 1
Processes: 1100
System Name: XT3
Interconnect: Cray XT3
MPI: Mpich 2.0
Affiliation: Swiss National Supercomputing Centre CSCS
Submission Date: 06-08-05
Cray Inc. XT3 AMD Opteron   2.6GHz 1100 1 1100
4.7823400
217.9230
0.1370020
266.6600
5274.698
4.7952
4.81050
0.28638
25.942
Manufacturer: Cray Inc.
Processor Type: AMD Opteron
Processor Speed: 2.4GHz
Processor Count: 3744
Threads: 1
Processes: 3744
System Name: XT3
Interconnect: Cray XT3 MPP Interconnect
MPI: MPICH 2.0
Affiliation: Cray Inc. at Oak Ridge National Laboratory
Submission Date: 06-21-05
Cray Inc. XT3 AMD Opteron   2.4GHz 3744 1 3744
14.7040000
608.5060
0.2202960
417.1720
18146.382
4.8468
4.41330
0.16164
25.319
Manufacturer: Cray Inc.
Processor Type: AMD Opteron
Processor Speed: 2.4GHz
Processor Count: 5200
Threads: 1
Processes: 5200
System Name: XT3
Interconnect: Cray XT3 MPP Interconnect
MPI: MPICH 2.0
Affiliation: Cray Inc. at Oak Ridge National Laboratory
Submission Date: 08-01-05
Cray Inc. XT3 AMD Opteron   2.4GHz 5200 1 5200
20.5270000
874.8990
0.2685830