PULSAR 1.0.0, August 2013
The first release of PULSAR provides a complete API for building and executing a Virtual Systolic Array (VSA) - a collection of Virtual Data Processors (VDPs) connected with channels and communicating via packets.
The runtime supports distributed memory systems with multicore processors and relies on POSIX Threads (a.k.a. Pthreads) for intra-node multithreading, and on the Message Passing Interface (MPI) for inter-node communication.
This release is accompanied by an implementation of the tile QR factorization with sequential (a.k.a. “domino”) panel reduction and an implementation of the LU factorization with no pivoting.