Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python ⚡
Updated Jun 11, 2024 - Python
An adaptive and distributed-memory parallel implementation of the immersed boundary (IB) method
Python bindings for MPI
Performance-portable geometric search library
Material Point Method (MPM) implementation in Common Lisp
Scripts and examples for Azure Batch for modeling runs
A flyweight in situ visualization and analysis runtime for multi-physics HPC simulations
Multi-level I/O tracing library
A unified cross-architecture heterogeneous CFD solver
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
The PennyLane-Lightning plugin provides a fast state-vector simulator written in C++ for use with PennyLane
Implementation of the BLAS level 3 algorithm SYRK (symmetric rank-k update) using MPI.
Advanced High Performance Computing in C with OpenMP, CUDA, MPI and NCCL. The project folder contains my final project for the special course: a Jacobi solver for the Poisson partial differential equation, implemented with OpenMP on the CPU, with CUDA on the GPU, and with CUDA, MPI and NCCL on multiple GPUs.