I have an existing project that requires implementation of my algorithm (PCA for Image Compression) in CUDA. The project already contains source code for Pthreads, MPI, and serial. Your task will be to maintain the structure of the project while implementing CUDA, CUDA + MPI, and conducting an overall correctness check.
Success story sharing