NPRG058: Advanced Programming in Parallel Environment

Assignment 2 - CUDA Histogram II

We continue with the Histogram saga (assuming you have the first assignment completed).

In this episode, we will try to improve the performance of the atomic updates kernel by three optimizations:

Experiment! Implement all approaches (or their combinations) and empirically determine, which approaches are better and when (i.e., test them for different inputs and different sizes of histogram).