Update timings (!20) · Merge requests · ResearchAndDevelopment / RECRUIT / Beamformer LOFAR

Bram Veenboer requested to merge update-timings into main Sep 03, 2024

The performance counter class didn't properly work with asynchronous launches, as events would be reused while they may not have been recorded. This is solved by using a double-ended queue. The pipeline benchmark now relies on the performance counters to print statistics about individual kernels, simplifying the code. Because events are now created upon kernel launch, we need to make sure that the device context is set properly. This affects the benchmarks for the individual kernels.

Update timings

Merge request reports