Fix runtime issues with NVML powermeter
Two issues were found when using the NVML powermeter:
- The first read after creation is erroneous: fixed by introducing a short wait
- NVML fails when the CUDA environment is destroyed: fixed by adding a try-catch around the problematic NVML call
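
For reviewers, the two fixes can be sketched roughly as below. This is a minimal illustration and not the code in this change: `GuardedPowerMeter`, `Reader`, and `read_watts` are hypothetical names, the warm-up duration is chosen by the caller, and the real NVML query is replaced by an injected callable that is assumed to throw when the query fails. Returning the last good reading on failure is also an assumption about the desired fallback.

```cpp
#include <cassert>
#include <chrono>
#include <exception>
#include <functional>
#include <thread>
#include <utility>

// Hypothetical wrapper illustrating both fixes. `Reader` stands in for the
// real NVML power query and is assumed to throw when the query fails.
class GuardedPowerMeter {
public:
    using Reader = std::function<double()>;

    GuardedPowerMeter(Reader reader, std::chrono::milliseconds warmup)
        : reader_(std::move(reader)) {
        assert(reader_ && "reader must be callable");
        // Fix 1: wait briefly after creation so the first read is not erroneous.
        std::this_thread::sleep_for(warmup);
    }

    // Returns the latest reading in watts, or the last good reading if the
    // underlying query throws (for example once the CUDA environment is gone).
    double read_watts() {
        try {
            last_good_ = reader_();
        } catch (const std::exception&) {
            // Fix 2: swallow the failure and keep the previous value.
        }
        return last_good_;
    }

private:
    Reader reader_;
    double last_good_ = 0.0;  // value reported before any successful read
};
```

A caller would construct the meter with a callable wrapping the actual power query and a short warm-up such as `std::chrono::milliseconds(100)`, then call `read_watts()` as usual. The 100 ms figure is only an example value.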