Various updates, mainly in InstanceCUDA and in performance reporting
The main changes are:
- Reimplement the buffer memory management from
InstanceCUDA
into the CUDA proxies - Cleanup of the
PowerSensor
code, usestd::shared_ptr
- Reimplement the
Report
class to be much shorter and easier to maintain