Skip to content

Update to latest wmma kernel

Ronald Rook requested to merge update_to_latest_wmma_kernel into main

Update WMMA kernel. Added a switch for WMMA_K parameter to support 16/32bit floats in GEMM. Improved device<->shmem copying.

Merge request reports

Loading