Skip to content

Support different matrix layouts (row vs col major)

We now only support A and C in row-major order, and B in col-major order. It should be relatively straightforward to support other combinations in the GEMM kernels, but we should decide how to support it in the host code as well.

Note: not all combinations are supported, e.g. 1-bit requires that A is row-major and B col-major.