-
Notifications
You must be signed in to change notification settings - Fork 52
Open
Description
I see this:
enum class SmemSwizzleBits : uint8_t {
DISABLE = 0,
B32 = 1,
B64 = 2,
B128 = 3,
};
And I changed this to 0:
// tensor_map
utils::TmaDescriptor tensormap_a =
utils::make_tma_copy_desc<BLOCKM, BLOCKK, 3>(
gemm_params.A, gemm_params.M, gemm_params.K, Swizzle<3, 4, 3>{},----------------> to <0, 4, 3>
CLUSTER_N);
utils::TmaDescriptor tensormap_b =
utils::make_tma_copy_desc<BLOCKN, BLOCKK, 3>(
gemm_params.B, gemm_params.N, gemm_params.K, Swizzle<3, 4, 3>{},----------------> to <0, 4, 3>
CLUSTER_M);
But I get error:
Copying results done!
Wrong answer! 14663841 errors! 87.4033%
Average diff = 596.463
test: ../../../include/common.h:396: void assert_allclose(DType*, DType*, std::vector<int>, float, bool) [with DType = __half]: Assertion errors == 0' failed.
Aborted (core dumped)
Any suggestion? Thanks!
Metadata
Metadata
Assignees
Labels
No labels