Skip to content

Commit

Permalink
fix
Browse files Browse the repository at this point in the history
  • Loading branch information
masahi committed May 17, 2022
1 parent 403050b commit 3d2c90d
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/target/source/codegen_cuda.cc
Original file line number Diff line number Diff line change
Expand Up @@ -840,7 +840,7 @@ void CodeGenCUDA::VisitExpr_(const CallNode* op, std::ostream& os) {
os << "for (int i = 0; i < 4; ++i) {\n";
os << dst << "[(i / 2 * 8 + threadIdx.x / 4) * " << stride
<< " + outer * 8 + (threadIdx.x % 4) * 2 + i % 2]"
<< " = " << src << "[" << src_offset << " + i * outer * 4];\n";
<< " = " << src << "[" << src_offset << " + outer * 4 + i];\n";
os << "}\n";
os << "}\n";
}
Expand Down

0 comments on commit 3d2c90d

Please sign in to comment.