Skip to content

Latest commit

 

History

History
5 lines (4 loc) · 374 Bytes

update_log.md

File metadata and controls

5 lines (4 loc) · 374 Bytes

Updates

[2023.6.21] I found the reason that the original program is slow: I forgot to turn off the debug flags during training: debug_flags The commented code in the above fig are enabled when I was using NVIDIA Nsight Compute. They damage the performance dramatically and should be commented during actual training.