[Topi][Cuda]Optimizations of global_ave_pool for NHWC layout #5450

SXM-inspur · 2020-04-27T07:32:59Z

global_ave_pool accounted for about 14.8% of the runtime of Resnet50_v2 at a batch size of 32 with Tensor Core enabled on a Tesla T4 GPU. With the NHWC-layout optimizations in this PR, its share drops to 0.134%. The unit-test results are listed below, with latency reported in ms. The speedup ranges from 3.17x at batch size 256 to 38.67x at batch size 16.

batch size	original (ms)	after optimization (ms)	speedup (x)
16	1.16	0.03	38.67
32	1.17	0.06	19.5
256	1.65	0.52	3.17

Table 1. Unit-test latency before and after optimization. The input feature map has shape batch x 7 x 7 x 2048 (NHWC).
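
For readers unfamiliar with the operator, here is a minimal pure-Python sketch of what global average pooling computes on an NHWC tensor. It only illustrates the semantics and is not the CUDA schedule added in this PR; the function name `global_avg_pool_nhwc` and the small stand-in shape are assumptions made for illustration.

```python
def global_avg_pool_nhwc(x):
    """Average each channel over the H and W axes of an NHWC tensor.

    `x` is a nested list indexed as x[n][h][w][c]. The result is indexed
    as out[n][0][0][c], i.e. it has shape (N, 1, 1, C).
    Hypothetical reference helper, not part of TVM.
    """
    out = []
    for image in x:  # one H x W x C image per batch entry
        height, width = len(image), len(image[0])
        channels = len(image[0][0])
        sums = [0.0] * channels
        for row in image:
            for pixel in row:  # a pixel holds all C channel values
                for c, value in enumerate(pixel):
                    sums[c] += value
        out.append([[[s / (height * width) for s in sums]]])
    return out


# Small stand-in for the batch x 7 x 7 x 2048 inputs in Table 1
# (fewer channels so the pure-Python loops stay fast).
N, H, W, C = 2, 7, 7, 4
x = [[[[float(n * 100 + c) for c in range(C)] for _ in range(W)]
      for _ in range(H)] for n in range(N)]
y = global_avg_pool_nhwc(x)
```

In NHWC the channel axis is innermost in memory, so each output value is a reduction over 49 (7 x 7) elements that are C positions apart, while neighbouring channels sit next to each other.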

@Hzfengsy @Laurawly @vinx13 @jwfromm Please help review.

jwfromm

LGTM

…5450) * Optimizations of global_ave_pool for NHWC layout * Optimize the code format to pass inspection of pylint Co-authored-by: Shawn-Inspur <wushaohua@inspur.com>

Shawn-IEITSystems added 2 commits April 27, 2020 06:40

Optimizations of global_ave_pool for NHWC layout

3fdee7f

Optimize the code format to pass inspection of pylint

c1b888b

SXM-inspur changed the title ~~Optimizations of global_ave_pool for NHWC layout~~ [Topi][Cuda]Optimizations of global_ave_pool for NHWC layout Apr 27, 2020

Hzfengsy approved these changes Apr 27, 2020


jwfromm approved these changes Apr 28, 2020


vinx13 approved these changes Apr 28, 2020


vinx13 merged commit 0a1e160 into apache:master Apr 28, 2020

vinx13 added the status: accepted label Apr 28, 2020

ZihengJiang mentioned this pull request Sep 25, 2020

TVM v0.7 Release Note Candidate #6486

Closed
