[Runtime][Contrib] Support cudnn softmax #5214
Conversation
```cpp
// Set mode and shape descriptor
if (axis == ndim - 1) {
  int64_t N = 1;
```
I'm confused why we need `int64_t` here but later cast to `int`.
It's because DLTensor defines its shape as `int64_t`. There will be a cast anyway.
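For context, a minimal sketch of why the cast is unavoidable. The helper name `SetSoftmaxShape` and the omitted error handling are illustrative assumptions, not the PR's actual code; only `DLTensor::shape` being `int64_t*` (dlpack.h) and `cudnnSetTensor4dDescriptor` taking `int` dimensions come from the real headers:

```cpp
#include <cstdint>
#include <cudnn.h>
#include <dlpack/dlpack.h>

// DLTensor (dlpack.h) declares its shape as `int64_t*`, while
// cudnnSetTensor4dDescriptor takes plain `int` dimensions, so the
// product is accumulated in int64_t and cast down at the cuDNN boundary.
void SetSoftmaxShape(const DLTensor* x, int axis, cudnnTensorDescriptor_t desc) {
  int ndim = x->ndim;
  if (axis == ndim - 1) {
    // Flatten all leading axes into N; the softmax runs over the last axis.
    int64_t N = 1;
    for (int i = 0; i < ndim - 1; ++i) N *= x->shape[i];
    cudnnSetTensor4dDescriptor(desc, CUDNN_TENSOR_NCHW, CUDNN_DATA_FLOAT,
                               static_cast<int>(N),                   // n
                               static_cast<int>(x->shape[ndim - 1]),  // c
                               /*h=*/1, /*w=*/1);
  }
}
```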
As part of the principle, it would be great if we could look into making the native op as fast.
@tqchen Yes, I understand that. But the latency difference between the TVM schedule and cuDNN can be as large as 10x for an input shape like [100, 1024] on V100. I guess achieving such performance requires fusion across multiple stages of the reduction, which does not seem easy to implement in TIR.
OK, I am not trying to block the PR, merely saying it would be great to have such an investigation :)
@wpan11nv @yongfeng-nv can you suggest possible optimizations that could be done?
We don't know the details, but will look into it.
The CUDA schedule emits 4 kernels, which causes a lot of I/O overhead. Ideally, we could emit a single kernel for small reduction sizes (e.g. reduction dim n <= 1024).
See #5600 for improving softmax with warp shuffle.
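For the record, a minimal CUDA sketch of that single-kernel idea: one warp per row, with the max, sum, and normalize passes all kept in one kernel via warp-shuffle reductions. The kernel name `softmax_warp`, the one-warp-per-row mapping, and the float-only path are illustrative assumptions, not what #5600 actually implements:

```cpp
#include <cuda_runtime.h>
#include <math.h>

// One block per row, one warp (32 threads) per reduction. For small
// reduction dims, all three passes stay in one kernel and in registers,
// avoiding the global-memory round trips of a 4-kernel schedule.
__global__ void softmax_warp(const float* x, float* y, int n) {
  int row = blockIdx.x;
  int lane = threadIdx.x;  // 0..31
  const float* in = x + (long long)row * n;
  float* out = y + (long long)row * n;

  // 1) Row max, reduced across the warp with shuffles.
  float m = -INFINITY;
  for (int i = lane; i < n; i += 32) m = fmaxf(m, in[i]);
  for (int off = 16; off > 0; off >>= 1)
    m = fmaxf(m, __shfl_down_sync(0xffffffff, m, off));
  m = __shfl_sync(0xffffffff, m, 0);  // broadcast lane 0's result

  // 2) Row sum of exp(x - max).
  float s = 0.f;
  for (int i = lane; i < n; i += 32) s += expf(in[i] - m);
  for (int off = 16; off > 0; off >>= 1)
    s += __shfl_down_sync(0xffffffff, s, off);
  s = __shfl_sync(0xffffffff, s, 0);

  // 3) Normalize.
  for (int i = lane; i < n; i += 32) out[i] = expf(in[i] - m) / s;
}

// Launch with one warp per row: softmax_warp<<<num_rows, 32>>>(x, y, n);
```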
Using cuDNN can improve softmax performance on NVIDIA GPUs.
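For reference, the cuDNN entry point being wrapped is `cudnnSoftmaxForward`. A rough sketch of the call, assuming a handle already exists, error checking is omitted, and `desc` was set up as in the descriptor snippet above:

```cpp
#include <cudnn.h>

// Sketch of the underlying cuDNN call (assuming desc describes both the
// input and the output, and x/y are device pointers of matching shape).
void SoftmaxForward(cudnnHandle_t handle, cudnnTensorDescriptor_t desc,
                    const float* x, float* y) {
  const float alpha = 1.0f, beta = 0.0f;
  // CUDNN_SOFTMAX_ACCURATE subtracts the row max before exponentiating,
  // matching the numerically stable softmax that TVM's topi op computes.
  // With an (N, C, 1, 1) descriptor, MODE_INSTANCE normalizes over C,
  // i.e. over the flattened last axis.
  cudnnSoftmaxForward(handle, CUDNN_SOFTMAX_ACCURATE,
                      CUDNN_SOFTMAX_MODE_INSTANCE,
                      &alpha, desc, x, &beta, desc, y);
}
```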
@yzhliu @Laurawly @ZihengJiang