[RELAY][VM] Enable heterogeneous execution for Relay VM #6337
Conversation
I had a look, mostly at the Python sources.
Yay! I'm so excited for this! I'll do a deep dive today. There are a number of tests in tests/python/relay/dyn that skip running on GPU while waiting for this feature, e.g. https://github.com/apache/incubator-tvm/blob/942c90ba7a7b9bccf6d9bce43808aba2bd6c9787/tests/python/relay/dyn/test_dynamic_op_level3.py#L30-L31 Do you want to enable those as part of this PR? Or I can do it in a second PR.
@mbrookhart Thanks for the reminder. I just enabled all the dynamic op tests except for level 6, because topk has a problem on GPU for which I already have a TODO in test_any. We need to look into it later.
A few nitpicks. I'd like to see a little more documentation on the passes; I'm not sure I fully understand what you're doing just from looking at the code. But overall it looks really good, I'm excited!
Halfway through the PR. Will come back and review the rest.
lgtm
Thanks @zhiics @mbrookhart @leandron @jwfromm
Sorry for my delay! I've been out the last few days moving. Anyway, looking over what has changed since my last review, I'm happy to give it a post-merge approval; looks great! Thanks @icemelon9 and @zhiics. I'll start enabling the dynamic tests on GPU and work on fixing anything that fails (including topk).
* vm heterogeneous execution
* context analysis on module
* fix profiler
* fix memory plan
* add more unification
* add serialization
* add gpu tests for test_adt
* cache visited functions
* path compression
* C++ context analysis
* remove python context analysis
* add tests
* clean
* lint
* fix
* enable gpu test for dynamic namespace
* remove GetParamsContext
* fix comments and add doc for context analysis
* cache context
* cache allocator
* rebase and fix comments
Currently, dynamic models can only be executed on CPU. GPU execution is not allowed for these models because they contain shape functions that perform runtime type inference. These functions may contain various control logic to derive the shape of a tensor at runtime, and they are never compute intensive, so they are designed to execute on CPU. As a result, we must use the CPU to run these functions even when the rest of the model is targeted at another device. This PR enables heterogeneous execution for the Relay VM to support dynamic models on devices other than CPU.
More specifically, it includes the following changes:
Follow-up PRs will fix/add schedules for some ops to enable GPU execution for BERT and TF object detection models.
cc @icemelon9 @jroesch @mbrookhart @wweic
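The commit list mentions a context analysis pass with unification and path compression. Below is a toy sketch (not TVM's actual implementation, and all names here are hypothetical) of that union-find style idea: every value gets a device domain, ops that must share a device unify the domains of their inputs and outputs, and shape functions are pinned to CPU.

```python
# Toy union-find sketch of device/context analysis. This is an
# illustration of the technique only; TVM's real pass works on
# Relay IR and has many more cases.

class DeviceDomain:
    """A unification variable, optionally bound to a concrete device."""
    def __init__(self, device=None):
        self.device = device   # e.g. "cpu" or "gpu"; None = unconstrained
        self.parent = self     # union-find parent pointer

def find(d):
    # Walk to the root, compressing the path along the way.
    while d.parent is not d:
        d.parent = d.parent.parent
        d = d.parent
    return d

def unify(a, b):
    ra, rb = find(a), find(b)
    if ra is rb:
        return ra
    if ra.device is not None and rb.device is not None and ra.device != rb.device:
        raise ValueError(f"device conflict: {ra.device} vs {rb.device}")
    # Keep a bound domain (if any) as the new root so the constraint wins.
    root, child = (ra, rb) if ra.device is not None else (rb, ra)
    child.parent = root
    return root

# Example: out = add(x, y) is placed on GPU, so its inputs unify
# with the output's domain; the op's shape function stays on CPU.
x, y, out = DeviceDomain(), DeviceDomain(), DeviceDomain("gpu")
unify(x, out)
unify(y, out)
shape_func = DeviceDomain("cpu")   # shape functions are CPU-only

print(find(x).device)           # -> gpu
print(find(shape_func).device)  # -> cpu
```

The key property is that unconstrained domains inherit a device from any bound domain they are unified with, while two differently bound domains signal a placement conflict.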