
UCP/RNDV/CUDA: RNDV protocol improvements for CUDA #5473

Merged
merged 4 commits into from
Aug 13, 2020

Conversation

bureddy
Contributor

@bureddy bureddy commented Jul 25, 2020

What

Improve out-of-the-box behavior for CUDA transfers:

  • Try the GET protocol for CUDA by default, and fall back to the pipeline protocol if the GET protocol fails
  • Add an option to tune the sender-side pipelining scheme

Why?

The current default RNDV scheme for CUDA transfers is sender-side pipelining, which overcomes the GPUDirect RDMA performance limitation on architectures where GPUs are connected directly to a CPU socket.
However, most current GPU system architectures place the GPU and the NIC behind the same PCIe switch (e.g. all DGX systems and various vendor GPU server architectures). On these architectures we currently recommend that users set UCX_RNDV_SCHEME=get_zcopy explicitly in order to get optimal GPUDirect RDMA performance. We also lack an optimal fallback when the GET protocol fails (e.g. GPUs connected to different CPU sockets, as on DGX-1).

How?

  • Support a get-zcopy fallback: pre-compute GET zcopy lanes from the rkey, and fall back to the optimal scheme if no suitable lanes are found. Most of the lane pre-computation code is taken from RNDV/GET: try to push request to pending yosefe/ucx#16 (@hoopoepg)
  • Add the option UCX_RNDV_PIPELINE_SEND_THRESH to control the sender-side pipelining scheme if needed
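A usage sketch of the tuning knobs above (the application name and the threshold value are illustrative, not recommendations):

```shell
# Pre-PR workaround: force the GET protocol explicitly on systems where
# the GPU and NIC share a PCIe switch (e.g. DGX):
UCX_RNDV_SCHEME=get_zcopy ./app

# With this PR, GET is tried by default and falls back if it cannot be
# used; the new knob below controls when sender-side pipelining engages:
UCX_RNDV_PIPELINE_SEND_THRESH=512k ./app
```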

@Akshay-Venkatesh @yosefe

@azure-pipelines

Commenter does not have sufficient privileges for PR 5473 in repo openucx/ucx

@bureddy
Contributor Author

bureddy commented Jul 25, 2020

bot:pipe:retest

@bureddy
Contributor Author

bureddy commented Jul 26, 2020

bot:pipe:retest

1 similar comment
@bureddy
Contributor Author

bureddy commented Jul 27, 2020

bot:pipe:retest

@bureddy
Contributor Author

bureddy commented Jul 27, 2020

@hoopoepg @yosefe @brminich please review

ucp_rkey_h rkey; /* key for remote send buffer */
ucp_lane_map_t lanes_map_avail; /* used lanes map */
ucp_lane_map_t lanes_map_all; /* actual lanes map */
uint8_t lanes_count; /* actual lanes map */
Contributor

lanes count (in the comment)

@@ -66,12 +66,14 @@ void ucp_rndv_receive(ucp_worker_h worker, ucp_request_t *rreq,
const ucp_rndv_rts_hdr_t *rndv_rts_hdr);

static UCS_F_ALWAYS_INLINE int ucp_rndv_is_get_zcopy(ucs_memory_type_t mem_type,
size_t length,
Contributor

maybe pass just 2 params to this func: request and context?
The first 2 params are taken from the request, the others from the context.

src/ucp/proto/rndv.c (resolved)
freq->send.rndv_get.rreq = sreq;
ucp_rndv_req_init_get_zcopy_lane_map(freq);
Contributor

do you really need to re-calculate it for every fragment?

Contributor Author

I think so, because each fragment is tracked in a separate rndv frag request.

Contributor

can't we cache it somewhere, because it is supposed to be the same for all fragments?

Contributor Author

fixed. can you check it?

Contributor Author

@brminich can you please check?

@yosefe
Contributor

yosefe commented Jul 30, 2020

/azp run

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

@bureddy
Contributor Author

bureddy commented Jul 31, 2020

/azp run

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

@bureddy
Contributor Author

bureddy commented Aug 1, 2020

/azp run

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

Contributor

@brminich brminich left a comment

@hoopoepg, plz review


ucp_request_recv_buffer_reg(rreq, ep_config->key.rma_bw_md_map,
rndv_rts_hdr->size);

if ((rndv_mode == UCP_RNDV_MODE_PUT_ZCOPY) ||
Contributor

Looks like now we do not register the RX buffer for the PUT protocol in the case of UCP_RNDV_MODE_PUT_AUTO with non-CUDA memory.

Contributor Author

@bureddy bureddy Aug 3, 2020

Correct. This PR does not change the current behavior for HOST memory. Today, GET is used by default for HOST memory; it sends RTR without registering the RX buffer in order to switch to active-message rndv if it fails to do GET (https://github.com/openucx/ucx/blob/master/src/ucp/proto/rndv.c#L473).

Contributor Author

@brminich are we ok here?

Contributor

ok

@bureddy
Contributor Author

bureddy commented Aug 4, 2020

@Akshay-Venkatesh @yosefe can you please review?

@bureddy
Contributor Author

bureddy commented Aug 11, 2020

/azp run

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

src/ucp/core/ucp_context.c (outdated, resolved)
src/ucp/proto/rndv.c (outdated, resolved)

if ((lane_bw/max_lane_bw) <
(1. / context->config.ext.multi_lane_max_ratio)) {
lane_map &= ~UCS_BIT(lane_idx);
Contributor

Seems it can make rndv_req->send.rndv_get.rkey_index[i] invalid, because some lanes would be removed from the middle.

src/ucp/proto/rndv.c (3 outdated threads, resolved)
@bureddy
Contributor Author

bureddy commented Aug 13, 2020

bot:pipe:retest

@yosefe yosefe merged commit 5d914ec into openucx:master Aug 13, 2020