Context and Waker might be accidentally `Sync` #66481

Matthias247 · 2019-11-16T22:45:32Z

One issue that came up in the discussion here is that the Context type implements Send + Sync.

This might have been introduced accidentally. Send probably does not matter at all, given that users will only observe a Context by reference. However Sync has the impliciation that we will not be able to add non thread-safe methods to Context - e.g. in order to optimize thread-local wake-ups again.

It might be interesting to see whether Send and Sync support could be removed from the type. Unfortunately that is however a breaking change - even though it is not likely that any code currently uses Context out of the direct poll() path.

In a similar fashion it had been observed in the implementation of #65875 (comment) that the Waker type is also Send and Sync. While it had been expected for Send- given that Wakers are used to wake-up tasks from different threads, it might not have been for Sync. One downside of Wakers being Sync is that it prevents optimizations. E.g. in linked ticket the original implementation contained an optimization that while a Waker was not cloned (and thereby equipped with a different vtable) it could wake-up the local eventloop again by just setting a non-synchronized boolean flag in the current thread. However given that &Waker could be transferred to another thread, and .wake_by_ref() called from there even within the .poll() context - this optmization is invalid.

Here it would also be interesting if the Sync requirement could be removed. I expect the amount of real-world usages to be in the same ballpark as sending &Context across threads - hopefully 0.
But it's again a breaking change 😢

cc @Ralith , @Nemo157 , @cramertj , @withoutboats

The text was updated successfully, but these errors were encountered:

withoutboats · 2019-11-17T14:49:22Z

Even if this weren't a breaking change, I'd want to see some really good evidence these optimizations would matter for real world code before choosing to make these types non-Sync. I don't think getting rid of an atomic update in thread::block_on is a compelling example. Basically, in my view the current API is intentional and correct.

Ralith · 2019-11-17T18:38:53Z

Context was introduced to leave room for future unforeseen requirements, and being Sync restricts its usefulness as such, which is unfortunate. I can see a case for Waker: Sync since we have wake_by_ref, but it's difficult to imagine a case where it makes sense to be accessing a single Context from multiple threads concurrently.

worktycho · 2019-11-17T19:22:54Z

I have an example of where the current API is resulting in performance issues: I am trying to implement a combined Executor/Reactor around a winit event loop. To wake up the event loop from a different thread we have to post a message to the event loop which can be a non-trivial operation. But on the same thread, we know that we don't need to wake up the eventloop, so we can use much cheaper mechanisms like an atomic bool.

withoutboats · 2019-11-17T20:41:50Z

@worktycho But that would be true if wakers were just Send also, which was an intentional part of the simplification from the previous design.

Ralith · 2019-11-17T21:44:11Z

I think the point is that Context being Send + Sync precludes the otherwise noninvasive restoration of a more efficient LocalWaker through an additional method on Context.

Matthias247 · 2019-12-01T23:40:10Z

Even if this weren't a breaking change, I'd want to see some really good evidence these optimizations would matter for real world code before choosing to make these types non-Sync. I don't think getting rid of an atomic update in thread::block_on is a compelling example.

I don't think it should be the judgement of the libs team (or any of us) to determine whether something is good enough and not needs to be further optimized for any use-case. Rusts goal as a language is to enable zero cost abstractions. This requires in my opinion to not be opinionated on any design choices which have a performance/footprint impact. I think some of the decisions that have been taken in the async/await world however are opinionated, and will have an impact on use-cases that currently are still better implemented with different mechanisms.

I do not really want to elaborate on anything more particular, because I fear this would bring the discussion back to a arguing whether any X% performance degradation is still good enough. That is an OK decision to have for a software project whichs needs to decide whether Rusts async/await support is good enough for them, and whether they have to apply workarounds or not. But it's not a discussion which will move the issue here any more forward, because for yet another project the outcoming of the discussion might be different.

PS: I do think it's okay and good if libraries like tokio or async-std are being more opinionated about what threading models they support, and they might sacrifice performance gains in rare usage scenarios for easy of use. But core and language features are different, and we expect people to use them also for niche use-cases (e.g. there is certainly a lot of cool evaluation going on with the use of async/await in embedded contexts or kernels - where requirements might be very different than for a webserver based on tokio).

withoutboats · 2019-12-03T15:06:29Z

I don't think it should be the judgement of the libs team (or any of us) to determine whether something is good enough and not needs to be further optimized for any use-case. Rusts goal as a language is to enable zero cost abstractions.

This is a trade off: Context is either Sync or it isn't, some users benefit from one choice (they can use references to Context as threadsafe) and some users benefit from the other choice (they can have nonthreadsafe constructs inside the Context construct). Ultimately the libs team has to decide one way or the other on these points in the API where there is a trade off between thread safety and not.

However, this decision has already been made and it would be a breaking change to change it, so this discussion is moot.

kabergstrom · 2019-12-03T17:21:15Z

(they can use references to Context as threadsafe)

Do you have an example where this is actually done within the ecosystem, or a use-case for it? As far as I can tell, this would require spawning a thread using something like crossbeam_utils::thread::scope and capturing the Context from within a Future::poll?

withoutboats · 2019-12-03T17:41:29Z

A common way a user could depend on a type being Sync is to store it in an Arc and send it across threads. This isn't very likely for Context but it is a very reasonable pattern for Waker.

withoutboats · 2019-12-03T18:12:10Z

I think the point is that Context being Send + Sync precludes the otherwise noninvasive restoration of a more efficient LocalWaker through an additional method on Context.

I think its a fair point that we didnt intentionally make context send and sync, and that this precludes adding more fields to context that are not send and sync, and that since context is just a buffer for future changes, it would probably have been better to make it non-threadsafe. But now its a breaking change. If crater showed no regressions and there's no known patterns it nullifies, I would be open to changing this about Context personally, but I am somewhat doubtful the libs team as a whole would approve making a breaking change to support hypothetical future extensions.

If anyone on the thread wants to pursue a change to context (not waker), they should get crater results as the next step of the conversation.

Matthias247 · 2019-12-09T05:27:41Z

A common way a user could depend on a type being Sync is to store it in an Arc and send it across threads. This isn't very likely for Context but it is a very reasonable pattern for Waker.

There is no discussion about Waker being Send or not - it obviously has to be. The question is purely about Sync. And in order to do what you describe you have to store an Arc<&Waker> and send it somewhere. Which is very doubtful to happen, due to the not very useful lifetime and due to Waker already being an Arc like thing internally.

As @kabergstrom mentioned, the most likely way to see this behavior is some scoped thread API being used inside poll - or people doing some very advanced synchronization against an IO driver running in another thread. For all those there are certainly better ways than to rely on Waker being Sync. E.g. to return a flag from the synchronized section and call waker.wake_by_ref() from the original poll() thread when done. Or to .clone() and send the Waker if there is doubt whether it needs to be persisted somewhere else.

And yes, the impact on Context is even bigger. It was meant as an extension point. But we can not add any functions to it which are not thread-safe. E.g. if we want to add methods which spawn another task on the same thread as the current executor thread, and which allows for spawning !Send futures (like Tokios LocalSet) - we would have an issue.

withoutboats · 2019-12-10T13:40:59Z

As @kabergstrom mentioned, the most likely way to see this behavior is some scoped thread API being used inside poll - or people doing some very advanced synchronization against an IO driver running in another thread. For all those there are certainly better ways than to rely on Waker being Sync. E.g. to return a flag from the synchronized section and call waker.wake_by_ref() from the original poll() thread when done. Or to .clone() and send the Waker if there is doubt whether it needs to be persisted somewhere else.

Why would these be "certainly better" than just calling wake_by_ref from the scoped threads?

withoutboats · 2019-12-10T17:29:38Z

To be a little more expansive: the scoped threadpool you're talking about could very well not be scoped inside a poll method - rather, the waker could be cloned once and then owned by one thread and referenced by many other scoped threads in some sort of threadpool construct for CPU bound tasks. This seems like a perfectly valid implementation which allows you to divide up the work among many threads without cloning the waker many times. This is potentially an optimization.

I don't think this optimization is very important, but I don't think the optimizations allowed by making waker Send + !Sync are very important either. The point is that there's a trade off between the optimizations allowed by assuming references to wakers can be shared across threads and the optimizations allowed by assuming they can't be, its not the case that one side is inherently the "zero cost" side.

KodrAus · 2021-01-27T23:55:19Z

We discussed this at the recent Libs meeting and felt that deferring to @rust-lang/wg-async-foundations would make sense here.

yoshuawuyts · 2021-02-16T11:28:58Z

Though I don't have the bandwidth to carry this, this definitely seems interesting. Seeing projects such as glommio and closed-source initiatives lead me to believe that there may actually be a decent case to enable some form of Context -> LocalWaker to avoid the synchronization overhead on single-threaded runtimes.

I don't know if we should make this a priority, but at least we probably shouldn't close it just yet.

Matthias247 · 2021-02-16T19:05:43Z

I think bringing LocalWaker back would be a nice additional improvement. But for the moment it would be nice to just fix the general Context sync-ness, which blocks all other fixes and improvements.

cramertj · 2021-03-02T21:47:35Z

As the person who originally introduced LocalWaker, it was very much a conscious decision to get rid of it and to make Waker and Context Sync. This was called out explicitly in the RFC.

Ralith · 2021-03-02T22:01:17Z

The cited discussion in the RFC justifies making Waker Send, and predates the decision to (re)introduce Context. Making Context !Send + !Sync doesn't defeat the objectives discussed there. In particular, it does not impact ergonomics for the common case at all.

dhardy · 2023-01-11T17:52:44Z

This was recently closed, however the issue mentions both Context and Waker. The recent change only affects Context.

There is good reason that Waker is Send, however after reading this issue I'm unsure that there are good reasons why Waker is Sync.

withoutboats · 2023-01-12T10:23:25Z

(NOT A CONTRIBUTION)

Waker supports wake_by_ref, and so it is possible to pass &Waker to another thread and wake it from that thread. This functionality would not be possible without Waker being Sync.

Supporting wakers that are either not Send or not Sync will best be done by adding new APIs to Context and a new LocalWaker type.

dhardy · 2023-01-12T10:37:12Z

Forgive me the naive question, but how does a LocalWaker type work with Future? As I understand, Future::poll requires Waker, as a concrete type not a trait, so this would also require a LocalFuture trait? At this point we have two completely incompatible async systems...

~~... which means it's very likely not going to happen. I thought as much.~~

so it is possible to pass &Waker to another thread and wake it from that thread

Which is only useful if Waker is not cheap to clone, and has the burden of another lifetime bound. It surprises me that the Waker docs don't say anything about the intended cost of Waker::clone.

withoutboats · 2023-01-12T12:44:32Z

(NOT A CONTRIBUTION)

Forgive me the naive question, but how does a LocalWaker type work with Future? As I understand, Future::poll requires Waker, as a concrete type not a trait, so this would also require a LocalFuture trait? At this point we have two completely incompatible async systems...

Future::poll does not take Waker as an argument, Future::poll takes Context. Context could have the ability to set a LocalWaker, so that an executor could set this. Reactors which operate on the same thread as the future that polls them could migrate to using the LocalWaker argument instead of Waker. Here is a pre-RFC that someone wrote with a possible API: https://internals.rust-lang.org/t/pre-rfc-local-wakers/17962

... which means it's very likely not going to happen. I thought as much.

Commentary like this is both factually wrong and not helpful for the mood of the thread.

jonas-schievink added A-async-await Area: Async & Await C-bug Category: This is a bug. I-nominated T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. labels Nov 16, 2019

Centril added the T-lang Relevant to the language team, which will review and decide on the PR/issue. label Nov 16, 2019

withoutboats removed the T-lang Relevant to the language team, which will review and decide on the PR/issue. label Nov 17, 2019

nikomatsakis added the AsyncAwait-Triaged Async-await issues that have been triaged during a working group meeting. label Nov 19, 2019

sticnarf mentioned this issue Dec 31, 2019

task/future: support spawning locally tikv/yatp#24

Merged

Matthias247 mentioned this issue Nov 30, 2020

Rust Wakers need to be Send + Sync DataDog/glommio#194

Closed

sfackler removed the I-nominated label Feb 3, 2021

nikomatsakis mentioned this issue Mar 24, 2021

per-thread executors rust-lang/wg-async#87

Open

4 tasks

jihiggins mentioned this issue Apr 12, 2022

Add PhantomData marker to Context to make Context !Send and !Sync #95985

Merged

bors closed this as completed in 722bc0c Jan 3, 2023

tvallotton mentioned this issue Mar 16, 2023

Add LocalWaker support rust-lang/libs-team#191

Open

tvallotton mentioned this issue Dec 15, 2023

Tracking Issue for LocalWaker #118959

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Context and Waker might be accidentally `Sync` #66481

Context and Waker might be accidentally `Sync` #66481

Matthias247 commented Nov 16, 2019 •

edited

Loading

withoutboats commented Nov 17, 2019

Ralith commented Nov 17, 2019

worktycho commented Nov 17, 2019

withoutboats commented Nov 17, 2019

Ralith commented Nov 17, 2019

Matthias247 commented Dec 1, 2019

withoutboats commented Dec 3, 2019

kabergstrom commented Dec 3, 2019 •

edited

Loading

withoutboats commented Dec 3, 2019

withoutboats commented Dec 3, 2019

Matthias247 commented Dec 9, 2019

withoutboats commented Dec 10, 2019

withoutboats commented Dec 10, 2019 •

edited

Loading

KodrAus commented Jan 27, 2021

yoshuawuyts commented Feb 16, 2021

Matthias247 commented Feb 16, 2021

cramertj commented Mar 2, 2021

Ralith commented Mar 2, 2021

dhardy commented Jan 11, 2023

withoutboats commented Jan 12, 2023

dhardy commented Jan 12, 2023 •

edited

Loading

withoutboats commented Jan 12, 2023

Context and Waker might be accidentally Sync #66481

Context and Waker might be accidentally Sync #66481

Comments

Matthias247 commented Nov 16, 2019 • edited Loading

withoutboats commented Nov 17, 2019

Ralith commented Nov 17, 2019

worktycho commented Nov 17, 2019

withoutboats commented Nov 17, 2019

Ralith commented Nov 17, 2019

Matthias247 commented Dec 1, 2019

withoutboats commented Dec 3, 2019

kabergstrom commented Dec 3, 2019 • edited Loading

withoutboats commented Dec 3, 2019

withoutboats commented Dec 3, 2019

Matthias247 commented Dec 9, 2019

withoutboats commented Dec 10, 2019

withoutboats commented Dec 10, 2019 • edited Loading

KodrAus commented Jan 27, 2021

yoshuawuyts commented Feb 16, 2021

Matthias247 commented Feb 16, 2021

cramertj commented Mar 2, 2021

Ralith commented Mar 2, 2021

dhardy commented Jan 11, 2023

withoutboats commented Jan 12, 2023

dhardy commented Jan 12, 2023 • edited Loading

withoutboats commented Jan 12, 2023

Context and Waker might be accidentally `Sync` #66481

Context and Waker might be accidentally `Sync` #66481

Matthias247 commented Nov 16, 2019 •

edited

Loading

kabergstrom commented Dec 3, 2019 •

edited

Loading

withoutboats commented Dec 10, 2019 •

edited

Loading

dhardy commented Jan 12, 2023 •

edited

Loading