add support for unchecked math #59148

lcnr · 2019-03-12T23:54:04Z

add compiler support for

/// Returns the result of an unchecked addition, resulting in
/// undefined behavior when `x + y > T::max_value()` or `x + y < T::min_value()`.
pub fn unchecked_add<T>(x: T, y: T) -> T;

/// Returns the result of an unchecked subtraction, resulting in
/// undefined behavior when `x - y > T::max_value()` or `x - y < T::min_value()`.
pub fn unchecked_sub<T>(x: T, y: T) -> T;

/// Returns the result of an unchecked multiplication, resulting in
/// undefined behavior when `x * y > T::max_value()` or `x * y < T::min_value()`.
pub fn unchecked_mul<T>(x: T, y: T) -> T;

cc rust-lang/rfcs#2508

rust-highfive · 2019-03-12T23:54:08Z

r? @eddyb

(rust_highfive has picked a reviewer for you, use r? to override)

lachlansneff · 2019-03-13T03:49:13Z

What's the purpose of these? Wrapping math is zero-overhead on every supported architecture, I believe.

scottmcm · 2019-03-13T05:24:07Z

Previous attempt: #52205

@lachlansneff It's not zero-overhead for optimizations, though. x.wrapping_add(1) < x can be true, but x.nowrap_add(1) < x is always false.
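A small sketch of that point, using only stable integer methods. `nowrap_add` above is a placeholder name, so the helper functions here are hypothetical and `checked_add` stands in for "the addition did not overflow":

```rust
/// Returns true when `x + 1` compares less than `x` under wrapping arithmetic.
fn wrapped_increment_is_smaller(x: u8) -> bool {
    x.wrapping_add(1) < x
}

/// Same comparison, but overflow is reported as `None` instead of wrapping.
fn checked_increment_is_smaller(x: u8) -> Option<bool> {
    x.checked_add(1).map(|next| next < x)
}

fn main() {
    // With wrapping semantics the comparison is true at the maximum value,
    // so the optimizer cannot fold it to `false`.
    assert!(wrapped_increment_is_smaller(u8::MAX));
    assert!(!wrapped_increment_is_smaller(41));

    // Whenever the addition does not overflow, the comparison is false.
    // An unchecked add would let the compiler assume this is the only case.
    assert_eq!(checked_increment_is_smaller(41), Some(false));
    assert_eq!(checked_increment_is_smaller(u8::MAX), None);
}
```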

lcnr · 2019-03-13T09:22:28Z

@lachlansneff This PR is partially inspired by this blog post: http://blog.llvm.org/2011/05/what-every-c-programmer-should-know.html

Signed integer overflow: If arithmetic on an 'int' type (for example) overflows, the result is undefined. One example is that "INT_MAX+1" is not guaranteed to be INT_MIN. This behavior enables certain classes of optimizations that are important for some code. For example, knowing that INT_MAX+1 is undefined allows optimizing "X+1 > X" to "true". Knowing the multiplication "cannot" overflow (because doing so would be undefined) allows optimizing "X*2/2" to "X". While these may seem trivial, these sorts of things are commonly exposed by inlining and macro expansion. A more important optimization that this allows is for "<=" loops like this:

for (i = 0; i <= N; ++i) { ... }

In this loop, the compiler can assume that the loop will iterate exactly N+1 times if "i" is undefined on overflow, which allows a broad range of loop optimizations to kick in. On the other hand, if the variable is defined to wrap around on overflow, then the compiler must assume that the loop is possibly infinite (which happens if N is INT_MAX) - which then disables these important loop optimizations. This particularly affects 64-bit platforms since so much code uses "int" as induction variables.
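The `X*2/2` case from the quoted post can be reproduced in Rust with explicit wrapping arithmetic. This is only an illustration of why the rewrite to `X` is invalid when overflow wraps; the function name is made up for the example:

```rust
/// Computes `x * 2 / 2` with a wrapping multiplication.
fn double_then_halve_wrapping(x: i32) -> i32 {
    x.wrapping_mul(2) / 2
}

fn main() {
    // Small values round-trip, so `x * 2 / 2 == x` holds here.
    assert_eq!(double_then_halve_wrapping(21), 21);

    // `i32::MAX * 2` wraps to -2, and -2 / 2 is -1, not `i32::MAX`.
    // The simplification to `x` is therefore only valid if overflow cannot happen.
    assert_eq!(double_then_halve_wrapping(i32::MAX), -1);

    // The checked variant confirms that this multiplication overflows.
    assert_eq!(i32::MAX.checked_mul(2), None);
}
```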

eddyb · 2019-03-13T09:58:06Z

@lcnr Yes, but that's caused by C making the wrong default (int) easier to write. In Rust, the types force you to index with usize, which optimizes well without UB.

I'd defer to @rust-lang/wg-unsafe-code-guidelines and @pcwalton for more qualified opinions on UB but I doubt we need/want this.

lcnr · 2019-03-13T15:44:06Z

@eddyb While I don't think the performance increase will be that relevant, I will run some quick benchmarks with all additions replaced by add nsw nuw to see whether it makes a difference.

strega-nil · 2019-03-13T17:52:19Z

I simultaneously have two opinions here:

this is not an optimization that should ever be necessary with well-written code.
there's no real reason not to have these, and for specific code it may make a difference, however unlikely that might be.

hanna-kruppe · 2019-03-13T17:56:15Z

My main reason for not wanting these is that I fear some people will cargo-cult them in the belief that it makes their program faster when really they just escalate all integer overflow bugs in their code to instant UB.

strega-nil · 2019-03-13T18:28:29Z

@rkruppe personally, I'm far more worried about existing utilities in the standard library, like transmute.

lcnr · 2019-03-17T14:07:30Z

I have looked at the speedup of ./x.py bench when all add, sub and mul are replaced with their unchecked variants. These benchmarks are obviously not very conclusive, and the differences can be partially attributed to random noise:

The repo where the default add, sub and mul are replaced with their unchecked counterparts can be found here, in case anyone wants to do some tests on their own.

file pastebin

lcnr · 2019-03-17T15:01:51Z

The following functions/benchmarks seem like they are actually affected by this.

char: to_digit
iter: by_ref().sum() (the speed of sum without by_ref() remains equal)
str: find, contains, match_indices, split when used with str, probably in str::pattern::TwoWaySearcher

eddyb · 2019-03-18T07:43:34Z

iter: by_ref().sum() (the speed of sum without by_ref() remains equal)

This one, at least, is definitely wrong, sum should, on overflow:

panic (in debug mode)
wrap around (in release mode)

Otherwise, something as simple as [255u8, 1].iter().cloned().by_ref().sum() becomes UB.

Some of the others may be too - if there are any that aren't (wrong optimizations, that is), you should instead open issues about missed optimizations.
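For reference, the two defined outcomes for that input can be written out explicitly with stable methods. This sketch avoids calling `sum()` itself (which would panic in a debug build) and uses two hypothetical helpers instead:

```rust
/// Sums bytes with wrap-around on overflow.
fn wrapping_sum(values: &[u8]) -> u8 {
    values.iter().fold(0u8, |acc, &v| acc.wrapping_add(v))
}

/// Sums bytes and reports overflow as `None`.
fn checked_sum(values: &[u8]) -> Option<u8> {
    values.iter().try_fold(0u8, |acc, &v| acc.checked_add(v))
}

fn main() {
    let values = [255u8, 1];

    // Wrap-around: 255 + 1 wraps to 0.
    assert_eq!(wrapping_sum(&values), 0);

    // Overflow detected: this is the case that panics in a debug build.
    assert_eq!(checked_sum(&values), None);

    // Inputs that fit in `u8` give the same answer either way.
    assert_eq!(wrapping_sum(&[1, 2, 3]), 6);
    assert_eq!(checked_sum(&[1, 2, 3]), Some(6));
}
```

Both results are well defined; replacing the addition inside `sum` with an unchecked one would remove that guarantee for safe callers.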

gnzlbg · 2019-03-28T09:55:53Z

These are only exposed in core::intrinsics as unsafe fns right? AFAICT core::intrinsics is perma unstable, so I don't think people will assume that these are intended for "general" usage. I don't see much harm in exposing them here, and I've wanted these a couple of times when diagnosing perf issues and filing bugs for SIMD vectors, so I can imagine these might be useful for primitive types as well.

One thing that one can do with them is implement saturated/checked/wrapping arithmetic in Rust itself on top of these intrinsics only, instead of using the LLVM specific ones. This is not a very practical thing to do, but if the intent is to experiment, I think that's fine.

hanna-kruppe · 2019-03-28T18:46:44Z

These are only exposed in core::intrinsics as unsafe fns right? AFAICT core::intrinsics is perma unstable, so I don't think people will assume that these are intended for "general" usage.

Normally when people request a feature they want to use it in their project, and most people want it to have a trajectory towards (eventual) stability. I don't want to put words in anyone's mouth, but I see no reason to expect that these will stay perma-unstable if they're added, except by accident because everyone forgets about them.

One thing that one can do with them is implement saturated/checked/wrapping arithmetic in Rust itself on top of these intrinsics only, instead of using the LLVM specific ones.

I don't understand this at all. For wrapping arithmetic you can't do better than remove the UB on overflow and let the processor do its thing. For checked and saturating arithmetic, even if they're implemented as a code sequence that "looks before it leaps" and thus could have a non-overflowing operation embedded in it, I don't see how the optimizations that need UB-on-overflow would be applicable.

sanxiyn · 2019-04-04T04:28:20Z

As far as I can tell, this is not waiting on technical review but waiting on decision from the relevant team. (By the way, what is the relevant team here?)

gnzlbg · 2019-04-04T07:35:15Z

@sanxiyn these are perma unstable core intrinsics, so probably the compiler team is the relevant team here. It wouldn't hurt for the lang and lib teams to know that these might become available.

lcnr · 2019-04-04T08:11:44Z

After reading the conversation here and some personal thought, I am actually slightly against adding these intrinsics to the language. The most important reason for me is that even only emitting the unchecked versions during codegen does not lead to many performance improvements. All noticeable improvements I've looked at were due to enabling UB after these changes.

Personally I would like to only keep the undefined add/sub/mul for codegen and emit them for operations which are guaranteed to not overflow thanks to some kind of range analysis. In case someone wants to use these intrinsics, it should then be possible with the following code:

// this currently does not generate add nuw instead of a simple add instruction,
// requires some future optimizations
// `unreachable_unchecked` is an unsafe fn, so the call needs an unsafe block
unsafe { a.checked_add(b).unwrap_or_else(|| core::hint::unreachable_unchecked()) }
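A self-contained version of that idea might look like the following. The function name is hypothetical, and nothing here claims that the compiler turns this into `add nuw`; it only shows how the no-overflow promise can be expressed with stable APIs:

```rust
use std::hint::unreachable_unchecked;

/// Adds two integers while promising the compiler that the sum does not overflow.
///
/// # Safety
/// The caller must guarantee that `a + b` fits in `u32`.
/// Calling this with an overflowing pair is undefined behavior.
unsafe fn add_assume_no_overflow(a: u32, b: u32) -> u32 {
    match a.checked_add(b) {
        Some(sum) => sum,
        // SAFETY: the caller guarantees that the addition cannot overflow,
        // so this branch is never taken.
        None => unsafe { unreachable_unchecked() },
    }
}

fn main() {
    // SAFETY: 2 + 3 fits in `u32`.
    let sum = unsafe { add_assume_no_overflow(2, 3) };
    assert_eq!(sum, 5);
}
```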

Centril · 2019-04-28T11:56:28Z

Adding T-Lang + T-Libs for the question of "do we want to expose this to users?" and T-Compiler for "do we want to use this internally for something else?" and nominating for all teams.

alexcrichton · 2019-05-01T17:54:24Z

The libs team discussed this during triage and concluded that we see no issues including these eventually in the standard library. Stabilization would certainly be a different story, but adding them eventually was not objected to by anyone.

scottmcm · 2019-05-01T19:30:55Z

this is not an optimization that should ever be necessary with well-written code.

I'll note that we're already doing weird things to hack around the lack of these, for example

rust/src/libcore/iter/range.rs

Lines 170 to 185 in 6cc24f2

    
    fn next(&mut self) -> Option<A> {
        if self.start < self.end {
            // We check for overflow here, even though it can't actually
            // happen. Adding this check does however help llvm vectorize loops
            // for some ranges that don't get vectorized otherwise,
            // and this won't actually result in an extra check in an optimized build.
            if let Some(mut n) = self.start.add_usize(1) {
                mem::swap(&mut n, &mut self.start);
                Some(n)
            } else {
                None
            }
        } else {
            None
        }
    }

I'd much rather see that code as "I'm using add_unchecked(1) here for optimization and here's why it's safe" than the current "this is super weird but happens to work right now".

Even if these intrinsics never stabilize, I wouldn't be surprised to find a handful of places they're worth it in super-core library pieces like Range above or maybe things like slice iterators.
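As a rough illustration of the "here's why it's safe" style, here is a simplified stand-in for the range iterator. `HalfOpen` is a hypothetical type, not the real `Range` implementation, and the unchecked step is approximated with `checked_add` plus `unreachable_unchecked` rather than the intrinsic from this PR:

```rust
use std::hint::unreachable_unchecked;

/// Hypothetical half-open range over `usize`, standing in for `Range<usize>`.
struct HalfOpen {
    start: usize,
    end: usize,
}

impl Iterator for HalfOpen {
    type Item = usize;

    fn next(&mut self) -> Option<usize> {
        if self.start < self.end {
            let current = self.start;
            self.start = match current.checked_add(1) {
                Some(next) => next,
                // SAFETY: `current < end <= usize::MAX`, so `current + 1`
                // cannot overflow and this branch is never taken.
                None => unsafe { unreachable_unchecked() },
            };
            Some(current)
        } else {
            None
        }
    }
}

fn main() {
    let collected: Vec<usize> = HalfOpen { start: 3, end: 6 }.collect();
    assert_eq!(collected, vec![3, 4, 5]);

    // The invariant also holds right below the maximum value.
    let mut near_max = HalfOpen { start: usize::MAX - 1, end: usize::MAX };
    assert_eq!(near_max.next(), Some(usize::MAX - 1));
    assert_eq!(near_max.next(), None);
}
```

The safety argument lives next to the one operation it justifies, which is the readability benefit being argued for here.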

scottmcm · 2019-06-02T19:47:16Z

@lcnr Based on the commits now showing in here, it looks like something went awry in resolving the conflicts? Maybe try the rebase again?

RalfJung · 2019-06-02T19:48:13Z

And in particular, please rebase, don't merge.

lcnr · 2019-06-02T19:52:59Z

this was the weirdest rebase I have ever done...
seems like it is somewhat sorted now, even if it still seems like I am adding fn not to llvm/builder.
Should be fine otherwise :D

Edit: nevermind, I just didn't fully understand the changes made by @eddyb

rust-highfive · 2019-06-02T20:39:09Z

The job x86_64-gnu-llvm-6.0 of your PR failed on Travis (raw log). Through arcane magic we have determined that the following fragments from the build log may contain information about the problem.

travis_time:end:1bf9e0d9:start=1559506424020022212,finish=1559506426107512035,duration=2087489823
$ git checkout -qf FETCH_HEAD
travis_fold:end:git.checkout

Encrypted environment variables have been removed for security reasons.
See https://docs.travis-ci.com/user/pull-requests/#pull-requests-and-security-restrictions
$ export SCCACHE_BUCKET=rust-lang-ci-sccache2
$ export SCCACHE_REGION=us-west-1
$ export GCP_CACHE_BUCKET=rust-lang-ci-cache
$ export AWS_ACCESS_KEY_ID=AKIA46X5W6CZEJZ6XT55

I'm a bot! I can only do what humans tell me to, so if this was not helpful or you have suggestions for improvements, please ping or otherwise contact @TimNN. (Feature Requests)

lcnr · 2019-06-03T10:04:00Z

@eddyb should be ready for review

lcnr · 2019-06-03T19:33:14Z

Discussed in the compiler team triage meeting. General consensus was that we're ok landing this, it shouldn't add a big maintenance burden etc. Obviously, we would want to see tests that these are unstable and not usable outside rustc.

Added tests for both unsafe and unstable.
@nikomatsakis @eddyb

eddyb · 2019-06-03T19:48:28Z

@bors r+

bors · 2019-06-03T19:48:29Z

📌 Commit d7e0834 has been approved by eddyb

bors · 2019-06-03T22:06:02Z

⌛ Testing commit d7e0834 with merge e22b7a3...

bors · 2019-06-04T01:02:44Z

☀️ Test successful - checks-travis, status-appveyor
Approved by: eddyb
Pushing e22b7a3 to master...

rust-highfive assigned eddyb Mar 12, 2019

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Mar 12, 2019

sanxiyn added S-waiting-on-team Status: Awaiting decision from the relevant subteam (see the T-<team> label). and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Apr 4, 2019

alexcrichton removed the T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. label May 1, 2019

scottmcm reviewed May 1, 2019

src/librustc_codegen_llvm/builder.rs (outdated)

lcnr force-pushed the unchecked_maths branch from 5af91f3 to 1ee8a12 Compare June 2, 2019 19:49

lcnr force-pushed the unchecked_maths branch from 1ee8a12 to c676a77 Compare June 2, 2019 20:54

tesuji reviewed Jun 3, 2019

src/librustc_codegen_llvm/llvm/ffi.rs (outdated)

lcnr force-pushed the unchecked_maths branch from c676a77 to 5c0522c Compare June 3, 2019 10:56

lcnr added 3 commits June 3, 2019 12:59

add support for unchecked math

d6266a7

add unchecked math intrinsics

4e7319c

add codegen test for unchecked math

8a25fdb

lcnr force-pushed the unchecked_maths branch from 5c0522c to 8a25fdb Compare June 3, 2019 11:01

add ui tests for unchecked math

d7e0834

eddyb approved these changes Jun 3, 2019

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Jun 3, 2019

bors added the merged-by-bors This PR was explicitly merged by bors. label Jun 4, 2019

bors merged commit d7e0834 into rust-lang:master Jun 4, 2019

scottmcm mentioned this pull request Jun 17, 2019

Pre-RFC: Unchecked arithmetic rust-lang/rfcs#2508

Closed

Centril mentioned this pull request Aug 26, 2019

Add unchecked math inherant impls to integers #63923

Closed

CAD97 mentioned this pull request Sep 3, 2019

Redesign the std::iter::Step trait #62886

Closed

lcnr deleted the unchecked_maths branch April 2, 2020 15:01

matthiaskrgr mentioned this pull request Mar 25, 2024

Using derive(PartialEq) on an enum with a variant that accepts(Box<dyn SomeTrait>) causes cryptic build error. #123056

Closed
