Base64Encoder mini changes #28888

gfoidl · 2018-04-06T15:30:35Z

Some minor changes to the Base64Encoder.

I missed the review of #24888, so it goes here.

gfoidl

Notes for review

gfoidl · 2018-04-06T15:31:12Z

src/System.Memory/src/System/Buffers/Text/Base64Encoder.cs

@@ -108,7 +108,7 @@ public static OperationStatus EncodeToUtf8(ReadOnlySpan<byte> bytes, Span<byte>
        [MethodImpl(MethodImplOptions.AggressiveInlining)]
        public static int GetMaxEncodedToUtf8Length(int length)
        {
-            if (length < 0 || length > MaximumEncodeLength)
+            if ((uint)length > MaximumEncodeLength)


The case length < 0 is covered due the cast to uint.

This hides intent, though (that negative lengths are invalid). And without unchecked, it would throw if the correct compiler/runtime flags are flipped.
My naive guess is that the compiler has some built-in way to perform this particular optimization anyways; "positive-but-less-than" is a common category.

length will just be reinterpreted as uint, so negative numbers are interpreted as very high (> int.MaxValue) numbers. MaximumEncodeLength is a constant (so no cast necessary) and on negative lengths the branch will be taken.

It's no compiler magic here, just interpretation of (raw-) values.

I don't think it hides intent, as used widely and as common trick to safe comparisons.

length will just be reinterpreted as uint, so negative numbers are interpreted as very high (> int.MaxValue) numbers.

Yeah. That's what causes the throw (checked here is a way to force the -checked compiler option) .

I don't think it hides intent, as used widely and as common trick to safe comparisons.

The bare-bones form is demonstrably not always safe.
The equivalent behavior in C++ is officially undefined (although pretty much all implementations will behave as anticipated, for multiple reasons)

@Clockwork-Muse, we do this optimization all over the place.

gfoidl · 2018-04-06T15:31:43Z

src/System.Memory/src/System/Buffers/Text/Base64Encoder.cs

@@ -135,7 +135,7 @@ public static OperationStatus EncodeToUtf8InPlace(Span<byte> buffer, int dataLen
            if (buffer.Length < encodedLength)
                goto FalseExit;

-            int leftover = dataLength - dataLength / 3 * 3; // how many bytes after packs of 3
+            int leftover = dataLength - (dataLength / 3) * 3; // how many bytes after packs of 3


Just to ensure that no compiler will "optimize" the / 3 * 3.

Parenthesis only ever clarify intent, and are essentially removed at the parse stage. "Optimizing" such a statement that way would completely break intent and integer math (consider what that process would do to dataLength / 4 * 2).

Now, for clarifying intent, it might be a benefit to have them there.

gfoidl · 2018-04-06T15:32:22Z

src/System.Memory/src/System/Buffers/Text/Base64Encoder.cs

@@ -228,6 +228,6 @@ private static int EncodeAndPadTwo(ref byte oneByte, ref byte encodingMap)

        private const byte EncodingPad = (byte)'='; // '=', for padding

-        private const int MaximumEncodeLength = (int.MaxValue >> 2) * 3; // 1610612733
+        private const int MaximumEncodeLength = (int.MaxValue / 4) * 3; // 1610612733


It's a constant, so it can be written "easier" to read.

Given this is a constant, the change is OK. However, in general, division by 4 is slower than bit shifting by 2.

public int UseBitShift(int value) { return (value >> 2) * 3; } public int UseDivision(int value) { return (value / 4) * 3; }

However, in general, division by 4 is slower than bit shifting by 2.

For int. For uint they're identical.

Commit migrated from dotnet/corefx@11a5782

Base64Encoder mini changes

53a0726

gfoidl commented Apr 6, 2018

View reviewed changes

ahsonkhan approved these changes Apr 6, 2018

View reviewed changes

stephentoub approved these changes Apr 6, 2018

View reviewed changes

stephentoub merged commit 11a5782 into dotnet:master Apr 6, 2018

karelz added this to the 2.1.0 milestone Apr 6, 2018

gfoidl deleted the base64 branch April 7, 2018 16:51

pjanotti pushed a commit to pjanotti/corefx that referenced this pull request Apr 8, 2018

Base64Encoder mini changes (dotnet#28888)

aef82cc

picenka21 pushed a commit to picenka21/runtime that referenced this pull request Feb 18, 2022

Base64Encoder mini changes (dotnet/corefx#28888)

3d1ff13

Commit migrated from dotnet/corefx@11a5782

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Base64Encoder mini changes #28888

Base64Encoder mini changes #28888

gfoidl commented Apr 6, 2018

gfoidl left a comment

gfoidl Apr 6, 2018

Clockwork-Muse Apr 6, 2018

gfoidl Apr 6, 2018 •

edited

Loading

Clockwork-Muse Apr 6, 2018

stephentoub Apr 6, 2018

gfoidl Apr 6, 2018

Clockwork-Muse Apr 6, 2018

gfoidl Apr 6, 2018

ahsonkhan Apr 6, 2018

stephentoub Apr 6, 2018

Base64Encoder mini changes #28888

Base64Encoder mini changes #28888

Conversation

gfoidl commented Apr 6, 2018

gfoidl left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

gfoidl Apr 6, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

gfoidl Apr 6, 2018 •

edited

Loading