Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[X64] [tannergooding] Change Vector2/3/4, Quaternion, Plane, Vector<T>, and Vector64/128/256/51 ... #400

Open
MihuBot opened this issue Jun 5, 2024 · 2 comments

Comments

@MihuBot
Copy link
Owner

MihuBot commented Jun 5, 2024

Job completed in 29 minutes.
dotnet/runtime#102301

Diffs

Diffs
Found 281 files with textual diffs.

Summary of Code Size diffs:
(Lower is better)

Total bytes of base: 39757987
Total bytes of diff: 39763503
Total bytes of delta: 5516 (0.01 % of base)
Total relative delta: 152.77
    diff is a regression.
    relative diff is a regression.


Total byte diff includes -614 bytes from reconciling methods
	Base had    6 unique methods,      768 unique bytes
	Diff had    5 unique methods,      154 unique bytes

Top file regressions (bytes):
        5152 : System.Private.CoreLib.dasm (0.07 % of base)
         307 : System.Numerics.Tensors.dasm (0.04 % of base)
          45 : System.Drawing.Primitives.dasm (0.11 % of base)
          12 : System.IO.Hashing.dasm (0.04 % of base)

4 total files with Code Size differences (0 improved, 4 regressed), 253 unchanged.

Top method regressions (bytes):
         540 (29.51 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Dot[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):ubyte (FullOpts)
         451 (58.12 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Dot(System.Runtime.Intrinsics.Vector256`1[ubyte],System.Runtime.Intrinsics.Vector256`1[ubyte]):ubyte (FullOpts)
         411 (78.59 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.ShiftLeft(System.Runtime.Intrinsics.Vector256`1[ubyte],int):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts)
         411 (78.59 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.ShiftRightArithmetic(System.Runtime.Intrinsics.Vector256`1[ubyte],int):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts)
         406 (82.35 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Dot[long](System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):long (FullOpts)
         294 (88.55 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[short],short):System.Runtime.Intrinsics.Vector256`1[short] (FullOpts)
         267 (126.54 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Dot(System.Runtime.Intrinsics.Vector256`1[long],System.Runtime.Intrinsics.Vector256`1[long]):long (FullOpts)
         266 (35.90 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Divide[int](System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
         237 (27.15 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Divide[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
         233 (36.46 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Divide[long](System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
         218 (16.02 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Divide[short](System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
         169 (81.25 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[int],int):System.Runtime.Intrinsics.Vector256`1[int] (FullOpts)
         165 (49.11 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[int],System.Runtime.Intrinsics.Vector256`1[int]):System.Runtime.Intrinsics.Vector256`1[int] (FullOpts)
         157 (25.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[short],System.Runtime.Intrinsics.Vector256`1[short]):System.Runtime.Intrinsics.Vector256`1[short] (FullOpts)
         155 (44.16 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Dot(System.Runtime.Intrinsics.Vector128`1[ubyte],System.Runtime.Intrinsics.Vector128`1[ubyte]):ubyte (FullOpts)
         133 (32.36 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[ubyte],System.Runtime.Intrinsics.Vector256`1[ubyte]):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts)
         132 (62.56 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Dot[long](System.Runtime.Intrinsics.Vector256`1[long],System.Runtime.Intrinsics.Vector256`1[long]):long (FullOpts)
         121 (41.72 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Multiply(System.Runtime.Intrinsics.Vector256`1[long],System.Runtime.Intrinsics.Vector256`1[long]):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts)
         118 (61.78 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.ShiftRightArithmetic(System.Runtime.Intrinsics.Vector256`1[double],int):System.Runtime.Intrinsics.Vector256`1[double] (FullOpts)
         117 (64.64 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Multiply[long](long,System.Runtime.Intrinsics.Vector256`1[long]):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts)
         117 (39.80 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[long],System.Runtime.Intrinsics.Vector256`1[long]):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts)
         117 (64.64 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Multiply(System.Runtime.Intrinsics.Vector256`1[long],long):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts)
         114 (44.71 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.ShiftLeft(System.Runtime.Intrinsics.Vector128`1[ubyte],int):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
         114 (44.71 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.ShiftRightArithmetic(System.Runtime.Intrinsics.Vector128`1[ubyte],int):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
         112 (Infinity of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:<ConvertToSingle>g__SoftwareFallback|45_0(System.Runtime.Intrinsics.Vector128`1[uint]):System.Runtime.Intrinsics.Vector128`1[float] (FullOpts) (0 base, 1 diff methods)
         110 (58.51 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[long],long):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts)
         109 (68.13 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Divide(System.Runtime.Intrinsics.Vector128`1[short],short):System.Runtime.Intrinsics.Vector128`1[short] (FullOpts)
          93 (48.69 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.ShiftRightArithmetic(System.Runtime.Intrinsics.Vector256`1[long],int):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts)
          89 (79.46 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.Dot(System.Runtime.Intrinsics.Vector64`1[short],System.Runtime.Intrinsics.Vector64`1[short]):short (FullOpts)
          87 (117.57 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Dot(System.Runtime.Intrinsics.Vector128`1[long],System.Runtime.Intrinsics.Vector128`1[long]):long (FullOpts)
          86 (191.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Dot[ubyte](System.Numerics.Vector`1[ubyte],System.Numerics.Vector`1[ubyte]):ubyte (FullOpts)
          84 (10.82 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Dot[ubyte](System.Runtime.Intrinsics.Vector256`1[ubyte],System.Runtime.Intrinsics.Vector256`1[ubyte]):ubyte (FullOpts)
          84 (17.04 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Dot(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):long (FullOpts)
          81 (158.82 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Dot[long](System.Numerics.Vector`1[long],System.Numerics.Vector`1[long]):long (FullOpts)
          79 (36.41 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Dot[short](System.Runtime.Intrinsics.Vector128`1[short],System.Runtime.Intrinsics.Vector128`1[short]):short (FullOpts)
          61 (70.93 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:Dot[short](System.Runtime.Intrinsics.Vector64`1[short],System.Runtime.Intrinsics.Vector64`1[short]):short (FullOpts)
          61 (34.66 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.Dot(System.Runtime.Intrinsics.Vector64`1[ubyte],System.Runtime.Intrinsics.Vector64`1[ubyte]):ubyte (FullOpts)
          60 (62.50 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Divide(System.Runtime.Intrinsics.Vector128`1[int],int):System.Runtime.Intrinsics.Vector128`1[int] (FullOpts)
          58 (152.63 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[long] (FullOpts) (2 methods)
          58 (152.63 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[ulong] (FullOpts) (2 methods)
          53 (45.69 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Divide(System.Runtime.Intrinsics.Vector128`1[int],System.Runtime.Intrinsics.Vector128`1[int]):System.Runtime.Intrinsics.Vector128`1[int] (FullOpts)
          49 (20.08 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Divide(System.Runtime.Intrinsics.Vector128`1[short],System.Runtime.Intrinsics.Vector128`1[short]):System.Runtime.Intrinsics.Vector128`1[short] (FullOpts)
          48 (300.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:Dot[double](System.Runtime.Intrinsics.Vector64`1[double],System.Runtime.Intrinsics.Vector64`1[double]):double (FullOpts)
          47 (97.92 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.Dot(System.Runtime.Intrinsics.Vector64`1[int],System.Runtime.Intrinsics.Vector64`1[int]):int (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:OnesComplement[double](System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:OnesComplement[int](System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:OnesComplement[long](System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:OnesComplement[short](System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:OnesComplement[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.OnesComplement(System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.OnesComplement(System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.OnesComplement(System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.OnesComplement(System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.OnesComplement(System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          44 (200.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:Dot[int](System.Runtime.Intrinsics.Vector64`1[int],System.Runtime.Intrinsics.Vector64`1[int]):int (FullOpts)
          43 (226.32 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Inequality(System.Numerics.Vector`1[System.__Canon],System.Numerics.Vector`1[System.__Canon]):ubyte (FullOpts)
          43 (226.32 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.Nullable`1[int]]:op_Inequality(System.Numerics.Vector`1[System.Nullable`1[int]],System.Numerics.Vector`1[System.Nullable`1[int]]):ubyte (FullOpts)
          43 (226.32 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.Numerics.Vector`1[float]]:op_Inequality(System.Numerics.Vector`1[System.Numerics.Vector`1[float]],System.Numerics.Vector`1[System.Numerics.Vector`1[float]]):ubyte (FullOpts)
          42 (280.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:Dot[long](System.Runtime.Intrinsics.Vector64`1[long],System.Runtime.Intrinsics.Vector64`1[long]):long (FullOpts)
          41 (50.62 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Multiply(System.Runtime.Intrinsics.Vector128`1[long],long):System.Runtime.Intrinsics.Vector128`1[long] (FullOpts)
          41 (37.96 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:Negate[short](System.Runtime.Intrinsics.Vector64`1[short]):System.Runtime.Intrinsics.Vector64`1[short] (FullOpts)
          41 (37.96 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.Negate(System.Runtime.Intrinsics.Vector64`1[short]):System.Runtime.Intrinsics.Vector64`1[short] (FullOpts)
          39 (50.65 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Dot[double](System.Runtime.Intrinsics.Vector128`1[double],System.Runtime.Intrinsics.Vector128`1[double]):double (FullOpts)
          37 (5.86 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Multiply(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          36 (24.49 % of base) : System.Numerics.Tensors.dasm - System.Numerics.Tensors.TensorPrimitives:ElementWiseSelect[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
          36 (43.90 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Dot[int](System.Runtime.Intrinsics.Vector128`1[int],System.Runtime.Intrinsics.Vector128`1[int]):int (FullOpts)
          36 (41.86 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Divide(System.Runtime.Intrinsics.Vector128`1[long],long):System.Runtime.Intrinsics.Vector128`1[long] (FullOpts)
          36 (31.58 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.AndNot(System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[System.__Canon] (FullOpts)
          36 (30.77 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.ConditionalSelect(System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[System.__Canon] (FullOpts)
          36 (32.43 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AndNot[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
          36 (32.43 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:ConditionalSelect[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
          36 (24.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.AndNot(System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
          36 (23.53 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.ConditionalSelect(System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
          36 (22.64 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Negate(System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[System.__Canon] (FullOpts)
          36 (87.80 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.Dot(System.Runtime.Intrinsics.Vector64`1[double],System.Runtime.Intrinsics.Vector64`1[double]):double (FullOpts)
          34 (125.93 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[double]:op_Inequality(System.Runtime.Intrinsics.Vector64`1[double],System.Runtime.Intrinsics.Vector64`1[double]):ubyte (FullOpts)
          34 (109.68 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[int]:op_Inequality(System.Runtime.Intrinsics.Vector64`1[int],System.Runtime.Intrinsics.Vector64`1[int]):ubyte (FullOpts)
          34 (170.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[long]:op_Inequality(System.Runtime.Intrinsics.Vector64`1[long],System.Runtime.Intrinsics.Vector64`1[long]):ubyte (FullOpts)
          34 (57.63 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[short]:op_Inequality(System.Runtime.Intrinsics.Vector64`1[short],System.Runtime.Intrinsics.Vector64`1[short]):ubyte (FullOpts)
          34 (39.08 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[ubyte]:op_Inequality(System.Runtime.Intrinsics.Vector64`1[ubyte],System.Runtime.Intrinsics.Vector64`1[ubyte]):ubyte (FullOpts)
          33 (73.33 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AndNot[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon],System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[System.__Canon] (FullOpts)
          33 (73.33 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:ConditionalSelect[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon],System.Runtime.Intrinsics.Vector128`1[System.__Canon],System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[System.__Canon] (FullOpts)
          33 (40.74 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.AndNot(System.Runtime.Intrinsics.Vector128`1[System.__Canon],System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[System.__Canon] (FullOpts)
          33 (39.29 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.ConditionalSelect(System.Runtime.Intrinsics.Vector128`1[System.__Canon],System.Runtime.Intrinsics.Vector128`1[System.__Canon],System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[System.__Canon] (FullOpts)
          33 (42.31 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:AndNot[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[System.__Canon] (FullOpts)
          33 (42.31 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:ConditionalSelect[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[System.__Canon] (FullOpts)
          33 (68.75 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.AndNot(System.Runtime.Intrinsics.Vector64`1[System.__Canon],System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[System.__Canon] (FullOpts)
          33 (68.75 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.ConditionalSelect(System.Runtime.Intrinsics.Vector64`1[System.__Canon],System.Runtime.Intrinsics.Vector64`1[System.__Canon],System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[System.__Canon] (FullOpts)
          32 (42.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[long]:op_Multiply(System.Numerics.Vector`1[long],long):System.Numerics.Vector`1[long] (FullOpts)
          32 (168.42 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[byte] (FullOpts)
          32 (168.42 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[float] (FullOpts)
          32 (168.42 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[uint] (FullOpts)
          32 (168.42 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[ushort] (FullOpts)
          32 (48.48 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[ubyte]:op_Multiply(System.Numerics.Vector`1[ubyte],ubyte):System.Numerics.Vector`1[ubyte] (FullOpts)
          32 (19.16 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Divide(System.Runtime.Intrinsics.Vector128`1[ubyte],System.Runtime.Intrinsics.Vector128`1[ubyte]):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
          32 (80.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.Dot(System.Runtime.Intrinsics.Vector64`1[long],System.Runtime.Intrinsics.Vector64`1[long]):long (FullOpts)
          31 (30.39 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Multiply(System.Runtime.Intrinsics.Vector128`1[long],System.Runtime.Intrinsics.Vector128`1[long]):System.Runtime.Intrinsics.Vector128`1[long] (FullOpts)
          30 (66.67 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:StoreLowerUnsafe[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon],byref,ulong) (FullOpts)
          29 (152.63 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[double] (FullOpts)
          29 (152.63 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[int] (FullOpts)

Top method improvements (bytes):
        -671 (-55.82 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
        -448 (-24.48 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Dot(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):ubyte (FullOpts)
        -373 (-27.41 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
        -204 (-87.93 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:op_Inequality(System.Runtime.Intrinsics.Vector128`1[ubyte],System.Runtime.Intrinsics.Vector128`1[ubyte]):ubyte (FullOpts)
        -187 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:<Equals>g__SoftwareFallback|44_0(byref,System.Numerics.Quaternion):ubyte (FullOpts) (1 base, 0 diff methods)
        -187 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:<Equals>g__SoftwareFallback|68_0(byref,System.Numerics.Vector4):ubyte (FullOpts) (1 base, 0 diff methods)
        -183 (-20.96 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
        -138 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Plane:<Equals>g__SoftwareFallback|16_0(byref,System.Numerics.Plane):ubyte (FullOpts) (1 base, 0 diff methods)
        -128 (-82.05 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[short]:op_Inequality(System.Runtime.Intrinsics.Vector128`1[short],System.Runtime.Intrinsics.Vector128`1[short]):ubyte (FullOpts)
        -112 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:<ConvertToSingle>g__SoftwareFallback|43_0(System.Runtime.Intrinsics.Vector128`1[uint]):System.Runtime.Intrinsics.Vector128`1[float] (FullOpts) (1 base, 0 diff methods)
        -105 (-84.68 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:Max(System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
        -105 (-84.68 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:Min(System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -84 (-77.06 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:FusedMultiplyAdd(System.Numerics.Vector4,System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -84 (-77.06 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:MultiplyAddEstimate(System.Numerics.Vector4,System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -83 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:Multiply(System.Numerics.Quaternion,System.Numerics.Vector4):System.Numerics.Quaternion (FullOpts) (1 base, 0 diff methods)
         -81 (-16.46 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts)
         -79 (-12.36 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
         -74 (-73.27 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[int]:op_Multiply(System.Numerics.Vector`1[int],int):System.Numerics.Vector`1[int] (FullOpts)
         -73 (-9.85 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[int],System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
         -72 (-72.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[int]:op_Inequality(System.Runtime.Intrinsics.Vector128`1[int],System.Runtime.Intrinsics.Vector128`1[int]):ubyte (FullOpts)
         -67 (-71.28 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[double]:op_Inequality(System.Runtime.Intrinsics.Vector128`1[double],System.Runtime.Intrinsics.Vector128`1[double]):ubyte (FullOpts)
         -66 (-18.80 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Dot[ubyte](System.Runtime.Intrinsics.Vector128`1[ubyte],System.Runtime.Intrinsics.Vector128`1[ubyte]):ubyte (FullOpts)
         -64 (-77.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:op_Addition(System.Numerics.Quaternion,System.Numerics.Quaternion):System.Numerics.Quaternion (FullOpts)
         -64 (-68.82 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:op_Equality(System.Numerics.Quaternion,System.Numerics.Quaternion):ubyte (FullOpts)
         -64 (-77.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:op_Subtraction(System.Numerics.Quaternion,System.Numerics.Quaternion):System.Numerics.Quaternion (FullOpts)
         -64 (-77.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:op_Addition(System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -64 (-77.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:op_Division(System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -64 (-68.82 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:op_Equality(System.Numerics.Vector4,System.Numerics.Vector4):ubyte (FullOpts)
         -64 (-77.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:op_Multiply(System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -64 (-77.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:op_Subtraction(System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -61 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:Divide(System.Numerics.Quaternion,float):System.Numerics.Quaternion (FullOpts) (1 base, 0 diff methods)
         -61 (-25.42 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Divide(System.Runtime.Intrinsics.Vector128`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
         -61 (-4.82 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.ShiftLeft(System.Runtime.Intrinsics.Vector512`1[ubyte],int):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
         -61 (-4.82 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.ShiftRightArithmetic(System.Runtime.Intrinsics.Vector512`1[ubyte],int):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
         -56 (-76.71 % of base) : System.Private.CoreLib.dasm - System.Numerics.Plane:Dot(System.Numerics.Plane,System.Numerics.Vector4):float (FullOpts)
         -56 (-76.71 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:Dot(System.Numerics.Quaternion,System.Numerics.Quaternion):float (FullOpts)
         -56 (-76.71 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:Dot(System.Numerics.Vector4,System.Numerics.Vector4):float (FullOpts)
         -55 (-71.43 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:Abs(System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -55 (-3.25 % of base) : System.Private.CoreLib.dasm - System.SpanHelpers:IndexOf(byref,int,byref,int):int (FullOpts) (2 methods)
         -54 (-65.06 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[long]:op_Inequality(System.Runtime.Intrinsics.Vector128`1[long],System.Runtime.Intrinsics.Vector128`1[long]):ubyte (FullOpts)
         -51 (-62.96 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:op_Inequality(System.Runtime.Intrinsics.Vector256`1[long],System.Runtime.Intrinsics.Vector256`1[long]):ubyte (FullOpts)
         -50 (-63.29 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[int]:op_Inequality(System.Numerics.Vector`1[int],System.Numerics.Vector`1[int]):ubyte (FullOpts)
         -50 (-63.29 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[int]:op_Inequality(System.Runtime.Intrinsics.Vector256`1[int],System.Runtime.Intrinsics.Vector256`1[int]):ubyte (FullOpts)
         -50 (-63.29 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[short]:op_Inequality(System.Runtime.Intrinsics.Vector256`1[short],System.Runtime.Intrinsics.Vector256`1[short]):ubyte (FullOpts)
         -50 (-63.29 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[ubyte]:op_Inequality(System.Runtime.Intrinsics.Vector256`1[ubyte],System.Runtime.Intrinsics.Vector256`1[ubyte]):ubyte (FullOpts)
         -49 (-69.01 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[double]:op_Multiply(System.Numerics.Vector`1[double],double):System.Numerics.Vector`1[double] (FullOpts)
         -47 (-77.05 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:SquareRoot(System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -47 (-61.04 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[double]:op_Inequality(System.Runtime.Intrinsics.Vector256`1[double],System.Runtime.Intrinsics.Vector256`1[double]):ubyte (FullOpts)
         -42 (-68.85 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:op_Multiply(System.Numerics.Quaternion,float):System.Numerics.Quaternion (FullOpts)
         -41 (-57.75 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[double]:op_Inequality(System.Numerics.Vector`1[double],System.Numerics.Vector`1[double]):ubyte (FullOpts)
         -39 (-26.35 % of base) : System.Private.CoreLib.dasm - System.Numerics.Matrix4x4:Lerp(System.Numerics.Matrix4x4,System.Numerics.Matrix4x4,float):System.Numerics.Matrix4x4 (FullOpts)
         -39 (-27.27 % of base) : System.Private.CoreLib.dasm - System.Numerics.Matrix4x4+Impl:Lerp(byref,byref,float):System.Numerics.Matrix4x4+Impl (FullOpts)
         -39 (-20.63 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Dot(System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.__Canon (FullOpts)
         -38 (-55.88 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[short]:op_Multiply(System.Numerics.Vector`1[short],short):System.Numerics.Vector`1[short] (FullOpts)
         -38 (-2.25 % of base) : System.Private.CoreLib.dasm - System.SpanHelpers:LastIndexOf(byref,int,byref,int):int (FullOpts) (2 methods)
         -36 (-30.77 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Dot(System.Runtime.Intrinsics.Vector128`1[System.__Canon],System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.__Canon (FullOpts)
         -36 (-32.43 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Dot[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.__Canon (FullOpts)
         -36 (-23.53 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Dot(System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.__Canon (FullOpts)
         -36 (-24.49 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Dot[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.__Canon (FullOpts)
         -35 (-54.69 % of base) : System.Private.CoreLib.dasm - System.Numerics.Plane:op_Equality(System.Numerics.Plane,System.Numerics.Plane):ubyte (FullOpts)
         -30 (-45.45 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Dot[double](System.Numerics.Vector`1[double],System.Numerics.Vector`1[double]):double (FullOpts)
         -30 (-40.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Dot[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon],System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.__Canon (FullOpts)
         -30 (-37.04 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.Dot(System.Runtime.Intrinsics.Vector64`1[System.__Canon],System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.__Canon (FullOpts)
         -29 (-29.90 % of base) : System.Private.CoreLib.dasm - System.Numerics.Plane:Normalize(System.Numerics.Plane):System.Numerics.Plane (FullOpts)
         -27 (-48.21 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[short]:op_Inequality(System.Numerics.Vector`1[short],System.Numerics.Vector`1[short]):ubyte (FullOpts)
         -26 (-57.78 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Dot[System.__Canon](System.Numerics.Vector`1[System.__Canon],System.Numerics.Vector`1[System.__Canon]):System.__Canon (FullOpts)
         -26 (-57.78 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:Dot[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon],System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.__Canon (FullOpts)
         -25 (-45.45 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[long]:op_Inequality(System.Numerics.Vector`1[long],System.Numerics.Vector`1[long]):ubyte (FullOpts)
         -25 (-46.30 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[ubyte]:op_Inequality(System.Numerics.Vector`1[ubyte],System.Numerics.Vector`1[ubyte]):ubyte (FullOpts)
         -24 (-50.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:GetElementUnsafe(byref,int):float (FullOpts) (4 base, 2 diff methods)
         -24 (-50.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:SetElementUnsafe(byref,int,float) (FullOpts) (4 base, 2 diff methods)
         -24 (-52.17 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:.ctor(System.Numerics.Vector3,float):this (FullOpts)
         -23 (-10.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Matrix4x4:CreateReflection(System.Numerics.Plane):System.Numerics.Matrix4x4 (FullOpts)
         -23 (-10.04 % of base) : System.Private.CoreLib.dasm - System.Numerics.Matrix4x4+Impl:CreateReflection(byref):System.Numerics.Matrix4x4+Impl (FullOpts)
         -20 (-20.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Matrix4x4:set_Item(int,int,float):this (FullOpts)
         -20 (-20.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Matrix4x4+Impl:set_Item(int,int,float):this (FullOpts)
         -20 (-36.36 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:GetElement(System.Numerics.Quaternion,int):float (FullOpts)
         -20 (-36.36 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:GetElement(System.Numerics.Vector4,int):float (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Multiply[System.__Canon](System.__Canon,System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[System.__Canon] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Multiply[System.__Canon](System.Numerics.Vector`1[System.__Canon],System.__Canon):System.Numerics.Vector`1[System.__Canon] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Multiply[System.Nullable`1[int]](System.Nullable`1[int],System.Numerics.Vector`1[System.Nullable`1[int]]):System.Numerics.Vector`1[System.Nullable`1[int]] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Multiply[System.Nullable`1[int]](System.Numerics.Vector`1[System.Nullable`1[int]],System.Nullable`1[int]):System.Numerics.Vector`1[System.Nullable`1[int]] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Multiply[System.Numerics.Vector`1[float]](System.Numerics.Vector`1[float],System.Numerics.Vector`1[System.Numerics.Vector`1[float]]):System.Numerics.Vector`1[System.Numerics.Vector`1[float]] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Multiply[System.Numerics.Vector`1[float]](System.Numerics.Vector`1[System.Numerics.Vector`1[float]],System.Numerics.Vector`1[float]):System.Numerics.Vector`1[System.Numerics.Vector`1[float]] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Multiply(System.__Canon,System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[System.__Canon] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.Nullable`1[int]]:op_Multiply(System.Nullable`1[int],System.Numerics.Vector`1[System.Nullable`1[int]]):System.Numerics.Vector`1[System.Nullable`1[int]] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.Numerics.Vector`1[float]]:op_Multiply(System.Numerics.Vector`1[float],System.Numerics.Vector`1[System.Numerics.Vector`1[float]]):System.Numerics.Vector`1[System.Numerics.Vector`1[float]] (FullOpts)
         -18 (-64.29 % of base) : System.Private.CoreLib.dasm - System.Numerics.Plane:.ctor(System.Numerics.Vector4):this (FullOpts)
         -18 (-25.71 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:WithElement(System.Numerics.Quaternion,int,float):System.Numerics.Quaternion (FullOpts)
         -18 (-25.71 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:WithElement(System.Numerics.Vector4,int,float):System.Numerics.Vector4 (FullOpts)
         -17 (-62.96 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:.ctor(float):this (FullOpts)
         -15 (-5.75 % of base) : System.Private.CoreLib.dasm - System.Numerics.Matrix4x4:CreateShadow(System.Numerics.Vector3,System.Numerics.Plane):System.Numerics.Matrix4x4 (FullOpts)
         -15 (-5.77 % of base) : System.Private.CoreLib.dasm - System.Numerics.Matrix4x4+Impl:CreateShadow(byref,byref):System.Numerics.Matrix4x4+Impl (FullOpts)
         -14 (-45.16 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:.ctor(System.Numerics.Vector2,float,float):this (FullOpts)
         -13 (-3.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.ShiftRightArithmetic(System.Runtime.Intrinsics.Vector512`1[long],int):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
         -12 (-19.35 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:set_Item(int,float):this (FullOpts)
         -12 (-19.35 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:set_Item(int,float):this (FullOpts)
          -6 (-20.69 % of base) : System.Private.CoreLib.dasm - System.Numerics.Plane:.ctor(float,float,float,float):this (FullOpts)
          -6 (-21.43 % of base) : System.Private.CoreLib.dasm - System.Numerics.Plane:.ctor(System.Numerics.Vector3,float):this (FullOpts)
          -6 (-21.43 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:.ctor(System.Numerics.Vector3,float):this (FullOpts)

Top method regressions (percentages):
          10 (Infinity of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:AsPlane(System.Numerics.Vector4):System.Numerics.Plane (FullOpts) (0 base, 1 diff methods)
          10 (Infinity of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:AsQuaternion(System.Numerics.Vector4):System.Numerics.Quaternion (FullOpts) (0 base, 1 diff methods)
         112 (Infinity of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:<ConvertToSingle>g__SoftwareFallback|45_0(System.Runtime.Intrinsics.Vector128`1[uint]):System.Runtime.Intrinsics.Vector128`1[float] (FullOpts) (0 base, 1 diff methods)
          11 (Infinity of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AsPlane(System.Runtime.Intrinsics.Vector128`1[float]):System.Numerics.Plane (FullOpts) (0 base, 1 diff methods)
          11 (Infinity of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AsQuaternion(System.Runtime.Intrinsics.Vector128`1[float]):System.Numerics.Quaternion (FullOpts) (0 base, 1 diff methods)
          48 (300.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:Dot[double](System.Runtime.Intrinsics.Vector64`1[double],System.Runtime.Intrinsics.Vector64`1[double]):double (FullOpts)
          42 (280.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:Dot[long](System.Runtime.Intrinsics.Vector64`1[long],System.Runtime.Intrinsics.Vector64`1[long]):long (FullOpts)
          21 (233.33 % of base) : System.Drawing.Primitives.dasm - System.Drawing.RectangleF:ToVector4():System.Numerics.Vector4:this (FullOpts)
          43 (226.32 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Inequality(System.Numerics.Vector`1[System.__Canon],System.Numerics.Vector`1[System.__Canon]):ubyte (FullOpts)
          43 (226.32 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.Nullable`1[int]]:op_Inequality(System.Numerics.Vector`1[System.Nullable`1[int]],System.Numerics.Vector`1[System.Nullable`1[int]]):ubyte (FullOpts)
          43 (226.32 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.Numerics.Vector`1[float]]:op_Inequality(System.Numerics.Vector`1[System.Numerics.Vector`1[float]],System.Numerics.Vector`1[System.Numerics.Vector`1[float]]):ubyte (FullOpts)
          44 (200.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:Dot[int](System.Runtime.Intrinsics.Vector64`1[int],System.Runtime.Intrinsics.Vector64`1[int]):int (FullOpts)
          86 (191.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Dot[ubyte](System.Numerics.Vector`1[ubyte],System.Numerics.Vector`1[ubyte]):ubyte (FullOpts)
          34 (170.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[long]:op_Inequality(System.Runtime.Intrinsics.Vector64`1[long],System.Runtime.Intrinsics.Vector64`1[long]):ubyte (FullOpts)
          32 (168.42 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[byte] (FullOpts)
          32 (168.42 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[float] (FullOpts)
          32 (168.42 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[uint] (FullOpts)
          32 (168.42 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[ushort] (FullOpts)
          81 (158.82 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Dot[long](System.Numerics.Vector`1[long],System.Numerics.Vector`1[long]):long (FullOpts)
          29 (152.63 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[double] (FullOpts)
          29 (152.63 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[int] (FullOpts)
          58 (152.63 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[long] (FullOpts) (2 methods)
          29 (152.63 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[short] (FullOpts)
          29 (152.63 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[ubyte] (FullOpts)
          58 (152.63 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Explicit(System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[ulong] (FullOpts) (2 methods)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:AsVectorByte[System.__Canon](System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[ubyte] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:AsVectorDouble[System.__Canon](System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[double] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:AsVectorInt16[System.__Canon](System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[short] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:AsVectorInt32[System.__Canon](System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[int] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:AsVectorInt64[System.__Canon](System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[long] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:AsVectorNInt[System.__Canon](System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[long] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:AsVectorNUInt[System.__Canon](System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[ulong] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:AsVectorSByte[System.__Canon](System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[byte] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:AsVectorSingle[System.__Canon](System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[float] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:AsVectorUInt16[System.__Canon](System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[ushort] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:AsVectorUInt32[System.__Canon](System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[uint] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:AsVectorUInt64[System.__Canon](System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[ulong] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AsByte[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AsDouble[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[double] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AsInt16[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[short] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AsInt32[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[int] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AsInt64[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[long] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AsNInt[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[long] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AsNUInt[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[ulong] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AsSByte[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[byte] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AsSingle[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[float] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AsUInt16[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[ushort] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AsUInt32[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[uint] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:AsUInt64[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.Runtime.Intrinsics.Vector128`1[ulong] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:AsByte[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:AsDouble[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[double] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:AsInt16[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[short] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:AsInt32[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[int] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:AsInt64[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:AsNInt[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[long] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:AsNUInt[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[ulong] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:AsSByte[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[byte] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:AsSingle[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[float] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:AsUInt16[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[ushort] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:AsUInt32[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[uint] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:AsUInt64[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.Runtime.Intrinsics.Vector256`1[ulong] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AsByte[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AsDouble[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AsInt16[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AsInt32[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AsInt64[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AsNInt[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AsNUInt[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[ulong] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AsSByte[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[byte] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AsSingle[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[float] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AsUInt16[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[ushort] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AsUInt32[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[uint] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:AsUInt64[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.Runtime.Intrinsics.Vector512`1[ulong] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:AndNot[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon],System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[System.__Canon] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:AsByte[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[ubyte] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:AsDouble[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[double] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:AsInt16[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[short] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:AsInt32[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[int] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:AsInt64[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[long] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:AsNInt[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[long] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:AsNUInt[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[ulong] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:AsSByte[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[byte] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:AsSingle[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[float] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:AsUInt16[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[ushort] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:AsUInt32[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[uint] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:AsUInt64[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[ulong] (FullOpts)
          26 (136.84 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:ConditionalSelect[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon],System.Runtime.Intrinsics.Vector64`1[System.__Canon],System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.Runtime.Intrinsics.Vector64`1[System.__Canon] (FullOpts)
         267 (126.54 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Dot(System.Runtime.Intrinsics.Vector256`1[long],System.Runtime.Intrinsics.Vector256`1[long]):long (FullOpts)
          34 (125.93 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[double]:op_Inequality(System.Runtime.Intrinsics.Vector64`1[double],System.Runtime.Intrinsics.Vector64`1[double]):ubyte (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:OnesComplement[double](System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:OnesComplement[int](System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:OnesComplement[long](System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:OnesComplement[short](System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:OnesComplement[ubyte](System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[double]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.OnesComplement(System.Runtime.Intrinsics.Vector512`1[double]):System.Runtime.Intrinsics.Vector512`1[double] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[int]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.OnesComplement(System.Runtime.Intrinsics.Vector512`1[int]):System.Runtime.Intrinsics.Vector512`1[int] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.OnesComplement(System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.OnesComplement(System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
          44 (122.22 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.OnesComplement(System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          87 (117.57 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Dot(System.Runtime.Intrinsics.Vector128`1[long],System.Runtime.Intrinsics.Vector128`1[long]):long (FullOpts)

Top method improvements (percentages):
        -138 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Plane:<Equals>g__SoftwareFallback|16_0(byref,System.Numerics.Plane):ubyte (FullOpts) (1 base, 0 diff methods)
        -187 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:<Equals>g__SoftwareFallback|44_0(byref,System.Numerics.Quaternion):ubyte (FullOpts) (1 base, 0 diff methods)
         -61 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:Divide(System.Numerics.Quaternion,float):System.Numerics.Quaternion (FullOpts) (1 base, 0 diff methods)
         -83 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:Multiply(System.Numerics.Quaternion,System.Numerics.Vector4):System.Numerics.Quaternion (FullOpts) (1 base, 0 diff methods)
        -187 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:<Equals>g__SoftwareFallback|68_0(byref,System.Numerics.Vector4):ubyte (FullOpts) (1 base, 0 diff methods)
        -112 (-100.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:<ConvertToSingle>g__SoftwareFallback|43_0(System.Runtime.Intrinsics.Vector128`1[uint]):System.Runtime.Intrinsics.Vector128`1[float] (FullOpts) (1 base, 0 diff methods)
        -204 (-87.93 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:op_Inequality(System.Runtime.Intrinsics.Vector128`1[ubyte],System.Runtime.Intrinsics.Vector128`1[ubyte]):ubyte (FullOpts)
        -105 (-84.68 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:Max(System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
        -105 (-84.68 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:Min(System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
        -128 (-82.05 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[short]:op_Inequality(System.Runtime.Intrinsics.Vector128`1[short],System.Runtime.Intrinsics.Vector128`1[short]):ubyte (FullOpts)
         -64 (-77.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:op_Addition(System.Numerics.Quaternion,System.Numerics.Quaternion):System.Numerics.Quaternion (FullOpts)
         -64 (-77.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:op_Subtraction(System.Numerics.Quaternion,System.Numerics.Quaternion):System.Numerics.Quaternion (FullOpts)
         -64 (-77.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:op_Addition(System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -64 (-77.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:op_Division(System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -64 (-77.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:op_Multiply(System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -64 (-77.11 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:op_Subtraction(System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -84 (-77.06 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:FusedMultiplyAdd(System.Numerics.Vector4,System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -84 (-77.06 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:MultiplyAddEstimate(System.Numerics.Vector4,System.Numerics.Vector4,System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -47 (-77.05 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:SquareRoot(System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -56 (-76.71 % of base) : System.Private.CoreLib.dasm - System.Numerics.Plane:Dot(System.Numerics.Plane,System.Numerics.Vector4):float (FullOpts)
         -56 (-76.71 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:Dot(System.Numerics.Quaternion,System.Numerics.Quaternion):float (FullOpts)
         -56 (-76.71 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:Dot(System.Numerics.Vector4,System.Numerics.Vector4):float (FullOpts)
         -74 (-73.27 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[int]:op_Multiply(System.Numerics.Vector`1[int],int):System.Numerics.Vector`1[int] (FullOpts)
         -72 (-72.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[int]:op_Inequality(System.Runtime.Intrinsics.Vector128`1[int],System.Runtime.Intrinsics.Vector128`1[int]):ubyte (FullOpts)
         -55 (-71.43 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:Abs(System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -67 (-71.28 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[double]:op_Inequality(System.Runtime.Intrinsics.Vector128`1[double],System.Runtime.Intrinsics.Vector128`1[double]):ubyte (FullOpts)
         -49 (-69.01 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[double]:op_Multiply(System.Numerics.Vector`1[double],double):System.Numerics.Vector`1[double] (FullOpts)
         -42 (-68.85 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:op_Multiply(System.Numerics.Quaternion,float):System.Numerics.Quaternion (FullOpts)
         -64 (-68.82 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:op_Equality(System.Numerics.Quaternion,System.Numerics.Quaternion):ubyte (FullOpts)
         -64 (-68.82 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:op_Equality(System.Numerics.Vector4,System.Numerics.Vector4):ubyte (FullOpts)
         -54 (-65.06 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[long]:op_Inequality(System.Runtime.Intrinsics.Vector128`1[long],System.Runtime.Intrinsics.Vector128`1[long]):ubyte (FullOpts)
         -18 (-64.29 % of base) : System.Private.CoreLib.dasm - System.Numerics.Plane:.ctor(System.Numerics.Vector4):this (FullOpts)
         -50 (-63.29 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[int]:op_Inequality(System.Numerics.Vector`1[int],System.Numerics.Vector`1[int]):ubyte (FullOpts)
         -50 (-63.29 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[int]:op_Inequality(System.Runtime.Intrinsics.Vector256`1[int],System.Runtime.Intrinsics.Vector256`1[int]):ubyte (FullOpts)
         -50 (-63.29 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[short]:op_Inequality(System.Runtime.Intrinsics.Vector256`1[short],System.Runtime.Intrinsics.Vector256`1[short]):ubyte (FullOpts)
         -50 (-63.29 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[ubyte]:op_Inequality(System.Runtime.Intrinsics.Vector256`1[ubyte],System.Runtime.Intrinsics.Vector256`1[ubyte]):ubyte (FullOpts)
         -17 (-62.96 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:.ctor(float):this (FullOpts)
         -51 (-62.96 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[long]:op_Inequality(System.Runtime.Intrinsics.Vector256`1[long],System.Runtime.Intrinsics.Vector256`1[long]):ubyte (FullOpts)
         -47 (-61.04 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[double]:op_Inequality(System.Runtime.Intrinsics.Vector256`1[double],System.Runtime.Intrinsics.Vector256`1[double]):ubyte (FullOpts)
         -26 (-57.78 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Dot[System.__Canon](System.Numerics.Vector`1[System.__Canon],System.Numerics.Vector`1[System.__Canon]):System.__Canon (FullOpts)
         -26 (-57.78 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:Dot[System.__Canon](System.Runtime.Intrinsics.Vector64`1[System.__Canon],System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.__Canon (FullOpts)
         -41 (-57.75 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[double]:op_Inequality(System.Numerics.Vector`1[double],System.Numerics.Vector`1[double]):ubyte (FullOpts)
         -38 (-55.88 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[short]:op_Multiply(System.Numerics.Vector`1[short],short):System.Numerics.Vector`1[short] (FullOpts)
        -671 (-55.82 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
         -35 (-54.69 % of base) : System.Private.CoreLib.dasm - System.Numerics.Plane:op_Equality(System.Numerics.Plane,System.Numerics.Plane):ubyte (FullOpts)
         -24 (-52.17 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:.ctor(System.Numerics.Vector3,float):this (FullOpts)
         -24 (-50.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:GetElementUnsafe(byref,int):float (FullOpts) (4 base, 2 diff methods)
         -24 (-50.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:SetElementUnsafe(byref,int,float) (FullOpts) (4 base, 2 diff methods)
         -27 (-48.21 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[short]:op_Inequality(System.Numerics.Vector`1[short],System.Numerics.Vector`1[short]):ubyte (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Multiply[System.__Canon](System.__Canon,System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[System.__Canon] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Multiply[System.__Canon](System.Numerics.Vector`1[System.__Canon],System.__Canon):System.Numerics.Vector`1[System.__Canon] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Multiply[System.Nullable`1[int]](System.Nullable`1[int],System.Numerics.Vector`1[System.Nullable`1[int]]):System.Numerics.Vector`1[System.Nullable`1[int]] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Multiply[System.Nullable`1[int]](System.Numerics.Vector`1[System.Nullable`1[int]],System.Nullable`1[int]):System.Numerics.Vector`1[System.Nullable`1[int]] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Multiply[System.Numerics.Vector`1[float]](System.Numerics.Vector`1[float],System.Numerics.Vector`1[System.Numerics.Vector`1[float]]):System.Numerics.Vector`1[System.Numerics.Vector`1[float]] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Multiply[System.Numerics.Vector`1[float]](System.Numerics.Vector`1[System.Numerics.Vector`1[float]],System.Numerics.Vector`1[float]):System.Numerics.Vector`1[System.Numerics.Vector`1[float]] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.__Canon]:op_Multiply(System.__Canon,System.Numerics.Vector`1[System.__Canon]):System.Numerics.Vector`1[System.__Canon] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.Nullable`1[int]]:op_Multiply(System.Nullable`1[int],System.Numerics.Vector`1[System.Nullable`1[int]]):System.Numerics.Vector`1[System.Nullable`1[int]] (FullOpts)
         -20 (-47.62 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[System.Numerics.Vector`1[float]]:op_Multiply(System.Numerics.Vector`1[float],System.Numerics.Vector`1[System.Numerics.Vector`1[float]]):System.Numerics.Vector`1[System.Numerics.Vector`1[float]] (FullOpts)
         -25 (-46.30 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[ubyte]:op_Inequality(System.Numerics.Vector`1[ubyte],System.Numerics.Vector`1[ubyte]):ubyte (FullOpts)
         -30 (-45.45 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:Dot[double](System.Numerics.Vector`1[double],System.Numerics.Vector`1[double]):double (FullOpts)
         -25 (-45.45 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector`1[long]:op_Inequality(System.Numerics.Vector`1[long],System.Numerics.Vector`1[long]):ubyte (FullOpts)
         -14 (-45.16 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:.ctor(System.Numerics.Vector2,float,float):this (FullOpts)
         -30 (-40.00 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Dot[System.__Canon](System.Runtime.Intrinsics.Vector128`1[System.__Canon],System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.__Canon (FullOpts)
         -30 (-37.04 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector64<T>,T>.Dot(System.Runtime.Intrinsics.Vector64`1[System.__Canon],System.Runtime.Intrinsics.Vector64`1[System.__Canon]):System.__Canon (FullOpts)
         -20 (-36.36 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:GetElement(System.Numerics.Quaternion,int):float (FullOpts)
         -20 (-36.36 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:GetElement(System.Numerics.Vector4,int):float (FullOpts)
         -36 (-32.43 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256:Dot[System.__Canon](System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.__Canon (FullOpts)
         -36 (-30.77 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Dot(System.Runtime.Intrinsics.Vector128`1[System.__Canon],System.Runtime.Intrinsics.Vector128`1[System.__Canon]):System.__Canon (FullOpts)
         -29 (-29.90 % of base) : System.Private.CoreLib.dasm - System.Numerics.Plane:Normalize(System.Numerics.Plane):System.Numerics.Plane (FullOpts)
        -373 (-27.41 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[short]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[short],System.Runtime.Intrinsics.Vector512`1[short]):System.Runtime.Intrinsics.Vector512`1[short] (FullOpts)
         -39 (-27.27 % of base) : System.Private.CoreLib.dasm - System.Numerics.Matrix4x4+Impl:Lerp(byref,byref,float):System.Numerics.Matrix4x4+Impl (FullOpts)
         -39 (-26.35 % of base) : System.Private.CoreLib.dasm - System.Numerics.Matrix4x4:Lerp(System.Numerics.Matrix4x4,System.Numerics.Matrix4x4,float):System.Numerics.Matrix4x4 (FullOpts)
         -18 (-25.71 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:WithElement(System.Numerics.Quaternion,int,float):System.Numerics.Quaternion (FullOpts)
         -18 (-25.71 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector:WithElement(System.Numerics.Vector4,int,float):System.Numerics.Vector4 (FullOpts)
         -61 (-25.42 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector128<T>,T>.Divide(System.Runtime.Intrinsics.Vector128`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector128`1[ubyte] (FullOpts)
          -5 (-25.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector2:Normalize(System.Numerics.Vector2):System.Numerics.Vector2 (FullOpts)
         -36 (-24.49 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Dot[System.__Canon](System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.__Canon (FullOpts)
        -448 (-24.48 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Dot(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):ubyte (FullOpts)
         -36 (-23.53 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Dot(System.Runtime.Intrinsics.Vector256`1[System.__Canon],System.Runtime.Intrinsics.Vector256`1[System.__Canon]):System.__Canon (FullOpts)
          -6 (-21.43 % of base) : System.Private.CoreLib.dasm - System.Numerics.Plane:.ctor(System.Numerics.Vector3,float):this (FullOpts)
          -6 (-21.43 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:.ctor(System.Numerics.Vector3,float):this (FullOpts)
        -183 (-20.96 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[ubyte],System.Runtime.Intrinsics.Vector512`1[ubyte]):System.Runtime.Intrinsics.Vector512`1[ubyte] (FullOpts)
          -6 (-20.69 % of base) : System.Private.CoreLib.dasm - System.Numerics.Plane:.ctor(float,float,float,float):this (FullOpts)
         -39 (-20.63 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[System.__Canon]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Dot(System.Runtime.Intrinsics.Vector512`1[System.__Canon],System.Runtime.Intrinsics.Vector512`1[System.__Canon]):System.__Canon (FullOpts)
         -20 (-20.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Matrix4x4:set_Item(int,int,float):this (FullOpts)
         -20 (-20.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Matrix4x4+Impl:set_Item(int,int,float):this (FullOpts)
         -12 (-19.35 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:set_Item(int,float):this (FullOpts)
         -12 (-19.35 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:set_Item(int,float):this (FullOpts)
         -66 (-18.80 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector128:Dot[ubyte](System.Runtime.Intrinsics.Vector128`1[ubyte],System.Runtime.Intrinsics.Vector128`1[ubyte]):ubyte (FullOpts)
          -5 (-17.24 % of base) : System.Private.CoreLib.dasm - System.Numerics.Quaternion:Normalize(System.Numerics.Quaternion):System.Numerics.Quaternion (FullOpts)
          -5 (-17.24 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector4:Normalize(System.Numerics.Vector4):System.Numerics.Vector4 (FullOpts)
         -81 (-16.46 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Divide(System.Runtime.Intrinsics.Vector256`1[ubyte],ubyte):System.Runtime.Intrinsics.Vector256`1[ubyte] (FullOpts)
          -5 (-14.29 % of base) : System.Private.CoreLib.dasm - System.Numerics.Vector3:Normalize(System.Numerics.Vector3):System.Numerics.Vector3 (FullOpts)
          -3 (-13.64 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Sqrt[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector512`1[System.Nullable`1[int]]):System.Runtime.Intrinsics.Vector512`1[System.Nullable`1[int]] (FullOpts)
          -3 (-13.64 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512:Sqrt[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector512`1[System.Numerics.Vector`1[float]]):System.Runtime.Intrinsics.Vector512`1[System.Numerics.Vector`1[float]] (FullOpts)
          -3 (-13.64 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:Dot[System.Nullable`1[int]](System.Runtime.Intrinsics.Vector64`1[System.Nullable`1[int]],System.Runtime.Intrinsics.Vector64`1[System.Nullable`1[int]]):System.Nullable`1[int] (FullOpts)
          -3 (-13.64 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector64:Dot[System.Numerics.Vector`1[float]](System.Runtime.Intrinsics.Vector64`1[System.Numerics.Vector`1[float]],System.Runtime.Intrinsics.Vector64`1[System.Numerics.Vector`1[float]]):System.Numerics.Vector`1[float] (FullOpts)
         -79 (-12.36 % of base) : System.Private.CoreLib.dasm - System.Runtime.Intrinsics.Vector512`1[long]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector512<T>,T>.Divide(System.Runtime.Intrinsics.Vector512`1[long],System.Runtime.Intrinsics.Vector512`1[long]):System.Runtime.Intrinsics.Vector512`1[long] (FullOpts)
         -23 (-10.04 % of base) : System.Private.CoreLib.dasm - System.Numerics.Matrix4x4+Impl:CreateReflection(byref):System.Numerics.Matrix4x4+Impl (FullOpts)
         -23 (-10.00 % of base) : System.Private.CoreLib.dasm - System.Numerics.Matrix4x4:CreateReflection(System.Numerics.Plane):System.Numerics.Matrix4x4 (FullOpts)

432 total methods with Code Size differences (116 improved, 316 regressed), 245179 unchanged.

--------------------------------------------------------------------------------

Artifacts:

@MihuBot
Copy link
Owner Author

MihuBot commented Jun 5, 2024

Top method regressions

451 (58.12 % of base) - System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector,T>.Dot(System.Runtime.Intrinsics.Vector256`1[ubyte],System.Runtime.Intrinsics.Vector256`1[ubyte]):ubyte
 ; Assembly listing for method System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Dot(System.Runtime.Intrinsics.Vector256`1[ubyte],System.Runtime.Intrinsics.Vector256`1[ubyte]):ubyte (FullOpts)
 ; Emitting BLENDED_CODE for X64 with AVX - Unix
 ; FullOpts code
 ; optimized code
 ; rbp based frame
-; partially interruptible
+; fully interruptible
 ; No PGO data
-; 0 inlinees with PGO data; 19 single block inlinees; 23 inlinees without PGO data
+; 0 inlinees with PGO data; 23 single block inlinees; 9 inlinees without PGO data
 ; Final local variable assignments
 ;
-;  V00 arg0         [V00,T27] (  2,  2   )  simd32  ->  mm0         single-def <System.Runtime.Intrinsics.Vector256`1[ubyte]>
-;  V01 arg1         [V01,T28] (  2,  2   )  simd32  ->  mm1         single-def <System.Runtime.Intrinsics.Vector256`1[ubyte]>
+;  V00 arg0         [V00,T32] (  2,  2   )  simd32  ->  [rbp+0x10]  single-def <System.Runtime.Intrinsics.Vector256`1[ubyte]>
+;  V01 arg1         [V01,T33] (  2,  2   )  simd32  ->  [rbp+0x30]  single-def <System.Runtime.Intrinsics.Vector256`1[ubyte]>
 ;# V02 OutArgs      [V02    ] (  1,  1   )  struct ( 0) [rsp+0x00]  do-not-enreg[XS] addr-exposed "OutgoingArgSpace"
-;  V03 tmp1         [V03,T12] (  2,  4   )     int  ->  rax         "impAppendStmt"
-;  V04 tmp2         [V04,T23] (  3,  6   )  simd16  ->  [rbp-0x10]  do-not-enreg[SF] "Inlining Arg" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
-;  V05 tmp3         [V05,T24] (  3,  6   )  simd16  ->  [rbp-0x20]  do-not-enreg[SF] "Inlining Arg" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
-;  V06 tmp4         [V06,T13] (  2,  4   )     int  ->  rax         "impAppendStmt"
-;  V07 tmp5         [V07,T04] (  8,  8   )   ubyte  ->  rax         ld-addr-op "Inline ldloca(s) first use temp"
-;* V08 tmp6         [V08,T19] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
-;  V09 tmp7         [V09    ] (  9, 18   )  struct ( 8) [rbp-0x28]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;  V10 tmp8         [V10    ] (  9, 18   )  struct ( 8) [rbp-0x30]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;* V11 tmp9         [V11    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
-;  V12 tmp10        [V12,T00] (  8, 16   )     int  ->  registers   "impAppendStmt"
-;* V13 tmp11        [V13    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
-;* V14 tmp12        [V14    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
-;* V15 tmp13        [V15    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V16 tmp14        [V16    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V17 tmp15        [V17    ] (  0,  0   )     int  ->  zero-ref    "Inlining Arg"
-;* V18 tmp16        [V18    ] (  0,  0   )     int  ->  zero-ref    "Inlining Arg"
-;  V19 tmp17        [V19,T08] (  8,  8   )   ubyte  ->  registers   "Inline return value spill temp"
-;* V20 tmp18        [V20    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V21 tmp19        [V21    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V22 tmp20        [V22    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
-;* V23 tmp21        [V23    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V24 tmp22        [V24    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
-;* V25 tmp23        [V25    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;  V26 tmp24        [V26,T05] (  8,  8   )   ubyte  ->  rcx         ld-addr-op "Inline ldloca(s) first use temp"
-;* V27 tmp25        [V27,T20] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
-;  V28 tmp26        [V28    ] (  9, 18   )  struct ( 8) [rbp-0x38]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;  V29 tmp27        [V29    ] (  9, 18   )  struct ( 8) [rbp-0x40]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V03 tmp1         [V03,T12] (  2,  4   )     int  ->  rbx         "impAppendStmt"
+;  V04 tmp2         [V04,T28] (  3,  6   )  simd16  ->  [rbp-0x30]  do-not-enreg[SF] "Inlining Arg" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
+;  V05 tmp3         [V05,T29] (  3,  6   )  simd16  ->  [rbp-0x40]  do-not-enreg[SF] "Inlining Arg" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
+;  V06 tmp4         [V06,T13] (  2,  4   )     int  ->  rbx         "impAppendStmt"
+;* V07 tmp5         [V07    ] (  0,  0   )  struct ( 8) zero-ref    "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V08 tmp6         [V08    ] (  0,  0   )  struct ( 8) zero-ref    "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V09 tmp7         [V09    ] (  0,  0   )  struct ( 8) zero-ref    "spilled call-like call argument" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V10 tmp8         [V10    ] (  2,  5   )  struct ( 8) [rbp-0x48]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V11 tmp9         [V11,T04] (  5, 17   )     int  ->  rsi         "Inline stloc first use temp"
+;  V12 tmp10        [V12    ] (  2, 10   )  struct ( 8) [rbp-0x50]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V13 tmp11        [V13    ] (  2, 10   )  struct ( 8) [rbp-0x58]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V14 tmp12        [V14    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
+;* V15 tmp13        [V15    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
+;* V16 tmp14        [V16    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
+;* V17 tmp15        [V17    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
+;* V18 tmp16        [V18    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
+;  V19 tmp17        [V19,T00] ( 16, 16   )   ubyte  ->  rbx         ld-addr-op "Inline ldloca(s) first use temp"
+;* V20 tmp18        [V20,T24] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
+;  V21 tmp19        [V21    ] (  9, 18   )  struct ( 8) [rbp-0x60]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V22 tmp20        [V22    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
+;* V23 tmp21        [V23    ] (  0,  0   )  struct ( 8) zero-ref    "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V24 tmp22        [V24    ] (  0,  0   )  struct ( 8) zero-ref    "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V25 tmp23        [V25    ] (  0,  0   )  struct ( 8) zero-ref    "spilled call-like call argument" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V26 tmp24        [V26    ] (  2,  5   )  struct ( 8) [rbp-0x68]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V27 tmp25        [V27,T05] (  5, 17   )     int  ->  r15         "Inline stloc first use temp"
+;  V28 tmp26        [V28    ] (  2, 10   )  struct ( 8) [rbp-0x70]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V29 tmp27        [V29    ] (  2, 10   )  struct ( 8) [rbp-0x78]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
 ;* V30 tmp28        [V30    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
-;  V31 tmp29        [V31,T01] (  8, 16   )     int  ->  registers   "impAppendStmt"
-;* V32 tmp30        [V32    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
-;* V33 tmp31        [V33    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
-;* V34 tmp32        [V34    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V35 tmp33        [V35    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V36 tmp34        [V36    ] (  0,  0   )     int  ->  zero-ref    "Inlining Arg"
-;* V37 tmp35        [V37    ] (  0,  0   )     int  ->  zero-ref    "Inlining Arg"
-;  V38 tmp36        [V38,T09] (  8,  8   )   ubyte  ->  registers   "Inline return value spill temp"
-;* V39 tmp37        [V39    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V40 tmp38        [V40    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V41 tmp39        [V41    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
-;* V42 tmp40        [V42    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V43 tmp41        [V43    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
-;* V44 tmp42        [V44    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V45 tmp43        [V45    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
-;* V46 tmp44        [V46    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V47 tmp45        [V47    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;  V48 tmp46        [V48,T25] (  3,  6   )  simd16  ->  [rbp-0x50]  do-not-enreg[SF] "Inlining Arg" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
-;  V49 tmp47        [V49,T26] (  3,  6   )  simd16  ->  [rbp-0x60]  do-not-enreg[SF] "Inlining Arg" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
-;  V50 tmp48        [V50,T14] (  2,  4   )     int  ->  rcx         "impAppendStmt"
-;  V51 tmp49        [V51,T06] (  8,  8   )   ubyte  ->  rcx         ld-addr-op "Inline ldloca(s) first use temp"
-;* V52 tmp50        [V52,T21] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
-;  V53 tmp51        [V53    ] (  9, 18   )  struct ( 8) [rbp-0x68]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;  V54 tmp52        [V54    ] (  9, 18   )  struct ( 8) [rbp-0x70]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;* V55 tmp53        [V55    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
-;  V56 tmp54        [V56,T02] (  8, 16   )     int  ->  registers   "impAppendStmt"
-;* V57 tmp55        [V57    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
-;* V58 tmp56        [V58    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
-;* V59 tmp57        [V59    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V60 tmp58        [V60    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V61 tmp59        [V61    ] (  0,  0   )     int  ->  zero-ref    "Inlining Arg"
-;* V62 tmp60        [V62    ] (  0,  0   )     int  ->  zero-ref    "Inlining Arg"
-;  V63 tmp61        [V63,T10] (  8,  8   )   ubyte  ->  registers   "Inline return value spill temp"
-;* V64 tmp62        [V64    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V65 tmp63        [V65    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V66 tmp64        [V66    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
-;* V67 tmp65        [V67    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V68 tmp66        [V68    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
-;* V69 tmp67        [V69    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;  V70 tmp68        [V70,T07] (  8,  8   )   ubyte  ->  rdx         ld-addr-op "Inline ldloca(s) first use temp"
-;* V71 tmp69        [V71,T22] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
-;  V72 tmp70        [V72    ] (  9, 18   )  struct ( 8) [rbp-0x78]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;  V73 tmp71        [V73    ] (  9, 18   )  struct ( 8) [rbp-0x80]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
-;* V74 tmp72        [V74    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
-;  V75 tmp73        [V75,T03] (  8, 16   )     int  ->  registers   "impAppendStmt"
-;* V76 tmp74        [V76    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
-;* V77 tmp75        [V77    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
-;* V78 tmp76        [V78    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V79 tmp77        [V79    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V80 tmp78        [V80    ] (  0,  0   )     int  ->  zero-ref    "Inlining Arg"
-;* V81 tmp79        [V81    ] (  0,  0   )     int  ->  zero-ref    "Inlining Arg"
-;  V82 tmp80        [V82,T11] (  8,  8   )   ubyte  ->  registers   "Inline return value spill temp"
-;* V83 tmp81        [V83    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V84 tmp82        [V84    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V85 tmp83        [V85    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
-;* V86 tmp84        [V86    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V87 tmp85        [V87    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
-;* V88 tmp86        [V88    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V89 tmp87        [V89    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
-;* V90 tmp88        [V90    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V91 tmp89        [V91    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;* V92 tmp90        [V92    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline return value spill temp"
-;* V93 tmp91        [V93    ] (  0,  0   )   ubyte  ->  zero-ref    "Inlining Arg"
-;  V94 tmp92        [V94    ] (  9, 17   )    long  ->  [rbp-0x28]  do-not-enreg[X] addr-exposed "field V09._00 (fldOffset=0x0)" P-DEP
-;  V95 tmp93        [V95    ] (  9, 17   )    long  ->  [rbp-0x30]  do-not-enreg[X] addr-exposed "field V10._00 (fldOffset=0x0)" P-DEP
-;  V96 tmp94        [V96    ] (  9, 17   )    long  ->  [rbp-0x38]  do-not-enreg[X] addr-exposed "field V28._00 (fldOffset=0x0)" P-DEP
-;  V97 tmp95        [V97    ] (  9, 17   )    long  ->  [rbp-0x40]  do-not-enreg[X] addr-exposed "field V29._00 (fldOffset=0x0)" P-DEP
-;  V98 tmp96        [V98    ] (  9, 17   )    long  ->  [rbp-0x68]  do-not-enreg[X] addr-exposed "field V53._00 (fldOffset=0x0)" P-DEP
-;  V99 tmp97        [V99    ] (  9, 17   )    long  ->  [rbp-0x70]  do-not-enreg[X] addr-exposed "field V54._00 (fldOffset=0x0)" P-DEP
-;  V100 tmp98       [V100    ] (  9, 17   )    long  ->  [rbp-0x78]  do-not-enreg[X] addr-exposed "field V72._00 (fldOffset=0x0)" P-DEP
-;  V101 tmp99       [V101    ] (  9, 17   )    long  ->  [rbp-0x80]  do-not-enreg[X] addr-exposed "field V73._00 (fldOffset=0x0)" P-DEP
-;  V102 cse0        [V102,T15] (  2,  2   )     int  ->  rcx         "CSE #01: moderate"
-;  V103 cse1        [V103,T16] (  2,  2   )     int  ->  rdx         "CSE #02: moderate"
-;  V104 cse2        [V104,T17] (  2,  2   )     int  ->  rdx         "CSE #03: moderate"
-;  V105 cse3        [V105,T18] (  2,  2   )     int  ->  rdi         "CSE #04: moderate"
+;* V31 tmp29        [V31    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
+;  V32 tmp30        [V32,T01] ( 16, 16   )   ubyte  ->  rax         ld-addr-op "Inline ldloca(s) first use temp"
+;* V33 tmp31        [V33,T25] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
+;  V34 tmp32        [V34    ] (  9, 18   )  struct ( 8) [rbp-0x80]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V35 tmp33        [V35    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
+;  V36 tmp34        [V36,T30] (  3,  6   )  simd16  ->  [rbp-0x90]  do-not-enreg[SF] "Inlining Arg" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
+;  V37 tmp35        [V37,T31] (  3,  6   )  simd16  ->  [rbp-0xA0]  do-not-enreg[SF] "Inlining Arg" <System.Runtime.Intrinsics.Vector128`1[ubyte]>
+;  V38 tmp36        [V38,T14] (  2,  4   )     int  ->  r15         "impAppendStmt"
+;* V39 tmp37        [V39    ] (  0,  0   )  struct ( 8) zero-ref    "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V40 tmp38        [V40    ] (  0,  0   )  struct ( 8) zero-ref    "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V41 tmp39        [V41    ] (  0,  0   )  struct ( 8) zero-ref    "spilled call-like call argument" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V42 tmp40        [V42    ] (  2,  5   )  struct ( 8) [rbp-0xA8]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V43 tmp41        [V43,T06] (  5, 17   )     int  ->  r15         "Inline stloc first use temp"
+;  V44 tmp42        [V44    ] (  2, 10   )  struct ( 8) [rbp-0xB0]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V45 tmp43        [V45    ] (  2, 10   )  struct ( 8) [rbp-0xB8]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V46 tmp44        [V46    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
+;* V47 tmp45        [V47    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
+;  V48 tmp46        [V48,T02] ( 16, 16   )   ubyte  ->  r15         ld-addr-op "Inline ldloca(s) first use temp"
+;* V49 tmp47        [V49,T26] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
+;  V50 tmp48        [V50    ] (  9, 18   )  struct ( 8) [rbp-0xC0]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V51 tmp49        [V51    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
+;* V52 tmp50        [V52    ] (  0,  0   )  struct ( 8) zero-ref    "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V53 tmp51        [V53    ] (  0,  0   )  struct ( 8) zero-ref    "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V54 tmp52        [V54    ] (  0,  0   )  struct ( 8) zero-ref    "spilled call-like call argument" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V55 tmp53        [V55    ] (  2,  5   )  struct ( 8) [rbp-0xC8]  do-not-enreg[XS] addr-exposed ld-addr-op "Inline ldloca(s) first use temp" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V56 tmp54        [V56,T07] (  5, 17   )     int  ->  r14         "Inline stloc first use temp"
+;  V57 tmp55        [V57    ] (  2, 10   )  struct ( 8) [rbp-0xD0]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;  V58 tmp56        [V58    ] (  2, 10   )  struct ( 8) [rbp-0xD8]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V59 tmp57        [V59    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
+;* V60 tmp58        [V60    ] (  0,  0   )   ubyte  ->  zero-ref    "Inline stloc first use temp"
+;  V61 tmp59        [V61,T03] ( 16, 16   )   ubyte  ->  rax         ld-addr-op "Inline ldloca(s) first use temp"
+;* V62 tmp60        [V62,T27] (  0,  0   )     int  ->  zero-ref    "Inline stloc first use temp"
+;  V63 tmp61        [V63    ] (  9, 18   )  struct ( 8) [rbp-0xE0]  do-not-enreg[XS] addr-exposed ld-addr-op "Inlining Arg" <System.Runtime.Intrinsics.Vector64`1[ubyte]>
+;* V64 tmp62        [V64    ] (  0,  0   )     int  ->  zero-ref    "impAppendStmt"
+;  V65 tmp63        [V65,T16] (  2,  2   )    long  ->  rsi         "field V07._00 (fldOffset=0x0)" P-INDEP
+;  V66 tmp64        [V66,T17] (  2,  2   )    long  ->  rdi         "field V08._00 (fldOffset=0x0)" P-INDEP
+;* V67 tmp65        [V67    ] (  0,  0   )    long  ->  zero-ref    "field V09._00 (fldOffset=0x0)" P-INDEP
+;  V68 tmp66        [V68    ] (  2,  5   )    long  ->  [rbp-0x48]  do-not-enreg[X] addr-exposed "field V10._00 (fldOffset=0x0)" P-DEP
+;  V69 tmp67        [V69    ] (  2,  9   )    long  ->  [rbp-0x50]  do-not-enreg[X] addr-exposed "field V12._00 (fldOffset=0x0)" P-DEP
+;  V70 tmp68        [V70    ] (  2,  9   )    long  ->  [rbp-0x58]  do-not-enreg[X] addr-exposed "field V13._00 (fldOffset=0x0)" P-DEP
+;  V71 tmp69        [V71    ] (  9, 17   )    long  ->  [rbp-0x60]  do-not-enreg[X] addr-exposed "field V21._00 (fldOffset=0x0)" P-DEP
+;  V72 tmp70        [V72,T18] (  2,  2   )    long  ->  rdi         "field V23._00 (fldOffset=0x0)" P-INDEP
+;  V73 tmp71        [V73,T19] (  2,  2   )    long  ->  rsi         "field V24._00 (fldOffset=0x0)" P-INDEP
+;* V74 tmp72        [V74    ] (  0,  0   )    long  ->  zero-ref    "field V25._00 (fldOffset=0x0)" P-INDEP
+;  V75 tmp73        [V75    ] (  2,  5   )    long  ->  [rbp-0x68]  do-not-enreg[X] addr-exposed "field V26._00 (fldOffset=0x0)" P-DEP
+;  V76 tmp74        [V76    ] (  2,  9   )    long  ->  [rbp-0x70]  do-not-enreg[X] addr-exposed "field V28._00 (fldOffset=0x0)" P-DEP
+;  V77 tmp75        [V77    ] (  2,  9   )    long  ->  [rbp-0x78]  do-not-enreg[X] addr-exposed "field V29._00 (fldOffset=0x0)" P-DEP
+;  V78 tmp76        [V78    ] (  9, 17   )    long  ->  [rbp-0x80]  do-not-enreg[X] addr-exposed "field V34._00 (fldOffset=0x0)" P-DEP
+;  V79 tmp77        [V79,T20] (  2,  2   )    long  ->  rdi         "field V39._00 (fldOffset=0x0)" P-INDEP
+;  V80 tmp78        [V80,T21] (  2,  2   )    long  ->  rsi         "field V40._00 (fldOffset=0x0)" P-INDEP
+;* V81 tmp79        [V81    ] (  0,  0   )    long  ->  zero-ref    "field V41._00 (fldOffset=0x0)" P-INDEP
+;  V82 tmp80        [V82    ] (  2,  5   )    long  ->  [rbp-0xA8]  do-not-enreg[X] addr-exposed "field V42._00 (fldOffset=0x0)" P-DEP
+;  V83 tmp81        [V83    ] (  2,  9   )    long  ->  [rbp-0xB0]  do-not-enreg[X] addr-exposed "field V44._00 (fldOffset=0x0)" P-DEP
+;  V84 tmp82        [V84    ] (  2,  9   )    long  ->  [rbp-0xB8]  do-not-enreg[X] addr-exposed "field V45._00 (fldOffset=0x0)" P-DEP
+;  V85 tmp83        [V85    ] (  9, 17   )    long  ->  [rbp-0xC0]  do-not-enreg[X] addr-exposed "field V50._00 (fldOffset=0x0)" P-DEP
+;  V86 tmp84        [V86,T22] (  2,  2   )    long  ->  rdi         "field V52._00 (fldOffset=0x0)" P-INDEP
+;  V87 tmp85        [V87,T23] (  2,  2   )    long  ->  rsi         "field V53._00 (fldOffset=0x0)" P-INDEP
+;* V88 tmp86        [V88    ] (  0,  0   )    long  ->  zero-ref    "field V54._00 (fldOffset=0x0)" P-INDEP
+;  V89 tmp87        [V89    ] (  2,  5   )    long  ->  [rbp-0xC8]  do-not-enreg[X] addr-exposed "field V55._00 (fldOffset=0x0)" P-DEP
+;  V90 tmp88        [V90    ] (  2,  9   )    long  ->  [rbp-0xD0]  do-not-enreg[X] addr-exposed "field V57._00 (fldOffset=0x0)" P-DEP
+;  V91 tmp89        [V91    ] (  2,  9   )    long  ->  [rbp-0xD8]  do-not-enreg[X] addr-exposed "field V58._00 (fldOffset=0x0)" P-DEP
+;  V92 tmp90        [V92    ] (  9, 17   )    long  ->  [rbp-0xE0]  do-not-enreg[X] addr-exposed "field V63._00 (fldOffset=0x0)" P-DEP
+;  V93 tmp91        [V93,T15] (  2,  4   )     int  ->  rsi         "argument with side effect"
+;  V94 cse0         [V94,T08] (  4, 16   )    long  ->  rax         "CSE #01: aggressive"
+;  V95 cse1         [V95,T09] (  4, 16   )    long  ->  r14         "CSE #02: aggressive"
+;  V96 cse2         [V96,T10] (  4, 16   )    long  ->  r14         "CSE #03: aggressive"
+;  V97 cse3         [V97,T11] (  4, 16   )    long  ->  r13         "CSE #04: aggressive"
 ;
-; Lcl frame size = 128
+; Lcl frame size = 192
 
 G_M42821_IG01:
        push     rbp
-       sub      rsp, 128
-       lea      rbp, [rsp+0x80]
+       push     r15
+       push     r14
+       push     r13
+       push     rbx
+       sub      rsp, 192
+       lea      rbp, [rsp+0xE0]
        vmovups  ymm0, ymmword ptr [rbp+0x10]
        vmovups  ymm1, ymmword ptr [rbp+0x30]
-						;; size=26 bbWeight=1 PerfScore 9.75
+						;; size=33 bbWeight=1 PerfScore 13.75
 G_M42821_IG02:
+       vmovups  ymmword ptr [rbp+0x10], ymm0
        vmovaps  ymm2, ymm0
-       vmovaps  xmmword ptr [rbp-0x10], xmm2
+       vmovaps  xmmword ptr [rbp-0x30], xmm2
+       vmovups  ymmword ptr [rbp+0x30], ymm1
        vmovaps  ymm2, ymm1
-       vmovaps  xmmword ptr [rbp-0x20], xmm2
-       mov      rax, qword ptr [rbp-0x10]
-       mov      qword ptr [rbp-0x28], rax
-       mov      rax, qword ptr [rbp-0x20]
-       mov      qword ptr [rbp-0x30], rax
-       movzx    rax, byte  ptr [rbp-0x28]
-       movzx    rcx, byte  ptr [rbp-0x30]
-       imul     eax, ecx
-       movzx    rax, al
-       movzx    rcx, byte  ptr [rbp-0x27]
-       movzx    rdx, byte  ptr [rbp-0x2F]
-       imul     ecx, edx
-       movzx    rcx, cl
-       add      eax, ecx
-       movzx    rcx, al
-       movzx    rax, cl
-       movzx    rcx, byte  ptr [rbp-0x26]
-       movzx    rdx, byte  ptr [rbp-0x2E]
-       imul     ecx, edx
-       movzx    rcx, cl
-       movzx    rdx, byte  ptr [rbp-0x25]
-       movzx    rdi, byte  ptr [rbp-0x2D]
-       imul     edx, edi
-       movzx    rdx, dl
-       add      ecx, edx
-       add      eax, ecx
-       movzx    rax, al
-       movzx    rdx, byte  ptr [rbp-0x24]
-       movzx    rcx, byte  ptr [rbp-0x2C]
-       imul     ecx, edx
-       movzx    rcx, cl
-       movzx    rdx, byte  ptr [rbp-0x23]
-       movzx    rdi, byte  ptr [rbp-0x2B]
-       imul     edx, edi
-       movzx    rdx, dl
-       add      ecx, edx
-       add      ecx, eax
-       movzx    rax, cl
-       movzx    rdx, byte  ptr [rbp-0x22]
-       movzx    rcx, byte  ptr [rbp-0x2A]
-       imul     ecx, edx
-       movzx    rcx, cl
-       movzx    rdx, byte  ptr [rbp-0x21]
-       movzx    rdi, byte  ptr [rbp-0x29]
-       imul     edx, edi
-       movzx    rdx, dl
-       add      ecx, edx
-       add      ecx, eax
-       movzx    rax, cl
-       mov      rcx, qword ptr [rbp-0x08]
-       mov      qword ptr [rbp-0x38], rcx
-       mov      rcx, qword ptr [rbp-0x18]
-       mov      qword ptr [rbp-0x40], rcx
-       movzx    rcx, byte  ptr [rbp-0x38]
-       movzx    rdx, byte  ptr [rbp-0x40]
-       imul     ecx, edx
-       movzx    rcx, cl
-       movzx    rdx, byte  ptr [rbp-0x37]
-       movzx    rdi, byte  ptr [rbp-0x3F]
-       imul     edx, edi
-       movzx    rdx, dl
-       add      ecx, edx
-       movzx    rdx, cl
-       movzx    rcx, dl
-       movzx    rdx, byte  ptr [rbp-0x36]
-       movzx    rdi, byte  ptr [rbp-0x3E]
-       imul     edx, edi
-       movzx    rdx, dl
-       movzx    rdi, byte  ptr [rbp-0x35]
-						;; size=251 bbWeight=1 PerfScore 62.00
+       vmovaps  xmmword ptr [rbp-0x40], xmm2
+       mov      rsi, qword ptr [rbp-0x30]
+       mov      rdi, qword ptr [rbp-0x40]
+       mov      qword ptr [rbp-0x50], rsi
+       mov      qword ptr [rbp-0x58], rdi
+       xor      esi, esi
+       align    [0 bytes for IG03]
+						;; size=46 bbWeight=1 PerfScore 8.75
 G_M42821_IG03:
-       movzx    rsi, byte  ptr [rbp-0x3D]
-       imul     edi, esi
-       movzx    rdi, dil
-       add      edx, edi
-       add      ecx, edx
-       movzx    rcx, cl
-       movzx    rdi, byte  ptr [rbp-0x34]
-       movzx    rdx, byte  ptr [rbp-0x3C]
-       imul     edx, edi
-       movzx    rdx, dl
-       movzx    rdi, byte  ptr [rbp-0x33]
-       movzx    rsi, byte  ptr [rbp-0x3B]
-       imul     edi, esi
-       movzx    rdi, dil
-       add      edx, edi
-       add      edx, ecx
-       movzx    rcx, dl
-       movzx    rdi, byte  ptr [rbp-0x32]
-       movzx    rdx, byte  ptr [rbp-0x3A]
-       imul     edx, edi
-       movzx    rdx, dl
-       movzx    rdi, byte  ptr [rbp-0x31]
-       movzx    rsi, byte  ptr [rbp-0x39]
-       imul     edi, esi
-       movzx    rdi, dil
-       add      edx, edi
-       add      edx, ecx
-       movzx    rcx, dl
-       add      eax, ecx
-       movzx    rax, al
-       vextractf128 xmm0, ymm0, 1
-       vmovaps  xmmword ptr [rbp-0x50], xmm0
-       vextractf128 xmm0, ymm1, 1
-       vmovaps  xmmword ptr [rbp-0x60], xmm0
-       mov      rcx, qword ptr [rbp-0x50]
-       mov      qword ptr [rbp-0x68], rcx
-       mov      rcx, qword ptr [rbp-0x60]
-       mov      qword ptr [rbp-0x70], rcx
-       movzx    rcx, byte  ptr [rbp-0x68]
-       movzx    rdx, byte  ptr [rbp-0x70]
-       imul     ecx, edx
-       movzx    rcx, cl
-       movzx    rdx, byte  ptr [rbp-0x67]
-       movzx    rdi, byte  ptr [rbp-0x6F]
-       imul     edx, edi
-       movzx    rdx, dl
-       add      ecx, edx
-       movzx    rdx, cl
-       movzx    rcx, dl
-       movzx    rdx, byte  ptr [rbp-0x66]
-       movzx    rdi, byte  ptr [rbp-0x6E]
-       imul     edx, edi
-       movzx    rdx, dl
-       movzx    rdi, byte  ptr [rbp-0x65]
-       movzx    rsi, byte  ptr [rbp-0x6D]
-       imul     edi, esi
-       movzx    rdi, dil
-       add      edx, edi
-       add      ecx, edx
-       movzx    rcx, cl
-       movzx    rdi, byte  ptr [rbp-0x64]
-       movzx    rdx, byte  ptr [rbp-0x6C]
-       imul     edx, edi
-       movzx    rdx, dl
-       movzx    rdi, byte  ptr [rbp-0x63]
-       movzx    rsi, byte  ptr [rbp-0x6B]
-       imul     edi, esi
-       movzx    rdi, dil
-       add      edx, edi
-       add      edx, ecx
-       movzx    rcx, dl
-       movzx    rdi, byte  ptr [rbp-0x62]
-       movzx    rdx, byte  ptr [rbp-0x6A]
-       imul     edx, edi
-						;; size=267 bbWeight=1 PerfScore 64.75
+       lea      rdi, [rbp-0x50]
+       movsxd   rax, esi
+       movzx    rdi, byte  ptr [rdi+rax]
+       lea      rcx, [rbp-0x58]
+       movzx    rcx, byte  ptr [rcx+rax]
+       imul     edi, ecx
+       lea      rcx, [rbp-0x48]
+       mov      byte  ptr [rcx+rax], dil
+       inc      esi
+       cmp      esi, 8
+       jl       SHORT G_M42821_IG03
+						;; size=38 bbWeight=4 PerfScore 41.00
 G_M42821_IG04:
-       movzx    rdx, dl
-       movzx    rdi, byte  ptr [rbp-0x61]
-       movzx    rsi, byte  ptr [rbp-0x69]
-       imul     edi, esi
-       movzx    rdi, dil
-       add      edx, edi
-       add      edx, ecx
-       movzx    rcx, dl
-       mov      rdx, qword ptr [rbp-0x48]
-       mov      qword ptr [rbp-0x78], rdx
-       mov      rdx, qword ptr [rbp-0x58]
-       mov      qword ptr [rbp-0x80], rdx
-       movzx    rdx, byte  ptr [rbp-0x78]
-       movzx    rdi, byte  ptr [rbp-0x80]
-       imul     edx, edi
-       movzx    rdx, dl
-       movzx    rdi, byte  ptr [rbp-0x77]
+       mov      rsi, qword ptr [rbp-0x48]
+       mov      qword ptr [rbp-0x60], rsi
+       movzx    rsi, byte  ptr [rbp-0x60]
+       xor      edi, edi
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      ebx, eax
+       movzx    rsi, byte  ptr [rbp-0x5F]
+       mov      edi, ebx
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      ebx, eax
+       movzx    rsi, byte  ptr [rbp-0x5E]
+       mov      edi, ebx
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      ebx, eax
+       movzx    rsi, byte  ptr [rbp-0x5D]
+       mov      edi, ebx
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      ebx, eax
+       movzx    rsi, byte  ptr [rbp-0x5C]
+       mov      edi, ebx
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      ebx, eax
+       movzx    rsi, byte  ptr [rbp-0x5B]
+       mov      edi, ebx
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      ebx, eax
+       movzx    rsi, byte  ptr [rbp-0x5A]
+       mov      edi, ebx
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      ebx, eax
+       movzx    rsi, byte  ptr [rbp-0x59]
+       mov      edi, ebx
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      ebx, eax
+       mov      rdi, qword ptr [rbp-0x28]
+       mov      rsi, qword ptr [rbp-0x38]
+       mov      qword ptr [rbp-0x70], rdi
+       mov      qword ptr [rbp-0x78], rsi
+       xor      r15d, r15d
+						;; size=195 bbWeight=1 PerfScore 44.25
+G_M42821_IG05:
+       lea      rdi, [rbp-0x70]
+       movsxd   r14, r15d
+       movzx    rdi, byte  ptr [rdi+r14]
+       lea      rsi, [rbp-0x78]
+       movzx    rsi, byte  ptr [rsi+r14]
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Multiply(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Multiply(ubyte,ubyte):ubyte
+       lea      rsi, [rbp-0x68]
+       mov      byte  ptr [rsi+r14], al
+       inc      r15d
+       cmp      r15d, 8
+       jl       SHORT G_M42821_IG05
+						;; size=50 bbWeight=4 PerfScore 46.00
+G_M42821_IG06:
+       mov      rsi, qword ptr [rbp-0x68]
+       mov      qword ptr [rbp-0x80], rsi
+       movzx    rsi, byte  ptr [rbp-0x80]
+       xor      edi, edi
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
        movzx    rsi, byte  ptr [rbp-0x7F]
-       imul     edi, esi
-       movzx    rdi, dil
-       add      edx, edi
-       movzx    rdi, dl
-       movzx    rdx, dil
-       movzx    rdi, byte  ptr [rbp-0x76]
+       mov      edi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
        movzx    rsi, byte  ptr [rbp-0x7E]
-       imul     edi, esi
-       movzx    rdi, dil
-       movzx    rsi, byte  ptr [rbp-0x75]
-       movzx    r8, byte  ptr [rbp-0x7D]
-       imul     esi, r8d
-       movzx    rsi, sil
-       add      edi, esi
-       add      edx, edi
-       movzx    rdx, dl
-       movzx    rsi, byte  ptr [rbp-0x74]
-       movzx    rdi, byte  ptr [rbp-0x7C]
-       imul     edi, esi
-       movzx    rdi, dil
-       movzx    rsi, byte  ptr [rbp-0x73]
-       movzx    r8, byte  ptr [rbp-0x7B]
-       imul     esi, r8d
-       movzx    rsi, sil
-       add      edi, esi
-       add      edi, edx
-       movzx    rdx, dil
-       movzx    rsi, byte  ptr [rbp-0x72]
-       movzx    rdi, byte  ptr [rbp-0x7A]
-       imul     edi, esi
-       movzx    rdi, dil
-       movzx    rsi, byte  ptr [rbp-0x71]
-       movzx    r8, byte  ptr [rbp-0x79]
-       imul     esi, r8d
-       movzx    rsi, sil
-       add      edi, esi
-       add      edi, edx
-       movzx    rdx, dil
-       add      ecx, edx
-       add      eax, ecx
-       movzx    rax, al
-						;; size=220 bbWeight=1 PerfScore 47.00
-G_M42821_IG05:
+       mov      edi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       movzx    rsi, byte  ptr [rbp-0x7D]
+       mov      edi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       movzx    rsi, byte  ptr [rbp-0x7C]
+       mov      edi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       movzx    rsi, byte  ptr [rbp-0x7B]
+       mov      edi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       movzx    rsi, byte  ptr [rbp-0x7A]
+       mov      edi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       movzx    rsi, byte  ptr [rbp-0x79]
+       mov      edi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       movzx    rdi, bl
+       mov      esi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      ebx, eax
+       vmovups  ymm0, ymmword ptr [rbp+0x10]
+       vextractf128 xmm0, ymm0, 1
+       vmovaps  xmmword ptr [rbp-0x90], xmm0
+       vmovups  ymm1, ymmword ptr [rbp+0x30]
+       vextractf128 xmm0, ymm1, 1
+       vmovaps  xmmword ptr [rbp-0xA0], xmm0
+       mov      rdi, qword ptr [rbp-0x90]
+       mov      rsi, qword ptr [rbp-0xA0]
+       mov      qword ptr [rbp-0xB0], rdi
+       mov      qword ptr [rbp-0xB8], rsi
+       xor      r15d, r15d
+						;; size=249 bbWeight=1 PerfScore 60.25
+G_M42821_IG07:
+       lea      rdi, [rbp-0xB0]
+       movsxd   r14, r15d
+       movzx    rdi, byte  ptr [rdi+r14]
+       lea      rsi, [rbp-0xB8]
+       movzx    rsi, byte  ptr [rsi+r14]
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Multiply(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Multiply(ubyte,ubyte):ubyte
+       lea      rsi, [rbp-0xA8]
+       mov      byte  ptr [rsi+r14], al
+       inc      r15d
+       cmp      r15d, 8
+       jl       SHORT G_M42821_IG07
+						;; size=59 bbWeight=4 PerfScore 46.00
+G_M42821_IG08:
+       mov      rsi, qword ptr [rbp-0xA8]
+       mov      qword ptr [rbp-0xC0], rsi
+       movzx    rsi, byte  ptr [rbp-0xC0]
+       xor      edi, edi
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      r15d, eax
+       movzx    rsi, byte  ptr [rbp-0xBF]
+       mov      edi, r15d
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      r15d, eax
+       movzx    rsi, byte  ptr [rbp-0xBE]
+       mov      edi, r15d
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      r15d, eax
+       movzx    rsi, byte  ptr [rbp-0xBD]
+       mov      edi, r15d
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      r15d, eax
+       movzx    rsi, byte  ptr [rbp-0xBC]
+       mov      edi, r15d
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      r15d, eax
+       movzx    rsi, byte  ptr [rbp-0xBB]
+       mov      edi, r15d
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      r15d, eax
+       movzx    rsi, byte  ptr [rbp-0xBA]
+       mov      edi, r15d
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      r15d, eax
+       movzx    rsi, byte  ptr [rbp-0xB9]
+       mov      edi, r15d
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      r15d, eax
+       mov      rdi, qword ptr [rbp-0x88]
+       mov      rsi, qword ptr [rbp-0x98]
+       mov      qword ptr [rbp-0xD0], rdi
+       mov      qword ptr [rbp-0xD8], rsi
+       xor      r14d, r14d
+						;; size=252 bbWeight=1 PerfScore 44.25
+G_M42821_IG09:
+       lea      rdi, [rbp-0xD0]
+       movsxd   r13, r14d
+       movzx    rdi, byte  ptr [rdi+r13]
+       lea      rsi, [rbp-0xD8]
+       movzx    rsi, byte  ptr [rsi+r13]
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Multiply(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Multiply(ubyte,ubyte):ubyte
+       lea      rsi, [rbp-0xC8]
+       mov      byte  ptr [rsi+r13], al
+       inc      r14d
+       cmp      r14d, 8
+       jl       SHORT G_M42821_IG09
+						;; size=59 bbWeight=4 PerfScore 46.00
+G_M42821_IG10:
+       mov      rsi, qword ptr [rbp-0xC8]
+       mov      qword ptr [rbp-0xE0], rsi
+       movzx    rsi, byte  ptr [rbp-0xE0]
+       xor      edi, edi
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       movzx    rsi, byte  ptr [rbp-0xDF]
+       mov      edi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       movzx    rsi, byte  ptr [rbp-0xDE]
+       mov      edi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       movzx    rsi, byte  ptr [rbp-0xDD]
+       mov      edi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       movzx    rsi, byte  ptr [rbp-0xDC]
+       mov      edi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       movzx    rsi, byte  ptr [rbp-0xDB]
+       mov      edi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       movzx    rsi, byte  ptr [rbp-0xDA]
+       mov      edi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       movzx    rsi, byte  ptr [rbp-0xD9]
+       mov      edi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       movzx    rdi, r15b
+       mov      esi, eax
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       mov      esi, eax
+       movzx    rdi, bl
+       mov      rax, 0xD1FFAB1E      ; code for System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       call     [rax]System.Runtime.Intrinsics.Scalar`1[ubyte]:Add(ubyte,ubyte):ubyte
+       nop      
+						;; size=227 bbWeight=1 PerfScore 45.75
+G_M42821_IG11:
        vzeroupper 
-       add      rsp, 128
+       add      rsp, 192
+       pop      rbx
+       pop      r13
+       pop      r14
+       pop      r15
        pop      rbp
        ret      
-						;; size=12 bbWeight=1 PerfScore 2.75
+						;; size=19 bbWeight=1 PerfScore 4.75
 
-; Total bytes of code 776, prolog size 16, PerfScore 186.25, instruction count 214, allocated bytes for code 776 (MethodHash=d63158ba) for method System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Dot(System.Runtime.Intrinsics.Vector256`1[ubyte],System.Runtime.Intrinsics.Vector256`1[ubyte]):ubyte (FullOpts)
+; Total bytes of code 1227, prolog size 33, PerfScore 400.75, instruction count 263, allocated bytes for code 1227 (MethodHash=d63158ba) for method System.Runtime.Intrinsics.Vector256`1[ubyte]:System.Runtime.Intrinsics.ISimdVector<System.Runtime.Intrinsics.Vector256<T>,T>.Dot(System.Runtime.Intrinsics.Vector256`1[ubyte],System.Runtime.Intrinsics.Vector256`1[ubyte]):ubyte (FullOpts)

Note: some changes were skipped as they were too large to fit into a comment.

Larger list of diffs: https://gist.github.com/MihuBot/f6ef2f25cbdf1a6fa108ff971f43d624

@MihuBot
Copy link
Owner Author

MihuBot commented Jun 5, 2024

@MihaZupan

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant