
Base: correctly rounded floats constructed from rationals #49749

Open
nsajko wants to merge 1 commit into master from exact_rational_to_float
Conversation

nsajko (Contributor) commented May 11, 2023

Constructing a floating-point number from a Rational should now be correctly rounded.

Implementation approach:

  1. Convert the (numerator, denominator) pair to a (sign bit, integral significand, exponent) triplet using integer arithmetic. The integer type in question must be wide enough.

  2. Convert the above triplet into an instance of the chosen FP type. There is special support for IEEE 754 floating-point and for BigFloat, otherwise a fallback using ldexp is used.

As a bonus, constructing a BigFloat from a Rational should now be thread-safe when the rounding mode and precision are provided to the constructor, because there is no access to the global precision or rounding mode settings.
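A rough sketch of the two-step scheme, with hypothetical names and many simplifications (no subnormal or overflow handling, ties rounded away from zero instead of to even), just to illustrate the shape of the conversion; this is not the PR's actual code:

function rational_to_float_sketch(::Type{F}, x::Rational) where {F<:AbstractFloat}
    num = BigInt(numerator(x))
    den = BigInt(denominator(x))
    iszero(num) && return zero(F)
    neg = num < 0
    num = abs(num)
    prec = precision(F)
    exp = 0
    # Step 1: scale num/den by powers of two until the integer quotient has
    # exactly `prec` bits, tracking the scaling in `exp`.
    while div(num, den) < big(2)^(prec - 1)
        num <<= 1
        exp -= 1
    end
    while div(num, den) >= big(2)^prec
        den <<= 1
        exp += 1
    end
    q, r = divrem(num, den)
    q += (2r >= den)   # round the integral significand to nearest (ties away from zero)
    # Step 2: assemble the value from the (sign, significand, exponent) triplet,
    # here via the `ldexp` fallback mentioned above.
    result = ldexp(F(q), exp)
    return neg ? -result : result
end

rational_to_float_sketch(Float64, 1 // 3) == Float64(1 // 3)   # true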

Updates #45213

Updates #50940

Updates #52507

Fixes #52394

Closes #52395

Fixes #52859

nsajko (Contributor, Author) commented May 11, 2023

Regarding performance:

  1. The ldexp calls are not essential for the IEEE floating-point formats; ldexp could be replaced with a bit of bit twiddling for those common cases. I can do that now or in a future PR.
  2. BigInt should have its own implementation for performance reasons, although I'm not sure whether it'd be better to put that in Base or in MutableArithmetics.

Seelengrab (Contributor) left a comment


We have lots of screen space - please, let's use it not just vertically. We are fine with 92 characters per line (sometimes more if it's just a few characters over) after all.

@nsajko force-pushed the exact_rational_to_float branch 2 times, most recently from 2a10919 to 50d5aa8 on May 12, 2023
nsajko (Contributor, Author) commented May 12, 2023

The second commit is the mentioned optimization of using bit twiddling instead of ldexp for IEEE floats. If you're fine with having it in this PR, I can squash them, otherwise I'll move it to a subsequent PR.

Regarding the 92 character line length limit, does it apply to comments equally? I usually try to have comments be narrower than code, but if you want I'll rewrap them to 92 columns.

Seelengrab (Contributor) left a comment


Thank you for sticking with this! I haven't added the style points to every occurrence, but please apply them uniformly.

We can worry about performance afterwards, since this PR got started with correctness. Talking about potential performance improvements without benchmarks is a bit of a moot point.

Seelengrab (Contributor) left a comment


Looks good!

I think the gain in correctness is nice, now let's hope this doesn't cost too much performance 😬

nsajko (Contributor, Author) commented May 17, 2023

performance

Profiling reveals that a considerable amount of time is spent in the line

y = Rational(promote(numerator(x), denominator(x))...)

The Rational call spends almost all of its time in divgcd; however, this is unnecessary and could be avoided by calling unsafe_rational instead (see the sketch below). I think this will be the single biggest performance win, but there are other optimization opportunities that should be looked into, such as optimizing rational_to_float_components_impl to do just one division and then round manually (as it did before), and replacing ldexp with bit-level operations for IEEE floats.
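A minimal illustration of the suggested change (Base.unsafe_rational is an internal, non-public helper available in recent Julia versions; the surrounding names here are only for the example):

x = 22 // 7                                    # example input, already in lowest terms
num, den = promote(numerator(x), denominator(x))
# The public Rational constructor would re-run divgcd here; unsafe_rational
# builds the value directly, assuming it is already normalized.
y = Base.unsafe_rational(num, den)
@assert y == x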

Seelengrab (Contributor):

Which cases spend their time in promotion in particular? IIRC this should be a noop if there's nothing to promote/convert 👀

nsajko (Contributor, Author) commented May 17, 2023

Which cases spend their time in promotion in particular? IIRC this should be a noop if there's nothing to promote/convert

The problem is not promote, it's the Rational call in the same line. I already replaced it with unsafe_rational locally, to avoid the costly but unnecessary divgcd.

@nsajko force-pushed the exact_rational_to_float branch 2 times, most recently from 424dc82 to dd48fdd on May 17, 2023
nsajko (Contributor, Author) commented May 17, 2023

JET shows that this line causes runtime dispatch, I think because the type of rm is unknown:

c = rational_to_float_components(num, den, prec, BigInt, rm)

││││┌ @ mpfr.jl:329 Base.MPFR.rational_to_float_components(%18, %27, prec, Base.MPFR.BigInt, %102)
│││││ runtime dispatch detected: Base.MPFR.rational_to_float_components(%18::Int64, %27::Int64, prec::Int64, Base.MPFR.BigInt, %102::RoundingMode)::Base.FloatComponentsResult{BigInt, Int64}

There doesn't seem to be a way around this without introducing a new type for roundings and using it in rational_to_float_components_impl instead of RoundingMode. This doesn't seem too difficult but it may be overkill?
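For reference, one conventional way to avoid this kind of runtime dispatch is to split the abstractly typed rm into the known concrete modes by hand; this is only a generic, hypothetical sketch, not a claim about what this PR should do, and whether the extra branches are worth it is exactly the trade-off discussed in the replies below.

function call_with_concrete_rounding(f, rm::RoundingMode, args...)
    # Branching on the known modes lets the compiler see a concrete
    # RoundingMode{...} type in each arm, so the inner call can be
    # statically dispatched; any other mode falls back to dynamic dispatch.
    if rm == RoundNearest
        f(RoundNearest, args...)
    elseif rm == RoundUp
        f(RoundUp, args...)
    elseif rm == RoundDown
        f(RoundDown, args...)
    elseif rm == RoundToZero
        f(RoundToZero, args...)
    else
        f(rm, args...)
    end
end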

Seelengrab (Contributor):

This is just for BigFloat, right? I don't think this dispatch will be particularly problematic then - any computation involving BigFloat of sizes/precisions where it's worth using BigFloat instead of something like Float128 will probably be bottlenecked by MPFR/GMP operations under the hood instead of this dispatch (not to mention the other allocating BigFloat/BigInt operations in here).

nsajko (Contributor, Author) commented May 18, 2023

Is there still something to do here from my side?

nsajko (Contributor, Author) commented May 18, 2023

Another small optimization would be using ndigits0zpb instead of ndigits. Not sure if I should do this now or wait until this PR is merged. EDIT: probably top_set_bit would be even better than ndigits.
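For context, the two agree on positive integers (the binary digit count equals the position of the highest set bit), so the swap would be purely an optimization; a small example:

n = 12345
# Both give the bit width of n; Base.top_set_bit (available on newer Julia
# versions) avoids the generic digit-counting machinery of ndigits.
@assert ndigits(n, base = 2) == Base.top_set_bit(n) == 14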

Seelengrab (Contributor) left a comment


Other than these, yeah, looks good from my POV 👍

@nsajko force-pushed the exact_rational_to_float branch 2 times, most recently from 479e877 to 508cfdd on May 19, 2023
@giordano added the domain:rationals (The Rational type and values thereof), domain:maths (Mathematical functions), and status:forget me not (PRs that one wants to make sure aren't forgotten) labels on May 24, 2023
nsajko (Contributor, Author) commented Jan 14, 2024

Added a test, apart from addressing the comments.

nsajko (Contributor, Author) commented Jan 14, 2024

Expanded the / doc string with relevant information.

(; integral_significand, exponent)
end

function to_float_components(::Type{T}, num, den, precision, max_subnormal_exp, romo, sb) where {T}
Member:

does this method need to exist? if this is purely for internal use, surely the caller can be responsible for converting num to the type they want.

nsajko (Contributor, Author):

It's also used in mpfr.jl

base/rational.jl Outdated
(numerator(y), denominator(y))
end

function rational_to_floating_point(::Type{F}, x, rm = RoundNearest, prec = precision(F)) where {F}
Member:

do we actually care about giving the user the ability to specify rounding modes here? Most other functions in Base don't, and giving that up seems like it would simplify the code a lot.

nsajko (Contributor, Author):

We need the rounding modes for BigFloat. Also, having rounding modes other than the default one enables more thorough testing that applies to all rounding modes.

nsajko (Contributor, Author):

Most of the logic for the rounding modes is now in rounding.jl anyway.

nsajko (Contributor, Author):

to_floating_point_impl(::Type{T}, ::Type{S}, num, den, rm, prec) where {T<:Base.IEEEFloat,S} could be simplified by requiring rm == RoundNearest, but then the tests would suffer.

nsajko (Contributor, Author) commented Jan 14, 2024:

giving the user the ability

to be clear, that particular method is not meant to be public (I think the PR doesn't add any new public names)

Member:

Can't we do the BigFloat ones by just doing arbitrary precision BigFloat division? I'm not really sure if it's worth adding a bunch of code to make BigFloat(::Rational) fast.

nsajko (Contributor, Author):

I don't think that would be an improvement, by any metric. The relevant code was already added in #50691 (into rounding.jl), so now it doesn't cost anything to support the known rounding modes for BigFloat, although the IEEEFloat method could be simplified a bit if necessary, at the cost of some testing, as mentioned above.

nsajko (Contributor, Author) commented Jan 14, 2024:

by any metric

Maybe the way you and @Joel-Dahne propose would be faster, though? But still, having all float types under a single code path seems better as any bugs should manifest (and be fixed) sooner, so it's more maintainable.
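For reference, a hedged sketch of the "just divide" approach discussed in this thread (not the PR's implementation): represent the numerator and denominator exactly as BigFloats, then let MPFR's correctly rounded division produce the result at the requested precision and rounding mode. Note that this sketch goes through the global precision and rounding settings, so it does not have the thread-safety property mentioned in the PR description.

function bigfloat_from_rational_sketch(x::Rational, rm::RoundingMode, prec::Integer)
    num, den = BigInt(numerator(x)), BigInt(denominator(x))
    # Enough precision to represent each integer exactly (MPFR needs at least 2 bits).
    intprec = max(ndigits(num, base = 2), ndigits(den, base = 2), 2)
    bnum = BigFloat(num; precision = intprec)   # exact, no rounding happens
    bden = BigFloat(den; precision = intprec)   # exact, no rounding happens
    # A single correctly rounded MPFR division at the target precision and mode.
    return setprecision(BigFloat, prec) do
        setrounding(BigFloat, rm) do
            bnum / bden
        end
    end
end

# e.g. bigfloat_from_rational_sketch(1 // 3, RoundDown, 256)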

nsajko (Contributor, Author) commented Jan 14, 2024

Oh, some of the tests are failing on 32-bit Linux and 32-bit Windows. EDIT: it's because div(::Int128, ::Int128) allocates on 32-bit Julia, which I didn't expect

Joel-Dahne:

Let me try to take a closer look at this today!

Regarding the comment

do we actually care about giving the user the ability to specify rounding modes here? Most other functions in Base don't, and giving that up seems like it would simplify the code a lot.

Indeed, most other functions in Base don't allow you to control the rounding, and I don't believe they should! Getting rounding correct is difficult (hence this PR) and it would probably fit better in a separate package. However, Base currently does support rounding when converting to Float64, e.g. Float64(1 // 3, RoundUp). Since it is already supported, I think it is a valid goal to also make it correct! For example, this is used in IntervalArithmetic.jl when converting a rational interval to a Float64 interval.

nsajko (Contributor, Author) commented Jan 15, 2024

Wow! I had no idea the rounding mode argument was already supported for converting rationals to floats! I always assumed the rounding mode argument was only accepted when converting to BigFloat. Then clearly:

  1. We do need the rounding modes to be supported explicitly in this PR
  2. The PR needs to actually override the rounding mode methods

Joel-Dahne:

I would also propose to say that this change makes constructing a float from a rational "correctly rounded", instead of "exact". It seems clearer to me, at least, since most rationals are not exactly representable as floats.

@nsajko changed the title from "Base: make constructing a float from a rational exact" to "Base: correctly rounded floats constructed from rationals" on Jan 15, 2024
Joel-Dahne left a comment


In general I think the change looks good! As far as I can tell the most important methods are to_float_components, to_floating_point_fallback and the to_floating_point_impl implementation for machine floats. The rest is more or less gluing code and tests (both of which there are a lot of).

In general the coding style is quite different from my personal one. That is of course fine, but I have some general comments:

  • Often times temporary names are introduced only to be used once. For example the exp variable in to_floating_point_fallback.
  • There are a lot of let-blocks that I don't really see the point of. For example in to_floating_point_fallback again.
  • There are many type annotations that I don't really see the point of. To take to_floating_point_fallback as an example again, surely the compiler can verify zero(T)::T without the type annotation?

Comment on lines +134 to +131
is_zero = num_is_zero & !den_is_zero
is_inf = !num_is_zero & den_is_zero


Here I think it would be natural to make an early return on these checks. So replace them with

num_is_zero & !den_is_zero && return zero(T)
!num_is_zero & den_is_zero && return sb * T(Inf)
num_is_zero & den_is_zero && return T(NaN)

That removes the need for the if-statement further down, which, for me, simplifies the flow.

Similar things could be done in the other implementations, though it makes less of a difference there.

nsajko (Contributor, Author):

I prefer to avoid return when possible. It's a jump/goto statement, breaking nice control-flow properties known as structured programming. I do use jumps in control flow, but only when necessary.

nsajko (Contributor, Author) commented Jan 15, 2024:

I used to be a Go nut (as in "Golang aficionado"), so I see where you're coming from, though.


It's only a matter of style of course. I'm not going to fight you over it 😄

# `BigInt` is a safe default.
to_float_promote_type(::Type{F}, ::Type{S}) where {F,S} = BigInt

const BitIntegerOrBool = Union{Bool,Base.BitInteger}
Joel-Dahne commented Jan 15, 2024:

This variable is only used once; I think it is more natural to just use the union directly. Though in this case, I guess it does make the method declaration below slightly too long to fit on one line.

nsajko (Contributor, Author) commented Jan 15, 2024:

My reasoning here was basically:

  1. I'm not exactly sure what the preferred way to break lines (or format code in general) is in the Julia codebase, so I prefer to avoid line breaking completely
  2. Adding a constant is basically free (as long as it's not huge, doesn't mess with precompilation somehow, etc.)


I understand, it also explains some of the style of code in other places!

Joel-Dahne commented Jan 15, 2024

I also made a quick benchmark comparing this implementation for BigFloat with the one I proposed above. With the version in this PR I get

julia> @benchmark BigFloat($(5 // 7))
BenchmarkTools.Trial: 10000 samples with 130 evaluations.
 Range (min … max):  743.838 ns …  3.561 ms  ┊ GC (min … max):  0.00% … 31.21%
 Time  (median):       1.072 μs              ┊ GC (median):     0.00%
 Time  (mean ± σ):     1.920 μs ± 46.631 μs  ┊ GC (mean ± σ):  18.36% ±  0.86%

   ▄▆▄▂ ▁    ▃▆▇▇█▅▃▁▁  ▁▁    ▁▁ ▁▁  ▂▁        ▂▂    ▁▁▁▁      ▂
  ███████▇▆▅███████████████▆▆██████▇████▇▇▇▇▆▅▇███▆▆▇█████▇▇▅▇ █
  744 ns        Histogram: log(frequency) by time      2.09 μs <

 Memory estimate: 1.09 KiB, allocs estimate: 40.

julia> @benchmark BigFloat($(big(2)^200 // big(3)^200))
BenchmarkTools.Trial: 10000 samples with 63 evaluations.
 Range (min … max):  792.905 ns …  4.627 ms  ┊ GC (min … max):  0.00% … 49.30%
 Time  (median):       1.005 μs              ┊ GC (median):     0.00%
 Time  (mean ± σ):     1.959 μs ± 57.511 μs  ┊ GC (mean ± σ):  20.56% ±  0.71%

  ▁▄▄▄▆▇▆▃▃▆█▇▆▄▂ ▂▁▁▄▄   ▁▁▂▂▂▂▂▃▂▂▂        ▁▁▂▁▂▂▂▂▂▂▂▂▁▁    ▂
  ██████████████████████▇▇████████████▇▅▅▄▅▅▇███████████████▇▇ █
  793 ns        Histogram: log(frequency) by time      1.97 μs <

 Memory estimate: 1.16 KiB, allocs estimate: 38.

With my version I get

julia> @benchmark Base.MPFR.__BigFloat($(5 // 7))
BenchmarkTools.Trial: 10000 samples with 968 evaluations.
 Range (min … max):   78.898 ns …  27.099 μs  ┊ GC (min … max):  0.00% … 99.52%
 Time  (median):      88.857 ns               ┊ GC (median):     0.00%
 Time  (mean ± σ):   102.076 ns ± 454.203 ns  ┊ GC (mean ± σ):  12.20% ±  2.97%

            ▁▂▂▂▁       ▃█▁▂▆▄                                   
  ▂▃▅▆▆▇▇▇████████▇▅▄▄▃▄██████▇▇█▅▄▄▃▃▃▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▁▂▂▂▂▂▂▂▂ ▄
  78.9 ns          Histogram: frequency by time          111 ns <

 Memory estimate: 184 bytes, allocs estimate: 4.

julia> @benchmark Base.MPFR.__BigFloat($(big(2)^200 // big(3)^200))
BenchmarkTools.Trial: 10000 samples with 204 evaluations.
 Range (min … max):  376.975 ns … 498.437 μs  ┊ GC (min … max): 0.00% … 44.14%
 Time  (median):     410.777 ns               ┊ GC (median):    0.00%
 Time  (mean ± σ):   551.419 ns ±   6.129 μs  ┊ GC (mean ± σ):  6.40% ±  0.58%

    ▁▆█▄                                                         
  ▃▅████▅▃▂▂▂▃▆█▆▄▃▅▃▃▂▂▂▃▂▂▂▂▂▃▃▃▃▃▂▂▂▂▂▂▂▁▂▂▂▁▂▂▂▂▂▂▁▂▂▁▂▁▁▂▂ ▃
  377 ns           Histogram: frequency by time          764 ns <

 Memory estimate: 368 bytes, allocs estimate: 10.

So 10 times faster for machine size integers and 2 times faster for large integers. Even more if one counts the GC time, which one probably should for BigFloat.

I don't think these performance gains are essential though, either version would be fine I think.

@nsajko added the kind:bugfix (This change fixes an existing bug) label on Jan 15, 2024
nsajko (Contributor, Author) commented Jan 15, 2024

Often times temporary names are introduced only to be used once. For example the exp variable in to_floating_point_fallback.

Got rid of exp, thanks.

There are a lot of let-blocks that I don't really see the point of. For example in to_floating_point_fallback again.

Yeah, some of the lets were definitely redundant. There are fewer of them now. Some of the remaining ones are because I wanted to give a local scope to if blocks that have their own variables, as the lack of a new scope can be surprising there.

There are many type annotations that I don't really see the point of. To take to_floating_point_fallback as an example again, surely the compiler can verify zero(T)::T without the type annotation?

There is no way to declare a return type for a function (as in, all the function's methods) in Julia. We don't even have a guarantee for the return type of constructors (#42372)! Thus I use a type assertion after calling some function whenever I can to catch bugs earlier, and to help contain possible type-instability (which can sometimes be introduced by the caller). Note that Julia is good with eliminating type assertions based on type inference, so they're basically free when they're not helpful.
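To illustrate the style point (hypothetical toy code, not from the PR):

f(x) = x + one(x)
# Asserting the expected concrete type at the call site fails fast (with a
# TypeError) if an unexpected type ever comes back, and is essentially free
# when inference already proves the type.
y = f(2)::Int
z = zero(Float64)::Float64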

So 10 times faster for machine size integers and 2 times faster for large integers.

Thanks, I'll profile this and potentially follow up.

Joel-Dahne:

Sounds good! I don't have any further comments at this point!

nsajko (Contributor, Author) commented Jan 16, 2024

Part of the reason for this PR being slow for bignums is that the current div and divrem (not touched by this PR) are unnecessarily slow. Looking at the allocation profiler, divrem(::BigInt, ::Int, ::RoundingMode{:ToZero}) is really slow and allocates too much. It calls div, which does an unnecessary promotion on the denominator, and then divrem does unnecessary copying arithmetic. GMP actually provides mpz_tdiv_q_ui, which directly calculates both the quotient and the remainder. There are more examples like this, but that's clearly a matter for a separate PR. To make matters worse, touching base/div.jl seems fraught with peril, as the dispatch situation between all the different function variants (div, rem, divrem, fld, cld, fldmod) is horrifying...
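For illustration only, the GMP routine mentioned above can be reached directly with ccall; the helper below is hypothetical (the proper fix would belong in Base.GMP's wrappers), and the signature follows the GMP manual, where the quotient is written into q and the absolute value of the remainder is returned:

function tdiv_q_ui!(q::BigInt, n::BigInt, d::UInt)
    # mpz_tdiv_q_ui: q = trunc(n / d), returns |n| mod d.
    r = ccall((:__gmpz_tdiv_q_ui, :libgmp), Culong,
              (Ref{BigInt}, Ref{BigInt}, Culong), q, n, d)
    return q, r % UInt
end

q = BigInt()
quotient, remainder = tdiv_q_ui!(q, big(10)^20, UInt(7))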

oscardssmith (Member):

I think the base/div.jl issues have been significantly improved recently.

nsajko (Contributor, Author) commented May 12, 2024

Fixed the conflict. Could this be merged as per #53641 (comment) (the regression is caused by a correctness fix, and it mainly applies to bignums)? The performance for bignums can be fixed later; this PR is already quite large.

nsajko (Contributor, Author) commented May 12, 2024

The test failures are real.

Labels
domain:maths (Mathematical functions), domain:rationals (The Rational type and values thereof), kind:bugfix (This change fixes an existing bug), status:forget me not (PRs that one wants to make sure aren't forgotten)
7 participants