
Add custom_jvp to jax.numpy.ldexp. #23923

Merged
merged 1 commit into from
Sep 30, 2024

Conversation

carlosgmartin
Contributor

Addresses #11467 (comment).

@jakevdp
Collaborator

jakevdp commented Sep 25, 2024

Thanks for looking into this! I'm not sure this is actually the correct gradient behavior for this function. As I mentioned in #11467 (comment), ldexp is a bit of a strange function: it nominally computes x * 2 ** y, but it doesn't really compute that expression, because it operates on the details of the bitwise representation of x and y. Because of that, I don't think autodiff really applies to this function: autodiff operates only in the space of real numbers represented by the floating-point implementation, and bitwise manipulations don't have a gradient in that space.

Maybe the best solution here would be to define custom_jvp that raises an error saying basically "this function isn't differentiable". What do you think?
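A minimal sketch of this suggestion (illustrative only, not the actual JAX implementation) would attach a custom JVP rule whose only job is to raise an informative error when differentiation is attempted; the forward computation is unchanged:

```python
import jax
import jax.numpy as jnp

# Hypothetical sketch: wrap the existing jnp.ldexp and make any attempt
# to differentiate through it raise an error.
@jax.custom_jvp
def ldexp(x, n):
    return jnp.ldexp(x, n)

@ldexp.defjvp
def _ldexp_jvp(primals, tangents):
    raise ValueError("ldexp operates on bitwise representations and is not differentiable.")
```

With this sketch, `ldexp(2.0, 3)` still evaluates to 16.0, but `jax.grad(lambda x: ldexp(x, 3))(1.0)` raises the `ValueError` during tracing.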

@jakevdp jakevdp self-assigned this Sep 25, 2024
@carlosgmartin
Contributor Author

carlosgmartin commented Sep 25, 2024

@jakevdp Thanks for your feedback.

It seems to me that if a JAX function extensionally computes a differentiable mathematical function $f$, then, regardless of how it is implemented internally, it ought to have the gradient of $f$.

Indeed, my understanding is that one of the primary use cases of custom_jvp is precisely to recover gradients of functions whose internal implementations are not auto-differentiable.

Since ldexp computes $f(x) = x \cdot 2^n$, the least surprising behavior from a user's point of view would be that $f'(x) = 2^n$. Indeed, a user (or the compiler) might encounter such an expression in a program and substitute it with ldexp as an optimization. It would seem surprising if this substitution were to cause an error on a backward pass.
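This proposed gradient can be sketched with a custom JVP rule (again illustrative, not the merged implementation): since $f$ is linear in $x$, the tangent is just the input tangent scaled by $2^n$, with the integer exponent treated as non-differentiable:

```python
import jax
import jax.numpy as jnp

# Hypothetical sketch: give ldexp the gradient of f(x) = x * 2**n,
# i.e. df/dx = 2**n, while keeping the existing forward computation.
@jax.custom_jvp
def ldexp(x, n):
    return jnp.ldexp(x, n)

@ldexp.defjvp
def _ldexp_jvp(primals, tangents):
    x, n = primals
    x_dot, _ = tangents  # the integer exponent has no tangent
    primal_out = jnp.ldexp(x, n)
    tangent_out = x_dot * 2.0 ** n  # linear in x_dot, so reverse mode works too
    return primal_out, tangent_out
```

Under this sketch, `jax.grad(lambda x: ldexp(x, 3))(1.0)` evaluates to 8.0, matching $f'(x) = 2^3$.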

Thoughts?

@jakevdp
Collaborator

jakevdp commented Sep 25, 2024

So my question is, if we're just computing $x * 2 ^ y$, why don't we fix this by changing the implementation to

def ldexp(x, y):
  return x * 2 ** y

Then no custom JVP is necessary at all.
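To illustrate the point: with the plain-arithmetic version, the expected gradient falls out of autodiff with no custom rule at all (a minimal sketch, not the library implementation):

```python
import jax

def ldexp(x, n):
    # Plain-arithmetic version: differentiable by construction.
    return x * 2.0 ** n

# d/dx (x * 2**4) = 16
grad_x = jax.grad(ldexp)(3.0, 4)
```

Here `ldexp(3.0, 4)` is 48.0 and `grad_x` is 16.0, with no `custom_jvp` involved.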

@carlosgmartin
Contributor Author

carlosgmartin commented Sep 26, 2024

I think (the bit-twiddling implementation of) ldexp is supposed to be a faster way to do that, at least on some platforms.

The CUDA Math API has dedicated ldexp functions for single and double precision.

The C standard library also has dedicated ldexp functions.

Not sure what the performance advantage is for different platforms.
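For reference, the C-style semantics being discussed can be checked against Python's standard-library math.ldexp, which agrees exactly with x * 2**n for in-range values (power-of-two scaling only adjusts the floating-point exponent):

```python
import math

# math.ldexp(x, n) computes x * 2**n exactly, barring overflow/underflow.
for x in (0.75, -1.5, 3.14159):
    for n in (-3, 0, 10):
        assert math.ldexp(x, n) == x * 2.0 ** n
```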

@jakevdp
Collaborator

jakevdp commented Sep 26, 2024

Sure, but JAX does not dispatch to any of those fast kernels, and I imagine the current bit-twiddling implementation is far slower than just writing x1 * 2 ** x2. Is there any good reason to keep the current implementation?

@jakevdp
Collaborator

jakevdp commented Sep 26, 2024

I guess stepping back, here are the options:

  1. ldexp is fundamentally a bit-twiddling operation. In this case, its autodiff behavior is poorly defined, and if we add a custom_jvp, it should probably just raise an error.
  2. ldexp represents the mathematical operation $x * 2^y$ with platform-specific implementation details when custom kernels are available. In this case, the optimal implementation in JAX would be to write x * 2 ** y, which would have the side effect of making custom_jvp unnecessary.

Until now, we've approached this as (1). Which do you think is the right approach?

@carlosgmartin
Contributor Author

You raise an interesting point. If there's indeed no performance advantage to the bit-twiddling implementation of ldexp(x, n) over x * 2 ** n, then perhaps the former isn't necessary and can be replaced with the latter.

At least for the time being, perhaps it's worth adding a note to the documentation for ldexp stating that there's no performance advantage to its current bit-twiddling implementation over x * 2 ** n.

I'd also welcome any additional opinions from people who are more familiar with the hardware side of things.

[Resolved review comments on jax/_src/numpy/ufuncs.py]
@jakevdp jakevdp left a comment

Looks good!

One other thing: we should update the function docs with info about the implementation (this would involve removing the @implements decorator and writing a full docstring).

If you'd like to do this as part of the PR then go ahead, but I'm happy to update docs in a followup.

@carlosgmartin
Contributor Author

I'll let you handle that so you can choose the best wording.

@jakevdp jakevdp added the pull ready Ready for copybara import and testing label Sep 30, 2024
@copybara-service copybara-service bot merged commit 31cb3fd into jax-ml:main Sep 30, 2024
11 of 12 checks passed
@carlosgmartin carlosgmartin deleted the ldexp_custom_jvp branch September 30, 2024 23:23
@jakevdp
Collaborator

jakevdp commented Sep 30, 2024

Thanks for putting this together!
