Allow NotImplemented tangents for things that have a correct tangent of NoTangent #218

devmotion · 2021-09-21T12:37:19Z

This is one possible approach to fix #217.

An alternative would be to tell users to define rand_tangent or, probably better, pass a suitable tangent of type NotImplemented. It is a bit inconvenient though since currently we check for equality, including the message AND the LineNumberNodes. I.e., one can't just use test_rrule(f, x \vdash @not_implemented("does not work")) since the LineNumberNode would be different from the one used inside the rrule. Maybe this should be changed and we should only check if the messages are equal?

…eturns `NoTangent`

codecov-commenter · 2021-09-21T12:40:41Z

Codecov Report

Merging #218 (f57fc7e) into master (a46fbbc) will decrease coverage by 0.45%.
The diff coverage is 85.71%.

@@            Coverage Diff             @@
##           master     #218      +/-   ##
==========================================
- Coverage   91.21%   90.75%   -0.46%     
==========================================
  Files          11       11              
  Lines         296      303       +7     
==========================================
+ Hits          270      275       +5     
- Misses         26       28       +2

Impacted Files	Coverage Δ
src/testers.jl	`91.42% <85.71%> (-1.58%)`	⬇️
src/finite_difference_calls.jl	`97.22% <0.00%> (+0.07%)`	⬆️
src/check_result.jl	`89.70% <0.00%> (+0.15%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update a46fbbc...f57fc7e. Read the comment docs.

src/testers.jl

oxinabox

I am convinced that this is the best way.

Do you want to extract some of this code into a helper method?
Its getting long.
Maybe a function for the content of the branch: accum_cotangent isa NoTangent ?
Or maybe a method that dispatches on accum_cotangent, and ad_cotangent ?

Regardless, change as you will then merge and tag when happy.

Co-authored-by: Lyndon White <oxinabox@ucc.asn.au>

src/testers.jl

oxinabox · 2021-09-21T14:01:08Z

src/testers.jl

+    # the `@test_broken` below should tell them that there is an easy
+    # implementation for this case of `NoTangent()`
+    # https://github.com/JuliaDiff/ChainRulesTestUtils.jl/issues/217
+    @test_broken false


@test_broken false is much less useful than

Suggested change

@test_broken false

@test_broken not_implemented === NoTangent()

because it doesn't give a message saying what the correct answer is.

Ah true, I thought I could simplify the code but I see that it was not a good idea 😄

Macro's they will do that to you

src/testers.jl

st-- · 2021-09-21T14:24:15Z

src/testers.jl

+    # the `@test_broken` below should tell them that there is an easy implementation for
+    # this case of `NoTangent()` (`@test_broken false` would be less useful!)
+    # https://github.com/JuliaDiff/ChainRulesTestUtils.jl/issues/217
+    @test_broken ad_cotangent isa NoTangent


the @test_broken below should tell them that there is an easy implementation for this case of NoTangent()

I don't understand what this is supposed to say, could you explain so we can make it a more helpful comment for future readers of the codebase?

From the comment by @oxinabox on the outdated diff:

The correct implementation of anything that test_rrule determined was a NoTangent is almost certainly NoTangent()
Which is easy.
It might not generalize to cases that are not tested, but for the types passed into this test it is (almost certainly) correct.

But I'm still not clear what I'd be expected to do.

My understanding (based on the CRC docs) is: If I mark something as NotImplemented, that generally means I could implement it, but it's hard (and I can't be bothered, maybe it wouldn't be used anyways in practice). So what is the reference to an "easy implementation" intended to say?

If you end up here you should have used a NoTangent() cotangent in your rrule. So the easy fix is to just replace whatever you did in your rrule with NoTangent(). @test_broken ad_cotangent isa NoTangent will display the broken expression and hence it is easy to see that ad_tangent should have been NoTangent().

If the argument is actually differentiable and NoTangent is just caused by an incorrect rand_tangent default, then you should provide a correct tangent.

Consider the MWE from #217 (comment)

function ChainRulesCore.rrule(::typeof(foo), K, fun) f = foo(K, fun) function pullback(Δf) ∂self = NoTangent() ∂K = @thunk(2Δf) ∂fun = @not_implemented("does not work") # does not work return (∂self, ∂K, ∂fun) end return f, pullback end

The correct implementation; for the types that were tested in this test_rrule, (for you to hit this test_broken are:

function ChainRulesCore.rrule(::typeof(foo), K, fun) f = foo(K, fun) function pullback(Δf) ∂self = NoTangent() ∂K = @thunk(2Δf) ∂fun = NoTangent() return (∂self, ∂K, ∂fun) end return f, pullback end

Because you can only end-up on this line if fun has no fields, and this it's tangent must be NoTangent().
The implementation is not a mystery here, NoTangent() is the correct thing to write.

Now if the rule author wanted to be generic to whether or not fun had fields.
Then the rule author should probably test that case by using a closure or some functor.

And they might end-up with something like:

∂fun = Base.issingletontype(typeof(fun)) ? NoTangent() : @not_implemented("Functors not supported")

per #217 (comment)

Where would we have ended up if fun had fields? How would I tell the dispatcher that an rrule would only apply to functions with no field?

If my example

function ChainRulesCore.rrule(::typeof(foo), K, fun) f = foo(K, fun) function pullback(Δf) ∂self = NoTangent() ∂K = @thunk(2Δf) ∂fun = @not_implemented("would have to do more maths") return (∂self, ∂K, ∂fun) end return f, pullback end

should be generic for fun (the derivative for fields of fun would exist, I just haven't worked it out yet), is @not_implemented not the right thing to do ?

In general, it is not the right thing to do, the right thing is to implement the derivatives 😛 @not_implemented is a workaround until you've done this, and therefore all tests with NotImplemented use @test_broken to indicate that something is broken and should be fixed.

Co-authored-by: st-- <st--@users.noreply.github.com>

devmotion added 2 commits September 21, 2021 14:30

Fix test_rrule if cotangent is not implemented but rand_tangent r…

c76c4ff

…eturns `NoTangent`

Bump version

1fbeedd

oxinabox reviewed Sep 21, 2021

View reviewed changes

src/testers.jl Outdated Show resolved Hide resolved

oxinabox approved these changes Sep 21, 2021

View reviewed changes

oxinabox changed the title ~~Fix https://github.com/JuliaDiff/ChainRulesTestUtils.jl/issues/217~~ Allow NotImplemented tangents for things that have a correct tangent of NoTangent Sep 21, 2021

Update src/testers.jl

5bddc46

Co-authored-by: Lyndon White <oxinabox@ucc.asn.au>

st-- reviewed Sep 21, 2021

View reviewed changes

src/testers.jl Outdated Show resolved Hide resolved

Refactor checks of cotangents

e82810c

oxinabox reviewed Sep 21, 2021

View reviewed changes

devmotion added 3 commits September 21, 2021 16:11

Improve error message

4232dec

Extend comment

6d4c04b

Fix spelling error

1c2cd12

st-- reviewed Sep 21, 2021

View reviewed changes

Update src/testers.jl

f57fc7e

Co-authored-by: st-- <st--@users.noreply.github.com>

devmotion merged commit e3ff6b4 into master Sep 21, 2021

devmotion deleted the dw/notimplemented_notangent branch September 21, 2021 18:11

st-- mentioned this pull request Sep 23, 2021

Clarify "writing good rules" documentation JuliaDiff/ChainRulesCore.jl#468

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow NotImplemented tangents for things that have a correct tangent of NoTangent #218

Allow NotImplemented tangents for things that have a correct tangent of NoTangent #218

devmotion commented Sep 21, 2021

codecov-commenter commented Sep 21, 2021 •

edited

Loading

oxinabox left a comment

oxinabox Sep 21, 2021

devmotion Sep 21, 2021

oxinabox Sep 21, 2021

st-- Sep 21, 2021

st-- Sep 21, 2021

st-- Sep 21, 2021

devmotion Sep 21, 2021

devmotion Sep 21, 2021

oxinabox Sep 21, 2021

st-- Sep 21, 2021

st-- Sep 21, 2021 •

edited

Loading

devmotion Sep 21, 2021

	@test_broken false
	@test_broken not_implemented === NoTangent()

Allow NotImplemented tangents for things that have a correct tangent of NoTangent #218

Allow NotImplemented tangents for things that have a correct tangent of NoTangent #218

Conversation

devmotion commented Sep 21, 2021

codecov-commenter commented Sep 21, 2021 • edited Loading

Codecov Report

oxinabox left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

st-- Sep 21, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov-commenter commented Sep 21, 2021 •

edited

Loading

st-- Sep 21, 2021 •

edited

Loading