Computation of penalty terms #41
What is the reasoning? They are used twice, hence the penalty is computed for each factor matrix of each mode. This matters in two cases: 1. when the computed penalty is weighted, and 2. when the penalty weight comes from a shared lookup_embedder config setting, so that we cannot learn/tune the scaling. |
I can see arguments for both ways of doing it.
The weights are not a big deal; this can be done both ways: in (1) by simply passing along a list of ids (as suggested in #39), and in (2) by calling penalty twice (once for s, once for o). (BTW: currently, the penalty function in KgeModel does not correctly pass along ids.) |
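A minimal sketch of the ids-based variant (1). All names here are illustrative stand-ins, not the actual LibKGE API:

```python
class LookupEmbedder:
    """Toy stand-in for a lookup embedder; names are hypothetical."""

    def __init__(self, weights):
        # weights: one embedding vector (list of floats) per id
        self.weights = weights

    def penalty(self, ids=None):
        # If a list of ids is passed (as suggested in #39), only the rows
        # used in the batch contribute, each as often as it occurs; with
        # ids=None, the penalty covers all parameters exactly once.
        rows = self.weights if ids is None else [self.weights[i] for i in ids]
        return sum(v * v for row in rows for v in row)


emb = LookupEmbedder([[1.0, 2.0], [3.0, 0.0]])
full = emb.penalty()             # all rows once: 1 + 4 + 9 = 14.0
batch = emb.penalty(ids=[0, 0])  # row 0 counted twice: 10.0
```

Passing ids makes the penalty proportional to how often each embedding occurs in the batch, which is what the weighted variant needs.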
How is that? |
All fine, I must have been looking at outdated code. |
The current implementation of these penalty terms for lookup_embedders uses `embed`: `parameters = self.embed(kwargs['batch']['triples'][:, kwargs['slot']])`. This may be flawed since this will run dropout (but shouldn't).
Argh, of course. Fixed.
|
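The fix discussed above can be illustrated with a small sketch (plain Python instead of PyTorch; names and the dropout mechanics are simplified assumptions): the penalty must read the raw parameters directly rather than going through embed, which applies dropout in training mode:

```python
import random


class LookupEmbedder:
    """Toy embedder; names and dropout mechanics are simplified assumptions."""

    def __init__(self, weights, dropout=0.5):
        self._weights = weights
        self.dropout = dropout
        self.training = True

    def embed(self, ids):
        # Training-time lookup applies dropout, so its output is a
        # randomized quantity and the wrong basis for a penalty term.
        return [
            [0.0 if self.training and random.random() < self.dropout else v
             for v in self._weights[i]]
            for i in ids
        ]

    def penalty(self, ids):
        # Correct: read the raw parameters directly; no dropout is applied,
        # so the penalty is deterministic.
        return sum(v * v for i in ids for v in self._weights[i])


emb = LookupEmbedder([[1.0, 2.0], [3.0, 0.0]])
p = emb.penalty([0, 1])  # deterministic: 14.0, regardless of dropout
```

Computing the penalty from `embed` output would instead make the regularization term a random variable that is biased downward whenever dropout zeroes entries.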
Looks fine now. The only suggested change I still have is the API change from #39:
|
The change above is still open. I also think that it would be helpful (for understanding the model) if the penalty terms were named (e.g., `entity_embedder.l2`) and traced/printed both with and without the regularization weight being applied. |
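Named penalty terms as suggested above could look roughly like this sketch. All names (`penalties_with_names`, the slot keys, the `.l2` suffix) are hypothetical; LibKGE's actual tracing API is not reproduced here:

```python
def penalties_with_names(model_name, embedders, weights):
    """Return (name, unweighted, weighted) triples for tracing/printing.

    embedders: dict mapping a slot name (e.g. 'entity_embedder') to an
    object with a penalty() method; weights: per-slot regularization
    weights (defaulting to 1.0). Names here are illustrative only.
    """
    terms = []
    for slot, embedder in embedders.items():
        raw = embedder.penalty()
        w = weights.get(slot, 1.0)
        terms.append((f"{model_name}.{slot}.l2", raw, w * raw))
    return terms


class ConstPenalty:
    """Tiny test double whose penalty is a fixed value."""

    def __init__(self, value):
        self.value = value

    def penalty(self):
        return self.value


terms = penalties_with_names(
    "complex",
    {"entity_embedder": ConstPenalty(2.0),
     "relation_embedder": ConstPenalty(3.0)},
    {"entity_embedder": 0.5},
)
# One named triple per slot, with and without the weight applied.
```

Emitting both the raw and the weighted value makes it easy to see in traces whether a change in the loss comes from the parameters or from the weight.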
Is this already being worked on? |
Yes, by @Nzteb |
Closed with #101 |
Commit e667cf9 changes the way penalties are interpreted for many models. The penalty term was previously computed only once per embedding, but with this change it is computed twice if the subject and object embedder are the same (a common case). Instead of calling penalty twice, the code should check whether they are the same and, if so, call penalty only once.
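The check proposed above can be sketched as follows (hypothetical names; an identity check on the embedder objects decides whether the penalty is counted once or twice):

```python
def model_penalty(subject_embedder, relation_embedder, object_embedder):
    # If the subject and object slots share one embedder object (the
    # common case), count its penalty once instead of twice.
    embedders = [subject_embedder, relation_embedder]
    if object_embedder is not subject_embedder:
        embedders.append(object_embedder)
    return sum(e.penalty() for e in embedders)


class ConstPenalty:
    """Tiny test double whose penalty is a fixed value."""

    def __init__(self, value):
        self.value = value

    def penalty(self):
        return self.value


ent = ConstPenalty(5.0)
rel = ConstPenalty(1.0)
shared = model_penalty(ent, rel, ent)                   # 6.0, not 11.0
separate = model_penalty(ent, rel, ConstPenalty(5.0))   # 11.0
```

Using `is` rather than `==` matters here: two distinct embedders with equal parameters should still each contribute, whereas one shared object should not be double-counted.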