
Add training support for SigLIP #31495

Merged (10 commits into huggingface:main from siglip-training, Jul 5, 2024)

Conversation

@aliencaocao (Contributor) commented on Jun 19, 2024

What does this PR do?

Adds the sigmoid contrastive loss function from SigLIP, ported from https://github.com/google-research/big_vision/blob/01edb81a4716f93a48be43b3a4af14e29cdb3a7f/big_vision/trainers/proj/image_text/siglip.py#L287.

This will allow training/finetuning SigLIP models.
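
For reference, here is a minimal PyTorch sketch of that loss (my own paraphrase of the reference implementation, not necessarily the exact code merged here), assuming L2-normalized embeddings and a learned `logit_scale` / `logit_bias` as in the original:

```python
import torch
import torch.nn.functional as F

def siglip_loss(image_embeds: torch.Tensor, text_embeds: torch.Tensor,
                logit_scale: torch.Tensor, logit_bias: torch.Tensor) -> torch.Tensor:
    # Pairwise similarities between every text and every image in the batch;
    # matching pairs sit on the diagonal.
    logits = torch.matmul(text_embeds, image_embeds.t()) * logit_scale + logit_bias
    # Labels are +1 for matching pairs and -1 for every other pair.
    labels = 2 * torch.eye(logits.size(0), device=logits.device) - 1
    # Sigmoid cross-entropy, summed over pairs and averaged over the batch.
    return -F.logsigmoid(labels * logits).sum(dim=-1).mean()
```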

I have already verified that it works on my own dataset.

I saw the note on using torch.distributed for the loss function and open_clip's implementation, but I'm not sure why it is needed. I ran my training with both DDP and FSDP with full sharding and it seems to work just fine, also getting the expected speedup and the ability to set a larger batch size. The only issue is #31034 when using FSDP, but I don't think it's SigLIP-specific.
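
For illustration, a rough sketch of a single fine-tuning step using the new `return_loss` flag; the checkpoint name is a released SigLIP model, but the dummy tensors and learning rate are placeholders standing in for a real `SiglipProcessor` batch:

```python
import torch
from transformers import SiglipModel

model = SiglipModel.from_pretrained("google/siglip-base-patch16-224")
model.train()
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)

# Dummy batch standing in for the output of SiglipProcessor on image-text pairs.
batch = {
    "input_ids": torch.randint(0, model.config.text_config.vocab_size, (4, 64)),
    "pixel_values": torch.rand(4, 3, 224, 224),
}

outputs = model(**batch, return_loss=True)  # return_loss=True computes the sigmoid loss
outputs.loss.backward()
optimizer.step()
optimizer.zero_grad()
```

Note that without torch.distributed, each DDP rank computes the loss only over its local in-batch negatives, rather than using the chunked all-gather scheme from the paper and open_clip.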

Nonetheless, I updated the docs to mention that torch.distributed is not used, in case that ends up being important to some users.

I'm not sure whether a training test is needed.

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@amyeroberts

@amyeroberts (Collaborator) commented:

@aliencaocao Could you rebase to include the upstream changes on main? This should fix the failures on the CI runs.

@amyeroberts (Collaborator) left a comment:


Thanks for adding!

The tests in test_modeling_siglip.py will also need to be updated so that the training tests are no longer skipped.
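
Something like the following hypothetical smoke test (not the repository's actual test code; the tiny config values are placeholders) illustrates what the mixin's training tests exercise once the skips are removed:

```python
import torch
from transformers import SiglipConfig, SiglipModel

def test_siglip_training_smoke():
    # Tiny random config so the test runs quickly on CPU.
    config = SiglipConfig(
        text_config={"hidden_size": 32, "intermediate_size": 37,
                     "num_attention_heads": 4, "num_hidden_layers": 2},
        vision_config={"hidden_size": 32, "intermediate_size": 37,
                       "num_attention_heads": 4, "num_hidden_layers": 2,
                       "image_size": 30, "patch_size": 2},
    )
    model = SiglipModel(config)
    model.train()
    inputs = {
        "input_ids": torch.randint(0, config.text_config.vocab_size, (2, 7)),
        "pixel_values": torch.rand(2, 3, 30, 30),
        "attention_mask": torch.ones(2, 7, dtype=torch.long),
    }
    loss = model(**inputs, return_loss=True).loss
    loss.backward()  # should produce gradients now that training is supported
    assert torch.isfinite(loss)
```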

[experimental] enable GC training tests as it has worked for my own data
@aliencaocao (Contributor, Author) commented on Jun 21, 2024

Added the training tests and also enabled the gradient checkpointing tests. I note that CLIP had issues with GC, but I have used it with SigLIP myself and found no issues with convergence/accuracy on a single RTX 3080 Ti with fp16 training and grad accum = 16.

Will let the tests run and see how it goes.
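
For anyone reproducing this setup, a rough sketch of enabling gradient checkpointing with fp16 and gradient accumulation; the checkpoint name is a released SigLIP model, while the output directory and per-device batch size are placeholders:

```python
from transformers import SiglipModel, TrainingArguments

model = SiglipModel.from_pretrained("google/siglip-base-patch16-224")
model.gradient_checkpointing_enable()  # recompute activations in backward to save memory

# Placeholder TrainingArguments mirroring the setup described above.
args = TrainingArguments(
    output_dir="siglip-ft",          # placeholder output directory
    fp16=True,                       # mixed-precision training
    gradient_accumulation_steps=16,  # effective batch = 16 x per-device batch
    per_device_train_batch_size=8,   # placeholder; tune to available memory
)
```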

@aliencaocao (Contributor, Author) commented:

@amyeroberts it seems we need you to enable the slow tests?

@amyeroberts (Collaborator) left a comment:


Thanks for the continued work on this!

It shouldn't be necessary for the slow tests to be enabled to test training for this model. Nevertheless, I've added the run-slow label. If you push a commit with the message [run_slow] siglip, this will trigger a run of the slow tests for this model (which I'll have to approve to set off).

Review thread on tests/models/siglip/test_modeling_siglip.py (outdated, resolved)
Merge commit resolving conflicts in tests/models/siglip/test_modeling_siglip.py
@aliencaocao (Contributor, Author) commented:

@amyeroberts now that the GC tests are properly skipped, shall we move forward with this?

@SunMarc requested a review from amyeroberts on June 28, 2024.
@amyeroberts (Collaborator) left a comment:


Thanks for adding!

@amyeroberts merged commit 1d3eaa6 into huggingface:main on Jul 5, 2024.
18 checks passed
@aliencaocao deleted the siglip-training branch on July 5, 2024.