LLM Finetune Functionality #277
Conversation
@eldarkurtic As part of your review, please pay special attention to the FSDP handling to ensure it is being applied as you expect. If you can pull down and test a model for accuracy/speed, that would also be appreciated.
LGTM overall, just had a few nitpicky comments. I haven't tried running this on a machine yet; I will give it a go later today.
Looks great overall, see a few comments.
* add functions to mask weights during finetuning
* update logic for loading weights
* update yaml
* update mask name
* add logic to update batch size based on gpu count
* make sparsify requirements less broad; move sparseml[transformers] to nm deps
* remove flash-attn
* quality
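The first commit above mentions masking weights during finetuning. A minimal sketch of the idea (with hypothetical helper names, not the PR's actual implementation): record a binary mask from the initially sparse weights and re-apply it after every optimizer step so pruned positions stay zero:

```python
import numpy as np

def compute_sparsity_mask(weight: np.ndarray) -> np.ndarray:
    """Record which weights are already pruned (exactly zero)."""
    return weight != 0

def apply_mask(weight: np.ndarray, mask: np.ndarray) -> np.ndarray:
    """Zero out pruned positions, e.g. after each optimizer step."""
    return weight * mask

# Toy example: a 50%-sparse weight matrix.
weight = np.array([[0.5, 0.0], [0.0, -1.2]])
mask = compute_sparsity_mask(weight)

# A gradient update perturbs all entries, including pruned ones...
updated = weight + 0.1
# ...so the mask is re-applied to preserve the sparsity pattern.
masked = apply_mask(updated, mask)
assert (masked[weight == 0] == 0).all()
```

In practice this kind of masking is registered as a hook on the training loop rather than called manually, so sparse-transfer finetuning keeps the sparsity structure of the starting checkpoint.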
Summary
This adds `finetune`, which provides support for all the steps required for the llmfoundry finetuning integration (`data` cli arg to the sparsify command). `finetune.py` includes the steps, which can be wrapped in `launch_ddp` and `trainhook`.
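The `launch_ddp` wrapper referenced above presumably decides whether to run the train hook distributed or single-process. A hedged sketch of the usual pattern (the function body and signature here are illustrative, not the PR's actual API): read the standard `WORLD_SIZE`/`LOCAL_RANK` environment variables that `torchrun` sets, and fall back to a single local process when they are absent:

```python
import os
from typing import Callable

def launch_ddp(train_fn: Callable[[int, int], None]) -> None:
    """Run train_fn(rank, world_size), distributed if launched via torchrun.

    Illustrative sketch: under torchrun, each worker process sees WORLD_SIZE
    and LOCAL_RANK in its environment; otherwise we run one local process.
    """
    world_size = int(os.environ.get("WORLD_SIZE", "1"))
    rank = int(os.environ.get("LOCAL_RANK", "0"))
    # A real integration would call torch.distributed.init_process_group
    # here when world_size > 1 before invoking the train hook.
    train_fn(rank, world_size)
```

Invoked as, e.g., `torchrun --nproc_per_node=4 finetune.py --data config.yaml`, each of the four worker processes would call `train_fn` with its own rank.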
The `data` arg contains the yaml file; there are no `finetune`-specific train args, but these can be added in as greater finetuning support is fleshed out.
Testing
Tested using the `samples` folder. This produces the following output in the running directory after training is complete:
sparse_transfer_finetune_2023_08_09_01_24_08
All paths from llmfoundry are updated to use the working directory as the root; the output directory shown above therefore contains the checkpoints and all other llmfoundry-specific outputs.
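The output directory shown above appears to follow a `<prefix>_<YYYY_MM_DD_HH_MM_SS>` naming pattern. A sketch of how such a name could be generated (a hypothetical helper, not necessarily the PR's code):

```python
from datetime import datetime
from typing import Optional

def run_dir_name(prefix: str = "sparse_transfer_finetune",
                 now: Optional[datetime] = None) -> str:
    """Build a timestamped run directory name like the one shown above."""
    now = now or datetime.now()
    return f"{prefix}_{now.strftime('%Y_%m_%d_%H_%M_%S')}"

# Reproduces the directory name from the PR description for that timestamp.
print(run_dir_name(now=datetime(2023, 8, 9, 1, 24, 8)))
# -> sparse_transfer_finetune_2023_08_09_01_24_08
```

Timestamping the run directory keeps repeated training runs in the same working directory from overwriting each other's checkpoints.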