initial commit for trlx LORA support #110

ethankim00 · 2022-11-23T03:28:43Z

Basic support for low rank adaptation.

LouisCastricato · 2022-11-23T11:45:18Z

This should take a similar form to how hydra models are built. It shouldn't be required and directly integrated into ilql or PPO model

LouisCastricato · 2022-11-23T14:08:36Z

#80 Relevant issue.

Dahoas · 2022-12-09T20:30:32Z

@ethankim00 just a gentle push on when you expect to finish this?

cat-state · 2022-12-09T22:56:59Z

cc @Sayanc93

ethankim00 · 2022-12-10T19:54:57Z

I can get to it tomorrow or Monday. I'm wondering what the API should be to avoid modifying the model definitions?

cat-state · 2022-12-14T03:53:50Z

I can get to it tomorrow or Monday. I'm wondering what the API should be to avoid modifying the model definitions?

I think it would be like, instead of modifying the CausalLMWithValueHeads or GPTHydraHeadWithValueModel class definitions, the delta versions could be subclasses, and then the config can treat them as just another architecture to be trained

merge upstream changes

LouisCastricato · 2022-12-23T12:56:57Z

Circling back around on this.

merge upstream

LouisCastricato · 2022-12-23T20:50:56Z

@cat-state does it make sense to do lora + hydra or just have lora be entirely separate...

ethankim00 · 2022-12-23T20:56:46Z

We could have a function to modify the base model of each different model type rather than creating subclasses.

LouisCastricato · 2022-12-23T21:02:06Z

Hm... I differ to better software engineers haha @jon-tow your input would be great here too

jon-tow · 2023-01-06T21:11:10Z

@ethankim00 This looks great! I've made a few changes based on some testing on our cluster. Here's the summary:

Updates "gpt_neo"` model type name in the modifier map.
Fixes layer regex pattern, as the previous one could not capture on ranges with multiple digits, e.g. adapting LORA to a model's 8 through 11 block layers failed since [8-11] is an invalid regex character range.
Moves the delta model modifications to the base trainer to avoid unnecessary duplication.
Change _opendelta_available to HAS_OPENDELTA for consistency with other modules (see HAS_BNB).

Overall things look very promising. Check out these runs from the PPO sentiments task here. I'm going to begin testing on ILQL and once that's cleared up, we can get this ready for review and a merge.

Reports:

LouisCastricato · 2023-01-07T16:08:48Z

This looks fantastic! Definitely worth including for the 0.4 release next week. Let's get this merged :)

trlx/data/configs.py

trlx/trainer/accelerate_base_trainer.py

trlx/utils/modeling.py

LouisCastricato

LGTM

LouisCastricato requested a review from cat-state November 23, 2022 11:43

initial commit for trlx LORA support

39aae19

merge upstream changes

ethankim00 force-pushed the LORA branch from 97a3cdf to 39aae19 Compare December 15, 2022 03:28

reset changes

ed4db30

ethankim00 added 3 commits December 23, 2022 14:43

Merge branch 'main' into LORA

d950a8c

merge upstream

subclass PPO model w/ to use lora

45747ba

subclass ilql model to use lora method

fb1ef3c

James4Ever0 mentioned this pull request Dec 24, 2022

How to implement a conditional reward? #146

Closed

ethankim00 and others added 3 commits January 4, 2023 20:06

apply delta model to base_model attribute instead of using subclass

cd78d1c

merge main

0f20a5d

Update regex layer pattern

7cdcf5c

jon-tow added 5 commits January 6, 2023 21:14

Ignore flake8 complexity warning on regex_for_range

2148ea2

Fix delta regex for num_layers_frozen = -1

b268ae3

Add bloom/gpt/opt to delta modifier map

a0902d2

Run pre-commit formatting

79977c3

Fix module name misspelling

cfdac0e

jon-tow added this to the v0.4.0 milestone Jan 7, 2023

LouisCastricato marked this pull request as ready for review January 8, 2023 17:20

LouisCastricato reviewed Jan 8, 2023

View reviewed changes

trlx/data/configs.py Outdated Show resolved Hide resolved

trlx/trainer/accelerate_base_trainer.py Show resolved Hide resolved

trlx/utils/modeling.py Show resolved Hide resolved

trlx/utils/modeling.py Outdated Show resolved Hide resolved

jon-tow added 2 commits January 8, 2023 18:12

Update delta config for generic args

fd6c88c

Run pre-commit

db26ce3

LouisCastricato approved these changes Jan 8, 2023

View reviewed changes

LouisCastricato merged commit e9c6c86 into CarperAI:main Jan 8, 2023

jon-tow mentioned this pull request Jan 8, 2023

Add optional dependency container file #170

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

initial commit for trlx LORA support #110

initial commit for trlx LORA support #110

ethankim00 commented Nov 23, 2022

LouisCastricato commented Nov 23, 2022

LouisCastricato commented Nov 23, 2022

Dahoas commented Dec 9, 2022

cat-state commented Dec 9, 2022

ethankim00 commented Dec 10, 2022

cat-state commented Dec 14, 2022 •

edited

Loading

LouisCastricato commented Dec 23, 2022

LouisCastricato commented Dec 23, 2022

ethankim00 commented Dec 23, 2022

LouisCastricato commented Dec 23, 2022

jon-tow commented Jan 6, 2023 •

edited

Loading

LouisCastricato commented Jan 7, 2023

LouisCastricato left a comment

initial commit for trlx LORA support #110

initial commit for trlx LORA support #110

Conversation

ethankim00 commented Nov 23, 2022

LouisCastricato commented Nov 23, 2022

LouisCastricato commented Nov 23, 2022

Dahoas commented Dec 9, 2022

cat-state commented Dec 9, 2022

ethankim00 commented Dec 10, 2022

cat-state commented Dec 14, 2022 • edited Loading

LouisCastricato commented Dec 23, 2022

LouisCastricato commented Dec 23, 2022

ethankim00 commented Dec 23, 2022

LouisCastricato commented Dec 23, 2022

jon-tow commented Jan 6, 2023 • edited Loading

LouisCastricato commented Jan 7, 2023

LouisCastricato left a comment

Choose a reason for hiding this comment

cat-state commented Dec 14, 2022 •

edited

Loading

jon-tow commented Jan 6, 2023 •

edited

Loading