
Update TrainConfig optimizer hyperparameters #82

Merged: 2 commits into CarperAI:master on Nov 7, 2022

Conversation

jon-tow
Collaborator

@jon-tow jon-tow commented Nov 6, 2022

This PR provides the following updates based on observations from issue #53:

  • Adds the previously unused weight_decay TrainConfig parameter to AccelerateRLModel optimizer initialization.
  • Adds an opt_eps TrainConfig parameter to the AccelerateRLModel to allow users to specify an optimizer's epsilon value.
  • Renames the learning_rate_init and learning_rate_target TrainConfig parameters to lr_init and lr_target, respectively, for consistency with learning-rate naming conventions in torch.optim algorithms.
  • Removes the unused lr_ramp_steps and lr_decay_steps TrainConfig parameters, which appear to be leftovers from the magiCARP lr scheduler.
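
The changes above can be sketched as follows. The field names (lr_init, lr_target, opt_eps, weight_decay) come from this PR, but the TrainConfig defaults and the kwargs helper here are illustrative, not the exact trlx code:

```python
from dataclasses import dataclass


@dataclass
class TrainConfig:
    # Fields renamed/added by this PR; the defaults shown are
    # illustrative, not necessarily the repo's actual values.
    lr_init: float = 1.0e-4       # was learning_rate_init
    lr_target: float = 1.0e-6     # was learning_rate_target
    opt_eps: float = 1.0e-8       # new: user-configurable optimizer epsilon
    weight_decay: float = 1.0e-6  # previously unused; now forwarded to the optimizer


def optimizer_kwargs(config: TrainConfig) -> dict:
    """Hypothetical helper showing how the new fields would flow into
    a torch.optim.AdamW(...) call in AccelerateRLModel."""
    return {
        "lr": config.lr_init,
        "eps": config.opt_eps,
        "weight_decay": config.weight_decay,
    }


print(optimizer_kwargs(TrainConfig()))
```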

@jon-tow jon-tow marked this pull request as draft November 7, 2022 04:19
@jon-tow jon-tow marked this pull request as ready for review November 7, 2022 05:50
@jon-tow jon-tow marked this pull request as draft November 7, 2022 05:52
@jon-tow jon-tow marked this pull request as ready for review November 7, 2022 17:05
@LouisCastricato
Contributor

@herbiebradley since this is for your issue.

@herbiebradley herbiebradley left a comment


Looks good! Can we change the default epsilon to 1.0e-7 (the default in TensorFlow), since the ablation paper found it to be significantly better?

Also, does anyone know where the weight decay of 1.0e-6 comes from, since it is significantly lower than the default?

@maxreciprocate
Collaborator

> Also, does anyone know where the weight decay of 1.0e-6 comes from, since it is significantly lower than the default?

From here https://github.com/EleutherAI/magiCARP/blob/3537b4d7c98a4a384964856740a65731746e04fe/configs/base_config.yml#L18

Collaborator

@maxreciprocate maxreciprocate left a comment


lgtm!

@LouisCastricato LouisCastricato merged commit 0270960 into CarperAI:master Nov 7, 2022
@jon-tow jon-tow deleted the update-optim-hparams branch November 7, 2022 22:55
@jon-tow
Collaborator Author

jon-tow commented Nov 7, 2022

@herbiebradley I intentionally left the default $\epsilon$ at 1.0e-8, as I wasn't 100% sure the improvement would carry over directly to AdamW (we do not use vanilla Adam). I'd like to run some experiments before updating it in another PR, if that's okay.
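
The sensitivity under discussion can be illustrated numerically: an Adam/AdamW update has magnitude roughly lr * m_hat / (sqrt(v_hat) + eps), so raising eps from 1.0e-8 to 1.0e-7 mostly damps updates for parameters whose second-moment estimate v_hat is tiny, while leaving typical parameters nearly unchanged. This is illustrative arithmetic, not trlx code:

```python
import math


def adam_step_size(lr: float, m_hat: float, v_hat: float, eps: float) -> float:
    # Magnitude of a single Adam/AdamW parameter update,
    # given bias-corrected first/second moment estimates.
    return lr * m_hat / (math.sqrt(v_hat) + eps)


lr, m_hat = 1.0e-4, 1.0e-4
for v_hat in (1.0e-4, 1.0e-12):  # "typical" vs near-zero second moment
    for eps in (1.0e-8, 1.0e-7):
        print(f"v_hat={v_hat:g} eps={eps:g} step={adam_step_size(lr, m_hat, v_hat, eps):.3e}")
```

For v_hat = 1.0e-4 the two epsilons give almost identical steps, but for v_hat = 1.0e-12 the larger epsilon visibly shrinks the update, which is one way the choice of default can matter.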

@jon-tow
Collaborator Author

jon-tow commented Nov 7, 2022

cc #76 for awareness

@ayulockin
Contributor

Thanks for tagging the issue @jon-tow.
