Relax gradient assertion for mish activation #1415

seanpmorgan · 2020-03-26T01:47:16Z

Dug into this a bit and found that this check could conceivably fail for TF2.1. It does empirically appear more common on TF2.2, and I don't have an answer about what could have changed. As previously discussed we have no guarantee the py gradient should behave the exact same at this precision:
#1220 (comment)

Part of #1320

bot-of-gabrieldemarmiesse · 2020-03-26T01:47:46Z

@digantamisra98

You are owner of some files modified in this pull request.
Would you kindly review the changes whenever you have the time to?
Thank you very much.

gabrieldemarmiesse · 2020-03-26T08:32:45Z

I also noticed that only very few values were different (<5%) when printing the arrays. I guess it's ok then.

Relax gradient assertion for mish activation

7626aec

seanpmorgan requested a review from WindQAQ as a code owner March 26, 2020 01:47

boring-cyborg bot added the activations label Mar 26, 2020

googlebot added the cla: yes label Mar 26, 2020

gabrieldemarmiesse approved these changes Mar 26, 2020

View reviewed changes

gabrieldemarmiesse merged commit 98a4a70 into tensorflow:master Mar 26, 2020

seanpmorgan deleted the mish-2.2 branch March 26, 2020 12:26

jrruijli pushed a commit to jrruijli/addons that referenced this pull request Dec 23, 2020

Relax gradient assertion for mish activation (tensorflow#1415)

58ff6bf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Relax gradient assertion for mish activation #1415

Relax gradient assertion for mish activation #1415

seanpmorgan commented Mar 26, 2020

bot-of-gabrieldemarmiesse commented Mar 26, 2020

gabrieldemarmiesse commented Mar 26, 2020

Relax gradient assertion for mish activation #1415

Relax gradient assertion for mish activation #1415

Conversation

seanpmorgan commented Mar 26, 2020

bot-of-gabrieldemarmiesse commented Mar 26, 2020

gabrieldemarmiesse commented Mar 26, 2020