Fix bugged gradients when combiner == 'MEAN' #2505

Rocketknight1 · 2021-06-19T16:13:40Z

Rocketknight1 · 2021-06-19T16:30:55Z

Wait hang on, I'm dumb. Fixing the PR!

…city

bhack · 2021-06-19T17:06:43Z

tensorflow_addons/layers/embedding_bag.py

+        weights = tf.ones_like(indices, dtype=params.dtype) / tf.cast(
+            tf.shape(indices)[1], params.dtype
+        )
+        combiner = "sum"


Why we have this combiner overriding?

It was a quick workaround, but the output and grads are correct, and performance is fine! Since combiner == "mean" does not support weights, we can get the right results by creating a dummy weight array (we do that anyway), then scaling it so that we get an unweighted mean instead of an unweighted sum.

Maybe you should call tf.reduce_mean after custom op instead of integrating with it or combiner overriding.

bhack · 2021-06-19T17:47:23Z

The GPU test is failing.

Rocketknight1 · 2021-06-19T18:05:26Z

lol. This one is just a simple issue - the tolerances for the equality comparison are too tight for the float16 tests. Give me a sec and I'll fix it.

tensorflow_addons/layers/tests/embedding_bag_test.py

fsx950223 · 2021-06-20T02:58:28Z

Combiner is not used in

addons/tensorflow_addons/custom_ops/layers/cc/kernels/embedding_bag_backward_kernels.cu.cc

Line 151 in 97eb293

Combiner combiner, OpKernelContext *context) {

, which is different with cpu custom op.
Maybe you should call tf.reduce_mean after custom op instead of integrating with it.

fsx950223 · 2021-06-20T04:06:31Z

tensorflow_addons/layers/embedding_bag.py

@@ -49,8 +49,13 @@ def _embedding_bag(
    Returns:
      A `Tensor` of the format specified by `data_format`.
    """
-    if weights is None:
+    if weights is None and combiner == "sum":


Use reduction instead of combiner

The combiner is combining part of the layer and applies to the inputs/weights, not the output/loss! Is reduction still the right approach there?

Could you add combiner document?

seanpmorgan

Going to merge this as CI is broken without it. If there are any issues can we follow up in next PR. Cursory glance LGTM.

fsx950223 · 2021-07-01T04:05:09Z

Going to merge this as CI is broken without it. If there are any issues can we follow up in next PR. Cursory glance LGTM.

I can't see ubuntu gpu CI report in the PR, could you check it?

seanpmorgan · 2021-07-01T04:08:58Z

Going to merge this as CI is broken without it. If there are any issues can we follow up in next PR. Cursory glance LGTM.

I can't see ubuntu gpu CI report in the PR, could you check it?

Yeah landed on master (Shown by badge on central README): https://source.cloud.google.com/results/invocations/5a171fa3-3bad-4859-9a05-414e4c155851

fsx950223 · 2021-07-01T04:10:30Z

https://source.cloud.google.com/results/invocations/5a171fa3-3bad-4859-9a05-414e4c155851

How could I check it in the review process? Should I record the url? I could get the info in previous PRs.

Rocketknight1 · 2021-07-04T15:57:47Z

Hi, thanks for merging, and sorry for my slow replies! I've only been able to get to this at the weekends, but I am committed to maintaining it - it's unfortunate that we had to rush because of the broken CI, but I do think this PR should resolve most of the issues, and I see the follow-up PR is already open!

seanpmorgan · 2021-07-10T01:14:45Z

Hi, thanks for merging, and sorry for my slow replies! I've only been able to get to this at the weekends, but I am committed to maintaining it - it's unfortunate that we had to rush because of the broken CI, but I do think this PR should resolve most of the issues, and I see the follow-up PR is already open!

Appreciate the maintainership @Rocketknight1 ! Its volunteer work so no need to apologize for slow responses :)

Fix bugged gradients when combiner == 'MEAN'

2d2096f

boring-cyborg bot added the layers label Jun 19, 2021

google-cla bot added the cla: yes label Jun 19, 2021

Black style pass

6bd366e

Rocketknight1 added 2 commits June 19, 2021 17:48

Compute mean grads using sum + tweaks to the weight matrix for simpli…

3aa8fbf

…city

Black style pass

02afcd5

bhack added the kokoro:force-run label Jun 19, 2021

kokoro-team removed the kokoro:force-run label Jun 19, 2021

bhack reviewed Jun 19, 2021

View reviewed changes

bhack requested a review from fsx950223 June 19, 2021 17:47

Relaxing float16 tolerances for embedding_bag test

38a6eff

bhack added the kokoro:force-run label Jun 19, 2021

kokoro-team removed the kokoro:force-run label Jun 19, 2021

fsx950223 reviewed Jun 20, 2021

View reviewed changes

tensorflow_addons/layers/tests/embedding_bag_test.py Show resolved Hide resolved

fsx950223 reviewed Jun 20, 2021

View reviewed changes

Use maybe_run_functions_eagerly decorator

18e4acf

seanpmorgan approved these changes Jul 1, 2021

View reviewed changes

seanpmorgan merged commit 54a8720 into tensorflow:master Jul 1, 2021

Fix bugged gradients when combiner == 'MEAN' #2505

Fix bugged gradients when combiner == 'MEAN' #2505

Uh oh!

Conversation

Rocketknight1 commented Jun 19, 2021

Uh oh!

Rocketknight1 commented Jun 19, 2021

Uh oh!

bhack Jun 19, 2021

Choose a reason for hiding this comment

Uh oh!

Rocketknight1 Jun 19, 2021

Choose a reason for hiding this comment

Uh oh!

fsx950223 Jun 28, 2021

Choose a reason for hiding this comment

Uh oh!

bhack commented Jun 19, 2021

Uh oh!

Rocketknight1 commented Jun 19, 2021

Uh oh!

Uh oh!

fsx950223 commented Jun 20, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fsx950223 Jun 20, 2021

Choose a reason for hiding this comment

Uh oh!

Rocketknight1 Jun 26, 2021

Choose a reason for hiding this comment

Uh oh!

fsx950223 Jun 28, 2021

Choose a reason for hiding this comment

Uh oh!

seanpmorgan left a comment

Choose a reason for hiding this comment

Uh oh!

fsx950223 commented Jul 1, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

seanpmorgan commented Jul 1, 2021

Uh oh!

fsx950223 commented Jul 1, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Rocketknight1 commented Jul 4, 2021

Uh oh!

seanpmorgan commented Jul 10, 2021

Uh oh!

Uh oh!

fsx950223 commented Jun 20, 2021 •

edited

Loading

fsx950223 commented Jul 1, 2021 •

edited

Loading

fsx950223 commented Jul 1, 2021 •

edited

Loading