Port `mlp_image_classification.py` to all backends #663

guillaumebaquiast · 2023-08-03T19:47:14Z

Port mlp_image_classification to keras_core.

This implementation works with TensorFlow, Torch, and JAX (at least on my machine).

A few points below:

Reimplemented positional embedding
Just like in PR #602, the example wasn't running with the Torch backend. I got the same cryptic error message: `RuntimeError: Trying to backward through the graph a second time (or directly access saved tensors after they have already been freed). Saved intermediate values of the graph are freed when you call .backward() or autograd.grad(). Specify retain_graph=True if you need to backward through the graph a second time or if you need to access saved tensors after calling backward.`

To fix that, I had to re-implement the logic of positional embedding.
I believe that the original implementation of the positional embedding made Keras ignore the embedding layer. This didn't fail with TensorFlow or JAX (although it didn't behave as expected), but it did fail with Torch. See the difference in the FNet model summaries between the two implementations:

With the original implementation, the embedding layer is missing:

With the implementation from this PR, the embedding layer is present, and the code runs without error on a Torch backend:

If my hypothesis is correct, then a few other examples should get the same fix (e.g., cct.py, image_captioning.py, token_learner.py, video_transformer.py), and it should resolve issue #566.

Reimplemented Patches
This is to leverage the newly implemented keras_core.ops.image.extract_patches.
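For intuition, non-overlapping patch extraction amounts to a reshape followed by a transpose. The sketch below uses plain NumPy rather than keras_core.ops.image.extract_patches; the function name `extract_patches_np` is hypothetical, and the ordering of values inside each flattened patch is an assumption that may differ from what the op produces.

```python
import numpy as np


def extract_patches_np(images, patch_size):
    """Split (batch, H, W, C) images into flattened, non-overlapping patches.

    Hypothetical helper for illustration only; not the keras_core op.
    """
    batch, height, width, channels = images.shape
    if height % patch_size or width % patch_size:
        raise ValueError("height and width must be divisible by patch_size")
    rows, cols = height // patch_size, width // patch_size
    # Split each spatial axis into (number of patches, offset within patch).
    x = images.reshape(batch, rows, patch_size, cols, patch_size, channels)
    # Bring the two patch-index axes together: (batch, rows, cols, p, p, C).
    x = x.transpose(0, 1, 3, 2, 4, 5)
    # One row per patch, one column per pixel value inside the patch.
    return x.reshape(batch, rows * cols, patch_size * patch_size * channels)


images = np.arange(2 * 4 * 4 * 3, dtype="float32").reshape(2, 4, 4, 3)
patches = extract_patches_np(images, patch_size=2)
print(patches.shape)  # (2, 4, 12)
```

Each row of `patches` holds one 2x2 patch, so the first row of the first image equals `images[0, :2, :2, :]` flattened.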

Fixed some typos
Fixed minor typos, such as function arguments that weren't used anywhere.

File location
I left the file at the location keras_io/tensorflow/vision/mlp_image_classification for now, so that the diff displays nicely. I can move it under keras_io/vision before merging.

guillaumebaquiast · 2023-08-03T19:48:16Z

examples/keras_io/tensorflow/vision/mlp_image_classification.py

-            input_dim=num_patches, output_dim=embedding_dim
-        )(positions)
-        x = x + position_embedding
+        x = x + PositionEmbedding(sequence_length=num_patches)(x)


This change fixed the run on the Torch backend and made the embedding appear in the model summary.

guillaumebaquiast · 2023-08-03T19:48:50Z

examples/keras_io/tensorflow/vision/mlp_image_classification.py

+## Implement position embedding as a layer
+"""
+
+class PositionEmbedding(keras.layers.Layer):


This class was adapted from KerasNLP.
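As a rough illustration of what such a layer computes, here is a minimal NumPy sketch: one vector per position, broadcast over the batch and added to the input. It is not the class from this PR, nor KerasNLP's implementation; `PositionEmbeddingSketch`, its `seed` argument, and the initialization scale are hypothetical. In a real Keras layer the table would be a trainable weight owned by the layer, which is what lets it show up in the model summary.

```python
import numpy as np


class PositionEmbeddingSketch:
    """Holds one embedding vector per position (stand-in for a trainable weight)."""

    def __init__(self, sequence_length, seed=0):
        self.sequence_length = sequence_length
        self.seed = seed
        self.table = None  # created lazily, once the feature size is known

    def __call__(self, inputs):
        _, length, dim = inputs.shape
        if length > self.sequence_length:
            raise ValueError("input is longer than sequence_length")
        if self.table is None:
            rng = np.random.default_rng(self.seed)
            # Shape (sequence_length, dim): one row per position.
            self.table = rng.normal(scale=0.02, size=(self.sequence_length, dim))
        # Same position vectors for every example in the batch.
        return np.broadcast_to(self.table[:length], inputs.shape)


x = np.zeros((2, 4, 8))
layer = PositionEmbeddingSketch(sequence_length=4)
y = x + layer(x)  # mirrors `x = x + PositionEmbedding(...)(x)` from the diff
print(y.shape)  # (2, 4, 8)
```

Because the layer takes `x` itself as input, the position table is part of the forward computation of `x` rather than a side computation on a separately built range of positions.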

fchollet

The fix makes sense to me -- thank you for debugging this! LGTM

port mlp_image_classification.py to all backends

10d2535

guillaumebaquiast commented Aug 3, 2023

fchollet approved these changes Aug 4, 2023

fchollet merged commit 9d9b31b into keras-team:main Aug 4, 2023

guillaumebaquiast mentioned this pull request Aug 6, 2023

Port Compact Convolutional Transformers to backend agnostic #669

Merged

guillaumebaquiast mentioned this pull request Aug 15, 2023

Fix position encoder in examples/.../token_learner.py #727

Merged

guillaumebaquiast mentioned this pull request Aug 23, 2023

possible bug: code working with tensorflow/jax backends but not with pytorch #566

Closed
