Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Gt4py is used as a submodule #64

Merged
merged 55 commits into from
Jan 12, 2022
Merged

Gt4py is used as a submodule #64

merged 55 commits into from
Jan 12, 2022

Conversation

mcgibbon
Copy link
Collaborator

@mcgibbon mcgibbon commented Dec 10, 2021

This PR updates pace to use gt4py pinned as a submodule, rather than using GT4PY_VERSION.txt. After this PR, the gt4py caches will use the gt4py sha rather than tag in their GT4PY_VERSION.txt files.

Changes:

  • Gt4py is used as a submodule, rather than through GT4PY_VERSION.txt, resulting in the removal of duplicated logic to download gt4py and the addition of git submodule update --init commands
  • Python used for CircleCI is updated to 3.8, as required by gt4py
  • Requirements of gt4py are accounted for when compiling constraints.txt
  • constraints.txt is updated
  • gt4py is no longer installed as part of daint_venv, and is instead installed from pace
  • removed clang-format from requirements, as it is optional and is not available on mac os
  • updated tox in fv3core and pace-util to use the gt4py submodule, instead of a hardcoded tag pin

Continued from #56.

mcgibbon and others added 30 commits November 18, 2021 11:44
@mcgibbon
Copy link
Collaborator Author

Failed on final copy of caches with an error due to echoing gt4py version to the wrong file location, updated and resubmitting https://jenkins.ginko.ch/view/pace/job/pace-fv3core-cache-setup/143/

@mcgibbon
Copy link
Collaborator Author

Cancelled cache build job for now, realized the caching infrastructure currently only supports one gt4py tag at a time and that this would disrupt other PRs.

@mcgibbon
Copy link
Collaborator Author

Updated gt4py caching plans to use multiple tar.gz's for each GT4PY_VERSION. This should avoid breaking other PRs when running the caching plan. Launched caching plan https://jenkins.ginko.ch/view/pace/job/pace-fv3core-cache-setup/144/

@mcgibbon
Copy link
Collaborator Author

mcgibbon commented Jan 5, 2022

launch jenkins

@mcgibbon
Copy link
Collaborator Author

mcgibbon commented Jan 5, 2022

Caching plan likely passed since tests passed, but launched it again to see output in case an error happened later down the line. https://jenkins.ginko.ch/view/pace/job/pace-fv3core-cache-setup/158/

@mcgibbon
Copy link
Collaborator Author

mcgibbon commented Jan 6, 2022

Resubmitting one caching configuration which had a "slurm error", possibly an intermittent issue. https://jenkins.ginko.ch/view/pace/job/pace-fv3core-cache-setup/160/

@@ -111,13 +117,8 @@ echo "Benchmark directory: ${BENCHMARK_DIR}"
echo "Data directory: ${DATA_DIR}"
echo "Perf. artifact directory: ${TIMING_DIR}"
echo "Profile artifact directory: ${PROFILE_DIR}"
echo "Cache directory: ${CACHE_DIR}"
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This was removed because CACHE_DIR is not used in this file, and because it gets printed out in the log already from the location where the variable is actually defined and used. Because this file doesn't use the variable, it could be incorrect without causing a failure.

@@ -2,24 +2,23 @@
BACKEND=$1
EXPNAME=$2
SANITIZED_BACKEND=`echo $BACKEND | sed 's/:/_/g'` #sanitize the backend from any ':'
CACHE_DIR="/scratch/snx3000/olifu/jenkins/scratch/store_gt_caches/"
CACHE_DIR="/scratch/snx3000/olifu/jenkins/scratch/store_gt_caches/${EXPNAME}/${SANITIZED_BACKEND}"
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Here I updated the caching to use a tar.gz that has the gt4py version in its file path, so that we can have simultaneous caches for different gt4py versions. This was needed to be able to test this PR (which involves generating the caches) without breaking other PRs.

@mcgibbon mcgibbon requested review from twicki and jdahm January 8, 2022 00:19
Copy link
Contributor

@jdahm jdahm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice work! There are several parts of this I don't fully understand the logic behind, but those seem to be infrastructure sections that are presumably well-covered by the unit and integration tests. Looks ready to go.

@mcgibbon
Copy link
Collaborator Author

launch jenkins

@mcgibbon
Copy link
Collaborator Author

@mcgibbon
Copy link
Collaborator Author

Performance test passed.

Copy link
Contributor

@twicki twicki left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why is the symlinkf from pace-util/contstrains.txt required? In the install we're alyways pointing at the base constraints file, right?

@@ -11,20 +11,22 @@ exitError()

# check a virtualenv path has been provided
test -n "$1" || exitError 1001 ${virtualenv_path} "must pass an argument"
pace_dir=`dirname $0`/../
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

are we sure this always works (from wherever we call it?) Should we not aim for something like we do with the jenkins dir?
(I see that this is just moved - but just curious?)

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am not sure that this always works, so I've been using the other pattern wherever I add new operations like this. But I've often kept the old pattern where it exists.

Should I update this to the new pattern?

Copy link
Contributor

@twicki twicki Jan 12, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm ok with leaving it as is.
I was mostly wondering because we have several new places where you've added pieces along the lines of

SCRIPT_DIR="$( cd -- "$( dirname -- "${BASH_SOURCE[0]}" )" &> /dev/null && pwd )"
PACE_DIR=$SCRIPT_DIR/../..

And we could have used that here as well. But not really necessary

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated.

@mcgibbon
Copy link
Collaborator Author

why is the symlinkf from pace-util/contstrains.txt required? In the install we're alyways pointing at the base constraints file, right?

I've fixed it and now removed the symlink. We were not always using the top level file. This is also the case for fv3core, but I'm addressing it in the sphinx docs PR.

@twicki
Copy link
Contributor

twicki commented Jan 12, 2022

sounds great, thanks

Copy link
Contributor

@twicki twicki left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

awesome, thanks a lot

@mcgibbon
Copy link
Collaborator Author

Launched performance test https://jenkins.ginko.ch/job/pace-fv3core-performance-test-PR/37/.

@mcgibbon mcgibbon merged commit ff7a37b into main Jan 12, 2022
@mcgibbon mcgibbon deleted the feature/gt4py_submodule branch January 12, 2022 19:42
@mcgibbon mcgibbon mentioned this pull request Jan 12, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants