Skip to content

Conversation

tjhunter
Copy link
Collaborator

@tjhunter tjhunter commented Oct 1, 2025

Description

  • Major change: updates to cuda 12.6 . It should not cause major issues, but FYI
  • Integration test is failing: it is looking for anemoi data right now but we have not moved this directory yet.

Issue Number

Closes #1022

Checklist before asking for review

  • I have performed a self-review of my code
  • My changes comply with basic sanity checks:
    • I have fixed formatting issues with ./scripts/actions.sh lint
    • I have run unit tests with ./scripts/actions.sh unit-test
    • I have documented my code and I have updated the docstrings.
    • I have added unit tests, if relevant
  • I have tried my changes with data and code:
    • I have run the integration tests with ./scripts/actions.sh integration-test
    • (bigger changes) I have run a full training and I have written in the comment the run_id(s): launch-slurm.py --time 60
    • (bigger changes and experiments) I have shared a hegdedoc in the github issue with all the configurations and runs for this experiments
  • I have informed and aligned with people impacted by my change:
    • for config changes: the MatterMost channels and/or a design doc
    • for changes of dependencies: the MatterMost software development channel

@github-actions github-actions bot added infra Issues related to infrastructure initiative Large piece of work covering multiple sprint labels Oct 1, 2025
@tjhunter tjhunter marked this pull request as ready for review October 2, 2025 07:58
@Jubeku
Copy link
Contributor

Jubeku commented Oct 8, 2025

Tested successfully:

  • On Säntis:
    • uv run train
    • uv run inference
    • launch_slurm.py
  • On ATOS:
    • uv run train
    • uv run inference
    • launch_slurm.py

@tjhunter
Copy link
Collaborator Author

tjhunter commented Oct 8, 2025

Merging

@tjhunter tjhunter merged commit 87f6127 into develop Oct 8, 2025
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
infra Issues related to infrastructure initiative Large piece of work covering multiple sprint
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

Onboard santis
3 participants