Conversation

@casparvl
Collaborator

@casparvl casparvl commented Oct 30, 2025

I think we should deploy the script from EESSI/software-layer-scripts#120 through this current PR, then change the build.sh back to its original form. The issue is that EESSI/software-layer-scripts#120 can't be deployed there, because no software is built, and thus no "No missing installations" message is printed. This causes the bot to consider the build step a 'failure'.
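
For illustration, the bot's build verdict can be thought of as a set of forbidden and required patterns in the Slurm output. The following is a minimal sketch, assuming the patterns listed in the "Details" section of the bot's job reports; the function name `check_build_log` and the sample log text are hypothetical, and the bot's actual implementation may differ:

```python
import re

# Patterns taken from the "Details" lists in the bot reports; how the bot
# really evaluates them is an assumption made for illustration.
FORBIDDEN = [r"FATAL:", r"ERROR:", r"FAILED:", r"required modules missing:"]
REQUIRED = [r"No missing installations", r"\.tar\.gz created!"]


def check_build_log(text):
    """Return (success, problems) for the given job output text."""
    problems = []
    for pattern in FORBIDDEN:
        if re.search(pattern, text):
            problems.append(f"found message matching {pattern}")
    for pattern in REQUIRED:
        if not re.search(pattern, text):
            problems.append(f"no message matching {pattern}")
    return (not problems, problems)


# Hypothetical output of a job that only deploys scripts and builds nothing:
log = "Deploying scripts...\n/tmp/example.tar.gz created!\n"
ok, problems = check_build_log(log)
print(ok, problems)
```

Under these assumptions, a job that builds no software fails the check only because the required "No missing installations" line never appears, even though nothing actually went wrong.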

@casparvl
Collaborator Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/intel/icelake,accel=nvidia/cc80

@eessi-bot-surf

eessi-bot-surf bot commented Oct 30, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: intel-icelake and accelerator nvidia/cc80
Building for: x86_64/intel/icelake and accelerator nvidia/cc80
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.10/pr_1278/15643733

date job status comment
Oct 30 13:02:23 UTC 2025 submitted job id 15643733 will be eligible to start in about 20 seconds
Oct 30 13:02:30 UTC 2025 received job awaits launch by Slurm scheduler
Oct 30 13:02:55 UTC 2025 running job 15643733 is running
Oct 30 13:04:05 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-15643733.out
✅ no message matching FATAL:
❌ found message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc80-17618294030.tar.gz
size: 0 MiB (45 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
no other files in tarball
Oct 30 13:04:05 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-15643733.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Collaborator Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/intel/icelake,accel=nvidia/cc80

@eessi-bot-surf

eessi-bot-surf bot commented Oct 30, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: intel-icelake and accelerator nvidia/cc80
Building for: x86_64/intel/icelake and accelerator nvidia/cc80
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.10/pr_1278/15644119

date job status comment
Oct 30 13:16:00 UTC 2025 submitted job id 15644119 will be eligible to start in about 20 seconds
Oct 30 13:16:11 UTC 2025 received job awaits launch by Slurm scheduler
Oct 30 13:16:24 UTC 2025 running job 15644119 is running
Oct 30 13:17:58 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-15644119.out
✅ no message matching FATAL:
❌ found message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc80-17618302260.tar.gz
size: 0 MiB (421 bytes)
entries: 1
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 13:17:58 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-15644119.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Collaborator Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/intel/icelake,accel=nvidia/cc80

@eessi-bot-surf

eessi-bot-surf bot commented Oct 30, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: intel-icelake and accelerator nvidia/cc80
Building for: x86_64/intel/icelake and accelerator nvidia/cc80
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.10/pr_1278/15644178

date job status comment
Oct 30 13:19:57 UTC 2025 submitted job id 15644178 will be eligible to start in about 20 seconds
Oct 30 13:20:03 UTC 2025 received job awaits launch by Slurm scheduler
Oct 30 13:20:26 UTC 2025 running job 15644178 is running
Oct 30 13:46:57 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-15644178.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc80-17618319710.tar.gz
size: 0 MiB (420 bytes)
entries: 1
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 13:46:57 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-15644178.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Collaborator Author

casparvl commented Oct 30, 2025

Failure in the cuDNN host_injections installation because it doesn't contain PTX code (fixed in EESSI/software-layer-scripts@e25b625 and bf2fc9c)

Also, another failure:

ERROR: Failed to create directory /cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all: [Errno 30] Read-only file system: '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel'

Not sure what's wrong here. We may be missing a mkdir -p, because this dir is not there yet since this is the first GPU software we install in this prefix. However, I thought we hit the same issue in 2023.06 and we fixed that - but it's been too long to remember. It might also be that we have a mkdir -p and that this is simply the error it hits when creating that dir...
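
To make the `mkdir -p` question concrete: the `[Errno 30]` text has the shape of a Python `OSError` with `errno.EROFS`, and the path it names is the first missing parent, which is what a recursive directory creation would report if that parent sits on a read-only filesystem. A minimal sketch, assuming a hypothetical helper `ensure_dir`; this is not the code used in software-layer-scripts or EasyBuild:

```python
import errno
import os
import tempfile


def ensure_dir(path):
    """Create path and any missing parents, like `mkdir -p`.

    Returns None on success, or a short reason string if the
    filesystem is read-only.
    """
    try:
        os.makedirs(path, exist_ok=True)
    except OSError as err:
        if err.errno == errno.EROFS:
            # Parents are created top-down, so err.filename is the first
            # directory that could not be created.
            return f"read-only file system at {err.filename}"
        raise
    return None


# Demonstration in a writable temporary directory (the path is made up).
with tempfile.TemporaryDirectory() as tmp:
    target = os.path.join(tmp, "accel", "nvidia", "cc80", "modules", "all")
    print(ensure_dir(target))        # first call creates all five levels
    print(ensure_dir(target))        # second call is a no-op
    print(os.path.isdir(target))
```

If this reading is right, the error would mean the recursive creation was attempted but the overlay was not writable at that moment, rather than a `mkdir -p` being missing.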

@casparvl
Collaborator Author

Added some extra verbosity in EESSI/software-layer-scripts@54bd9ad, let's see

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/intel/icelake,accel=nvidia/cc80

@eessi-bot-surf

eessi-bot-surf bot commented Oct 30, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: intel-icelake and accelerator nvidia/cc80
Building for: x86_64/intel/icelake and accelerator nvidia/cc80
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.10/pr_1278/15645493

date job status comment
Oct 30 14:26:09 UTC 2025 submitted job id 15645493 will be eligible to start in about 20 seconds
Oct 30 14:26:15 UTC 2025 received job awaits launch by Slurm scheduler
Oct 30 14:26:39 UTC 2025 running job 15645493 is running
Oct 30 15:05:14 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-15645493.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc80-17618356610.tar.gz
size: 6872 MiB (7206354023 bytes)
entries: 12679
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.10.1.4-CUDA-12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.10.1.4-CUDA-12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/reprod
CUDA/12.6.0/20251030_143907UTC
CUDA/12.8.0/20251030_144306UTC
cuDNN/9.10.1.4-CUDA-12.8.0/20251030_144727UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_144502UTC
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 15:05:14 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-15645493.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Collaborator Author

casparvl commented Oct 30, 2025

Making it verbose seems to have solved the issue. That is, of course, impossible, but... things are working now:

mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel'
mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia'
mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80'
mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules'
mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all'
...
== COMPLETED: Installation ended successfully (took 3 mins 3 secs)
== Results of the build can be found in the log file(s) /cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software/CUDA/12.6.0/easybuild/easybuild-CUDA-12.6.0-20251030.153905.log.bz2

So maybe this was just one more of unionfs's hiccups?

@casparvl
Collaborator Author

Let's get all of those host-injections installed...

All bots that run native builds (one architecture per bot is sufficient)

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-vsc-ugent for:arch=x86_64/intel/cascadelake,accel=nvidia/cc70
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-jsc for:arch=aarch64/nvidia/grace,accel=nvidia/cc90

x86_64 and arm archs on AWS bot:

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc70
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=x86_64/generic for:arch=x86_64/generic,accel=nvidia/cc70
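
The build commands above follow a simple `key:value` layout, where `on:` and `for:` carry comma-separated `name=value` pairs. As a rough sketch of that layout (the function `parse_build_command` is hypothetical and is not the bot's own parser):

```python
def parse_build_command(line):
    """Split a 'bot: build ...' comment line into its key:value arguments."""
    prefix = "bot: build "
    if not line.startswith(prefix):
        raise ValueError(f"not a build command: {line!r}")
    args = {}
    for token in line[len(prefix):].split():
        key, _, value = token.partition(":")
        if key in ("on", "for"):
            # These hold comma-separated name=value pairs,
            # e.g. arch=x86_64/amd/zen2,accel=nvidia/cc70
            value = dict(item.split("=", 1) for item in value.split(","))
        args[key] = value
    return args


cmd = parse_build_command(
    "bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws "
    "on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc70"
)
print(cmd["repo"])
print(cmd["for"])
```

Having the arguments in a dictionary makes it easy to sanity-check a command before posting it, for example that `cmd["repo"]` names the intended EESSI version and that `cmd["for"]["arch"]` is the intended architecture.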

@eessi-bot-jsc

eessi-bot-jsc bot commented Oct 30, 2025

New job on instance eessi-bot-jsc for repository eessi.io-2025.06-software
Building on: nvidia-grace and accelerator nvidia/cc90
Building for: aarch64/nvidia/grace and accelerator nvidia/cc90
Job dir: /p/project1/ceasybuilders/eessibot/jobs/2025.10/pr_1278/14161581

date job status comment
Oct 30 15:39:49 UTC 2025 submitted job id 14161581 awaits release by job manager
Oct 30 15:40:07 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 15:41:11 UTC 2025 running job 14161581 is running
Oct 30 17:17:12 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-14161581.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-aarch64-nvidia-grace-accel-nvidia-cc90-17618429730.tar.gz
size: 5980 MiB (6271255423 bytes)
entries: 8879
modules under 2025.06/software/linux/aarch64/nvidia/grace/accel/nvidia/cc90/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/aarch64/nvidia/grace/accel/nvidia/cc90/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/aarch64/nvidia/grace/accel/nvidia/cc90/reprod
CUDA/12.6.0/20251030_161626UTC
CUDA/12.8.0/20251030_163743UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_163905UTC
other under 2025.06/software/linux/aarch64/nvidia/grace/accel/nvidia/cc90
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 17:17:12 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-14161581.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator nvidia/cc70
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100486

date job status comment
Oct 30 15:39:50 UTC 2025 submitted job id 100486 awaits release by job manager
Oct 30 15:40:11 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 15:46:16 UTC 2025 running job 100486 is running
Oct 30 16:20:40 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-100486.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-accel-nvidia-cc70-17618404130.tar.gz
size: 6197 MiB (6498851361 bytes)
entries: 12594
modules under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/reprod
CUDA/12.6.0/20251030_155004UTC
CUDA/12.8.0/20251030_155518UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_155800UTC
other under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 16:20:40 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-100486.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: generic
Building for: x86_64/generic and accelerator nvidia/cc70
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100487

date job status comment
Oct 30 15:39:56 UTC 2025 submitted job id 100487 awaits release by job manager
Oct 30 15:40:13 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 15:46:18 UTC 2025 running job 100487 is running
Oct 30 16:33:58 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-100487.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-generic-accel-nvidia-cc70-17618411070.tar.gz
size: 6872 MiB (7206333085 bytes)
entries: 12679
modules under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc70/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.10.1.4-CUDA-12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc70/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.10.1.4-CUDA-12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc70/reprod
CUDA/12.6.0/20251030_160705UTC
CUDA/12.8.0/20251030_161220UTC
cuDNN/9.10.1.4-CUDA-12.8.0/20251030_161645UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_161420UTC
other under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc70
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 16:33:58 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-100487.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl casparvl added 2025.06-software.eessi.io 2025.06 version of software.eessi.io accel:nvidia labels Oct 30, 2025
@casparvl
Collaborator Author

casparvl commented Oct 30, 2025

Edit: not sure why the previous build failed. The installations in the host_injections failed with a message that the lock file was already present. That's very strange, there should not be a lock file in the host_injections...

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc70

@eessi-bot-aws

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator nvidia/cc70
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100489

date job status comment
Oct 30 21:20:43 UTC 2025 submitted job id 100489 awaits release by job manager
Oct 30 21:21:32 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 21:22:34 UTC 2025 running job 100489 is running
Oct 30 21:50:35 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-100489.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-accel-nvidia-cc70-17618601690.tar.gz
size: 6872 MiB (7206289626 bytes)
entries: 12679
modules under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.10.1.4-CUDA-12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.10.1.4-CUDA-12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/reprod
CUDA/12.6.0/20251030_212519UTC
CUDA/12.8.0/20251030_212935UTC
cuDNN/9.10.1.4-CUDA-12.8.0/20251030_213418UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_213140UTC
other under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 21:50:35 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-100489.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Collaborator Author

Oh crap, I see the issue: the other build was x86_64/generic, while I intended to start it on ARM...

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws for:arch=aarch64/generic

@eessi-bot-aws

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2023.06-software
Building on: generic
Building for: aarch64/generic
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100490

date job status comment
Oct 30 21:24:47 UTC 2025 submitted job id 100490 awaits release by job manager
Oct 30 21:25:39 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 21:30:47 UTC 2025 running job 100490 is running
Oct 30 21:33:55 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-100490.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-17618598470.tar.gz
size: 0 MiB (45 bytes)
entries: 0
modules under 2023.06/software/linux/aarch64/generic/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/generic/software
no software packages in tarball
reprod directories under 2023.06/software/linux/aarch64/generic/reprod
no reprod directories in tarball
other under 2023.06/software/linux/aarch64/generic
no other files in tarball
Oct 30 21:33:55 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] ( 1/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/29Aug2024-foss-2023b-kokkos %scale=1_node /aeb2d9df @BotBuildTests:aarch64_generic+default
P: perf: 699.891 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 2/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:aarch64_generic+default
P: perf: 697.918 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 3/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /775175bf @BotBuildTests:aarch64_generic+default
P: latency: 3.24 us (r:0, l:None, u:None)
[ OK ] ( 4/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /52707c40 @BotBuildTests:aarch64_generic+default
P: latency: 3.47 us (r:0, l:None, u:None)
[ OK ] ( 5/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /b1aacda9 @BotBuildTests:aarch64_generic+default
P: latency: 5.51 us (r:0, l:None, u:None)
[ OK ] ( 6/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /c6bad193 @BotBuildTests:aarch64_generic+default
P: latency: 5.57 us (r:0, l:None, u:None)
[ OK ] ( 7/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:aarch64_generic+default
P: latency: 0.44 us (r:0, l:None, u:None)
[ OK ] ( 8/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:aarch64_generic+default
P: latency: 0.46 us (r:0, l:None, u:None)
[ OK ] ( 9/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:aarch64_generic+default
P: bandwidth: 20802.34 MB/s (r:0, l:None, u:None)
[ OK ] (10/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:aarch64_generic+default
P: bandwidth: 20541.79 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 10/10 test case(s) from 10 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-100490.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Collaborator Author

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws for:arch=aarch64/generic

@eessi-bot-aws

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2023.06-software
Building on: generic
Building for: aarch64/generic
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100491

date job status comment
Oct 30 21:37:29 UTC 2025 submitted job id 100491 awaits release by job manager
Oct 30 21:38:02 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 21:39:07 UTC 2025 running job 100491 is running
Oct 30 21:41:12 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-100491.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-17618603120.tar.gz
size: 0 MiB (45 bytes)
entries: 0
modules under 2023.06/software/linux/aarch64/generic/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/generic/software
no software packages in tarball
reprod directories under 2023.06/software/linux/aarch64/generic/reprod
no reprod directories in tarball
other under 2023.06/software/linux/aarch64/generic
no other files in tarball
Oct 30 21:41:12 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] ( 1/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/29Aug2024-foss-2023b-kokkos %scale=1_node /aeb2d9df @BotBuildTests:aarch64_generic+default
P: perf: 696.808 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 2/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:aarch64_generic+default
P: perf: 706.728 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 3/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /775175bf @BotBuildTests:aarch64_generic+default
P: latency: 3.51 us (r:0, l:None, u:None)
[ OK ] ( 4/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /52707c40 @BotBuildTests:aarch64_generic+default
P: latency: 3.5 us (r:0, l:None, u:None)
[ OK ] ( 5/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /b1aacda9 @BotBuildTests:aarch64_generic+default
P: latency: 5.45 us (r:0, l:None, u:None)
[ OK ] ( 6/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /c6bad193 @BotBuildTests:aarch64_generic+default
P: latency: 5.62 us (r:0, l:None, u:None)
[ OK ] ( 7/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:aarch64_generic+default
P: latency: 0.45 us (r:0, l:None, u:None)
[ OK ] ( 8/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:aarch64_generic+default
P: latency: 0.44 us (r:0, l:None, u:None)
[ OK ] ( 9/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:aarch64_generic+default
P: bandwidth: 20690.98 MB/s (r:0, l:None, u:None)
[ OK ] (10/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:aarch64_generic+default
P: bandwidth: 20851.54 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 10/10 test case(s) from 10 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-100491.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Collaborator Author

Wrong version...

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws for:arch=aarch64/generic

@laraPPr
Collaborator

laraPPr commented Dec 1, 2025

Weird: ERROR: Clone of the test suite /eessi_bot_job/EESSI-test-suite is not available!

@casparvl
Collaborator Author

casparvl commented Dec 1, 2025

That seems completely unrelated to the update to the software-layer-scripts in EESSI/software-layer-scripts#124. I seem to vaguely remember you had this issue before, no?

@laraPPr
Collaborator

laraPPr commented Dec 1, 2025

This seems like a fluke on GitHub's side, no?

Cloning into 'EESSI-test-suite'...
fatal: unable to access 'https://github.com/EESSI/test-suite/': Could not resolve host: github.com
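
One way to narrow down a "Could not resolve host" failure is to test name resolution from the build node itself. The sketch below is only an illustration: `can_resolve` and `failing_resolver` are hypothetical names, and the stand-in resolver is there so the example runs without network access:

```python
import socket


def can_resolve(host, resolver=socket.getaddrinfo):
    """Return True if host resolves, False on a name-resolution error."""
    try:
        resolver(host, 443)
    except socket.gaierror:
        return False
    return True


def failing_resolver(host, port):
    # Stand-in that mimics a DNS failure without touching the network.
    raise socket.gaierror(socket.EAI_NONAME, "Name or service not known")


print(can_resolve("github.com", resolver=failing_resolver))
```

Calling `can_resolve("github.com")` with the default resolver on the affected node would show whether that node can currently look up the host at all.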

@laraPPr
Collaborator

laraPPr commented Dec 1, 2025

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-vsc-ugent for:arch=x86_64/intel/cascadelake,accel=nvidia/cc70

@gpu-bot-ugent

gpu-bot-ugent bot commented Dec 1, 2025

New job on instance eessi-bot-vsc-ugent for repository eessi.io-2025.06-software
Building on: intel-cascadelake and accelerator nvidia/cc70
Building for: x86_64/intel/cascadelake and accelerator nvidia/cc70
Job dir: /scratch/gent/vo/002/gvo00211/SHARED/jobs/2025.12/pr_1278/40746922

date job status comment
Dec 01 14:18:50 UTC 2025 submitted job id 40746922 awaits release by job manager
Dec 01 14:19:13 UTC 2025 released job awaits launch by Slurm scheduler
Dec 01 14:21:20 UTC 2025 running job 40746922 is running
Dec 01 14:29:37 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-40746922.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-cascadelake-accel-nvidia-cc70-17645992440.tar.zst
size: 5122 MiB (5371360980 bytes)
entries: 12593
modules under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc70/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc70/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc70/reprod
CUDA/12.6.0/20251201_142223UTC
CUDA/12.8.0/20251201_142526UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251201_142711UTC
other under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc70
no other files in tarball
Dec 01 14:29:37 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-40746922.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@laraPPr
Collaborator

laraPPr commented Dec 1, 2025

Or a problem on the node where it ran

@laraPPr
Collaborator

laraPPr commented Dec 1, 2025

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-vsc-ugent for:arch=x86_64/amd/zen3,accel=nvidia/cc80

@gpu-bot-ugent

gpu-bot-ugent bot commented Dec 1, 2025

New job on instance eessi-bot-vsc-ugent for repository eessi.io-2023.06-software
Building on: amd-zen3 and accelerator nvidia/cc80
Building for: x86_64/amd/zen3 and accelerator nvidia/cc80
Job dir: /scratch/gent/vo/002/gvo00211/SHARED/jobs/2025.12/pr_1278/15563989

date job status comment
Dec 01 14:36:40 UTC 2025 submitted job id 15563989 awaits release by job manager
Dec 01 14:37:46 UTC 2025 released job awaits launch by Slurm scheduler
Dec 01 14:57:54 UTC 2025 running job 15563989 is running
Dec 01 14:59:56 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-15563989.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen3-accel-nvidia-cc80-17646010120.tar.zst
size: 0 MiB (22 bytes)
entries: 0
modules under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software
no software packages in tarball
reprod directories under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/reprod
no reprod directories in tarball
other under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80
no other files in tarball
Dec 01 14:59:56 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ SKIP ] (1/9) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (2/9) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (3/9) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (4/9) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (5/9) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ SKIP ] (6/9) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ SKIP ] (7/9) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ SKIP ] (8/9) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ OK ] (9/9) EESSI_LAMMPS_lj %device_type=gpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos-CUDA-12.1.1 %scale=1_4_node /497af4b1 @BotBuildTests:gpu_a100+default
P: perf: 4344.962 timesteps/s (r:0, l:None, u:None)
[ PASSED ] Ran 1/9 test case(s) from 9 check(s) (0 failure(s), 8 skipped, 0 aborted)
Details
✅ job output file slurm-15563989.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@laraPPr
Collaborator

laraPPr commented Dec 1, 2025

OK, so the tests seem to be fine. I don't know what is going wrong with the git clone on our Cascade Lake cluster today.

@casparvl
Collaborator Author

casparvl commented Dec 1, 2025

Ok, I'll try this once more, in order to test updates from EESSI/software-layer-scripts#124

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/intel/icelake,accel=nvidia/cc80

@eessi-bot-surf

eessi-bot-surf bot commented Dec 1, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: intel-icelake and accelerator nvidia/cc80
Building for: x86_64/intel/icelake and accelerator nvidia/cc80
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.12/pr_1278/16946934

date job status comment
Dec 01 15:44:47 UTC 2025 submitted job id 16946934 will be eligible to start in about 20 seconds
Dec 01 15:44:56 UTC 2025 received job awaits launch by Slurm scheduler
Dec 01 15:45:14 UTC 2025 running job 16946934 is running
Dec 01 15:57:23 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-16946934.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc80-17646045000.tar.zst
size: 5126 MiB (5375529320 bytes)
entries: 12593
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/reprod
CUDA/12.6.0/20251201_154903UTC
CUDA/12.8.0/20251201_155249UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251201_155447UTC
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
no other files in tarball
Dec 01 15:57:23 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-16946934.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@laraPPr
Collaborator

laraPPr commented Dec 1, 2025

We opened an issue for the git clone problem, EESSI/compatibility-layer#232. Maybe @bedroge can have a look.

@casparvl
Collaborator Author

casparvl commented Dec 2, 2025

bot:status last_build

@gpu-bot-ugent

gpu-bot-ugent bot commented Dec 2, 2025

This is the status of all the bot: build commands:

on for repo result date status url
generic aarch64/generic eessi.io-2025.06-software 😢 FAILURE Oct 30 22:12:07 UTC 2025 finished #1278 (comment)
generic aarch64/generic eessi.io-2023.06-software 😢 FAILURE Oct 30 21:41:12 UTC 2025 finished #1278 (comment)
generic aarch64/generic, nvidia/cc70 eessi.io-2025.06-software 😁 SUCCESS Nov 11 23:24:48 UTC 2025 finished #1278 (comment)
generic aarch64/generic, nvidia/cc80 eessi.io-2025.06-software 😁 SUCCESS Nov 12 08:45:50 UTC 2025 finished #1278 (comment)
neoverse_n1 aarch64/neoverse_n1, nvidia/cc70 eessi.io-2025.06-software 😁 SUCCESS Nov 11 23:23:36 UTC 2025 finished #1278 (comment)
neoverse_v1 aarch64/neoverse_v1, nvidia/cc70 eessi.io-2025.06-software 😁 SUCCESS Nov 11 23:22:25 UTC 2025 finished #1278 (comment)
nvidia-grace aarch64/nvidia/grace, nvidia/cc70 eessi.io-2025.06-software 😁 SUCCESS Nov 11 23:25:39 UTC 2025 finished #1278 (comment)
nvidia-grace aarch64/nvidia/grace, nvidia/cc80 eessi.io-2025.06-software 😁 SUCCESS Nov 12 08:42:57 UTC 2025 finished #1278 (comment)
nvidia-grace, nvidia/cc90 aarch64/nvidia/grace, nvidia/cc90 eessi.io-2025.06-software 😁 SUCCESS Nov 04 17:01:52 UTC 2025 finished #1278 (comment)
amd-zen2 x86_64/amd/zen2, nvidia/cc70 eessi.io-2025.06-software 😁 SUCCESS Nov 11 23:22:27 UTC 2025 finished #1278 (comment)
amd-zen2 x86_64/amd/zen2, nvidia/cc80 eessi.io-2025.06-software 😁 SUCCESS Nov 12 08:53:54 UTC 2025 finished #1278 (comment)
amd-zen3 x86_64/amd/zen3, nvidia/cc70 eessi.io-2025.06-software 😁 SUCCESS Nov 11 23:18:29 UTC 2025 finished #1278 (comment)
amd-zen3, nvidia/cc80 x86_64/amd/zen3, nvidia/cc80 eessi.io-2025.06-software 😁 SUCCESS Nov 06 15:28:59 UTC 2025 finished #1278 (comment)
amd-zen4 x86_64/amd/zen4, nvidia/cc70 eessi.io-2025.06-software 😁 SUCCESS Nov 11 23:12:37 UTC 2025 finished #1278 (comment)
amd-zen4 x86_64/amd/zen4, nvidia/cc80 eessi.io-2025.06-software 😁 SUCCESS Nov 12 08:45:54 UTC 2025 finished #1278 (comment)
amd-zen4, nvidia/cc90 x86_64/amd/zen4, nvidia/cc90 eessi.io-2025.06-software 😁 SUCCESS Nov 04 17:05:24 UTC 2025 finished #1278 (comment)
generic x86_64/generic, nvidia/cc70 eessi.io-2025.06-software 😁 SUCCESS Nov 11 23:25:51 UTC 2025 finished #1278 (comment)
generic x86_64/generic, nvidia/cc80 eessi.io-2025.06-software 😁 SUCCESS Nov 12 08:49:52 UTC 2025 finished #1278 (comment)
intel-cascadelake, nvidia/cc70 x86_64/intel/cascadelake, nvidia/cc70 eessi.io-2025.06-software 😁 SUCCESS Nov 06 15:31:03 UTC 2025 finished #1278 (comment)
intel-cascadelake x86_64/intel/cascadelake, nvidia/cc80 eessi.io-2025.06-software 😁 SUCCESS Nov 12 08:47:50 UTC 2025 finished #1278 (comment)
intel-haswell x86_64/intel/haswell, nvidia/cc70 eessi.io-2025.06-software 😁 SUCCESS Nov 11 23:23:39 UTC 2025 finished #1278 (comment)
intel-haswell x86_64/intel/haswell, nvidia/cc80 eessi.io-2025.06-software 😁 SUCCESS Nov 12 08:55:39 UTC 2025 finished #1278 (comment)
intel-icelake x86_64/intel/icelake, nvidia/cc70 eessi.io-2025.06-software 😁 SUCCESS Nov 11 23:21:09 UTC 2025 finished #1278 (comment)
intel-icelake, nvidia/cc80 x86_64/intel/icelake, nvidia/cc80 eessi.io-2025.06-software 😁 SUCCESS Nov 11 21:35:13 UTC 2025 finished #1278 (comment)
intel-sapphirerapids x86_64/intel/sapphirerapids, nvidia/cc70 eessi.io-2025.06-software 😁 SUCCESS Nov 11 23:29:59 UTC 2025 finished #1278 (comment)
intel-sapphirerapids x86_64/intel/sapphirerapids, nvidia/cc80 eessi.io-2025.06-software 😁 SUCCESS Nov 12 09:22:32 UTC 2025 finished #1278 (comment)
intel-skylake_avx512 x86_64/intel/skylake_avx512, nvidia/cc70 eessi.io-2025.06-software 😁 SUCCESS Nov 11 23:23:41 UTC 2025 finished #1278 (comment)
intel-skylake_avx512 x86_64/intel/skylake_avx512, nvidia/cc80 eessi.io-2025.06-software 😁 SUCCESS Nov 12 08:49:56 UTC 2025 finished #1278 (comment)

4 similar comments (the same status table, posted by @eessi-bot-surf, @eessi-bot-aws, @eessi-bot-jsc and @eessi-bot-deucalion)

@casparvl
Collaborator Author

casparvl commented Dec 2, 2025

Interesting, I guess we're exceeding some length limit with the bot:status last_build command, either in parsing the input data or in generating the output table. E.g. this build for neoverse + cc90 (#1278 (comment)) was definitely successful, but is not reported.

@casparvl
Collaborator Author

casparvl commented Dec 2, 2025

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc100

@eessi-bot-aws

eessi-bot-aws bot commented Dec 2, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2025.12/pr_1278/109615

date job status comment
Dec 02 10:34:52 UTC 2025 submitted job id 109615 awaits release by job manager
Dec 02 10:35:11 UTC 2025 released job awaits launch by Slurm scheduler
Dec 02 10:41:16 UTC 2025 running job 109615 is running
Dec 02 10:42:17 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-109615.out
✅ no message matching FATAL:
❌ found message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-accel-nvidia-cc100-17646720890.tar.zst
size: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc100/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc100/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc100/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc100
no other files in tarball
Dec 02 10:42:17 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-109615.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Collaborator Author

casparvl commented Dec 2, 2025

Ooof, it seems that EasyBuild does a hard check on the format of --cuda-compute-capabilities:

>> Active EasyBuild configuration when checking for missing installations:
ERROR: Failed to parse configuration options: "Found problems validating the options: Incorrect values in --cuda-compute-capabilities (expected pattern: '^[0-9]+\\.[0-9]+a?$'): 10.0f"

@casparvl
Collaborator Author

casparvl commented Dec 2, 2025

Hm, this surprised me, because I tried --cuda-compute-capabilities=9.0a manually to install cuDNN 9.10.1.4 (which only supports 9.0a over 9.0) and that worked. But now I see why: it expects an optional 'a' suffix, but not an 'f' suffix.

This will require a framework change I think :(
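
To make the mismatch concrete, here is a minimal Python sketch. The strict pattern is copied from the error message above (with the doubled backslash from the quoted string un-escaped); the relaxed pattern is only an assumption about what a fix could look like, not what the framework actually does, and `is_valid` is a hypothetical helper, not an EasyBuild function.

```python
import re

# Pattern as reported in the EasyBuild error message.
STRICT = re.compile(r"^[0-9]+\.[0-9]+a?$")

# Hypothetical relaxed pattern that also allows an 'f' suffix.
# This is an assumption for illustration, not the actual framework change.
RELAXED = re.compile(r"^[0-9]+\.[0-9]+[af]?$")


def is_valid(value, pattern):
    """Return True if the compute capability string matches the pattern."""
    return pattern.match(value) is not None


for cc in ("9.0", "9.0a", "10.0f"):
    print(cc, is_valid(cc, STRICT), is_valid(cc, RELAXED))
```

With the strict pattern, 9.0 and 9.0a are accepted and 10.0f is rejected, which matches what we see in the job output.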

@boegel
Contributor

boegel commented Dec 2, 2025

> Hm, this surprised me, because I tried --cuda-compute-capabilities=9.0a manually to install cuDNN 9.10.1.4 (which only supports 9.0a over 9.0) and that worked. But now I see why: it expects an optional 'a' suffix, but not an 'f' suffix.
>
> This will require a framework change I think :(

Please open a PR for this ASAP.

An EasyBuild release is way overdue, so I'll be closing the window of opportunity to get changes in that can be included in EasyBuild v5.2.0 very soon...

Seems like accepting 10.0f should be a trivial change, just making a regex less strict?

@bedroge
Collaborator

bedroge commented Dec 9, 2025

> > Hm, this surprised me, because I tried --cuda-compute-capabilities=9.0a manually to install cuDNN 9.10.1.4 (which only supports 9.0a over 9.0) and that worked. But now I see why: it expects an optional 'a' suffix, but not an 'f' suffix.
> > This will require a framework change I think :(
>
> Please open a PR for this ASAP.
>
> An EasyBuild release is way overdue, so I'll be closing the window of opportunity to get changes in that can be included in EasyBuild v5.2.0 very soon...
>
> Seems like accepting 10.0f should be a trivial change, just making a regex less strict?

@boegel Done in easybuilders/easybuild-framework#5067.

@casparvl
Collaborator Author

And now we wait for EB 5.2.0 :)

Labels

2025.06-software.eessi.io 2025.06 version of software.eessi.io accel:nvidia
