Hello,
I am running hundreds of simultaneous k-Wave simulations on a Linux cluster. Each instance runs MATLAB on a compute node and executes the simulation using the compiled k-Wave code. When one simulation finishes, the instance starts the next one. So far this is working fine, but occasionally some instances crash while writing out the xxx_input.hdf5 file.
It's not a huge deal, but any idea what might be going wrong? Perhaps something is not installed correctly on some of the nodes? My script wraps each simulation in try/catch, so when the error occurs it moves on to the next simulation. However, once an instance hits this error, every subsequent simulation on that instance fails the same way, which makes me wonder whether the problem is specific to the node.
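One way to check the per-node hypothesis would be to record, at the start of each job, which host it runs on and whether the output directory is writable and has free space. The following is only a minimal sketch in Python, not part of the setup described above: the function name `node_write_check`, the 1 GiB threshold, and the assumption that the input file is written to the system temp directory are all illustrative. It does not exercise the HDF5 library itself, so it can only rule out basic disk or permission problems.

```python
import os
import shutil
import socket
import tempfile


def node_write_check(directory=None, min_free_bytes=1 << 30):
    """Report whether `directory` on this node looks usable for output files.

    Hypothetical helper: `directory` defaults to the system temp directory,
    which is an assumption about where the input file is written.
    """
    directory = directory or tempfile.gettempdir()
    report = {
        "host": socket.gethostname(),
        "directory": directory,
        "free_bytes": None,
        "writable": False,
        "error": None,
    }
    try:
        # Free space on the filesystem that holds the directory.
        report["free_bytes"] = shutil.disk_usage(directory).free
        # Create, write, and delete a small scratch file to confirm write access.
        with tempfile.NamedTemporaryFile(dir=directory, suffix=".tmp") as handle:
            handle.write(b"\0" * 4096)
            handle.flush()
            os.fsync(handle.fileno())
        report["writable"] = True
    except OSError as exc:
        report["error"] = str(exc)
    # The node passes only if the write succeeded and enough space is free.
    report["ok"] = report["writable"] and report["free_bytes"] >= min_free_bytes
    return report
```

Calling `print(node_write_check())` at the start of each job and comparing the output between working and failing instances would show whether the failures cluster on particular hosts, full disks, or unwritable directories.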
Here is what I see:
Running k-Wave simulation...
start time: 10-Dec-2021 17:31:03
reference sound speed: 2000m/s
dt: 48.7013ns, t_end: 133.7825us, time steps: 2748
input grid size: 44 by 44 by 492 grid points (14.2857 by 14.2857 by 159.7403mm)
maximum supported frequency: 2.31MHz
expanding computational grid...
computational grid size: 64 by 64 by 512 grid points
precomputation completed in 0.83197s
saving input files to disk...
ME =
MException with properties:
identifier: 'MATLAB:imagesci:hdf5lib:libraryError'
message: 'The HDF5 library encountered an error and produced the following stack trace information: ...'
cause: {0×1 cell}
stack: [8×1 struct]