Hello,
I am getting the following error when kspaceFirstOrder3DG
is called in MATLAB on my university's computing cluster, a description of the problem is mentioned below the error message:
kspaceFirstOrder3D-CUDA v1.2
Reading simulation configuration: Done
Selected GPU device id: Failed
!!! K-Wave experienced a fatal error !!!
Error: Insufficient CUDA driver version. Install the latest
drivers.
Execution terminated
Error using h5readc
The filename specified was either not found on the MATLAB path or it contains unsupported characters.
Error in h5read (line 58)
[data,var_class] = h5readc(Filename,Dataset,start,count,stride);
Error in kspaceFirstOrder3DC (line 394)
Nx = h5read(output_filename, '/Nx');
Error in kspaceFirstOrder3DG (line 64)
sensor_data = kspaceFirstOrder3DC(varargin{:}, 'BinaryName', binary_name);
Error in kwave_3d_GPU (line 213)
sensor_data = kspaceFirstOrder3DG(kgrid, medium, transducer, sensor, input_args{:});
For context, to get around a previous error involving the CentOS using an older version of glibc, a singularity container by Nvidia is being used to emulate Ubuntu 18.04 on CentOS. The container can see the video card, MATLAB can see the video card and drivers are at the latest version available from Nvidia. The cluster computing support team suggests the possibilities causing the error among: cuda being too new, being run inside a container, other libraries too old/new.
Please note the version of k-Wave being used is 1.2.1.
Following are the details pertaining to the GPU:
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 440.33.01 Driver Version: 440.33.01 CUDA Version: 10.2 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 Tesla V100-SXM2... Off | 00000000:18:00.0 Off | 0 |
| N/A 35C P0 56W / 300W | 0MiB / 16160MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| No running processes found |
+-----------------------------------------------------------------------------+
>> gpuDevice
ans =
CUDADevice with properties:
Name: 'Tesla V100-SXM2-16GB'
Index: 1
ComputeCapability: '7.0'
SupportsDouble: 1
DriverVersion: 10.2000
ToolkitVersion: 9
MaxThreadsPerBlock: 1024
MaxShmemPerBlock: 49152
MaxThreadBlockSize: [1024 1024 64]
MaxGridSize: [2.1475e+09 65535 65535]
SIMDWidth: 32
TotalMemory: 1.6946e+10
AvailableMemory: 1.6458e+10
MultiprocessorCount: 80
ClockRateKHz: 1530000
ComputeMode: 'Default'
GPUOverlapsTransfers: 1
KernelExecutionTimeout: 0
CanMapHostMemory: 1
DeviceSupported: 1
DeviceSelected: 1
Thank you.