Troubleshooting Guide
This guide is created to aid users in resolving issues with their GPU-equipped Nosana Node configuration on Linux operating systems.
Error Messages
Nvidia
nvidia-smi: command not found
Solution
It means that you do not have NVIDIA drivers installed. To install them, download and install the correct drivers from the NVIDIA website: https://www.nvidia.com/download/index.aspx
Error: setting up CDI devices: unresolvable CDI devices nvidia.com/gpu=all
Solution
It means that you did not install and configure the Nvidia Container Toolkit correctly:
Docker
The command 'docker' could not be found.
Solution
Ensure that you have Docker installed and that it is running. Follow the Docker installation guide for Linux to install Docker on your system.
Podman
Error: Could not connect to Podman
Solution
When you see this error, check your Docker daemon configuration. Ensure Docker is properly installed and running on your Linux system.
Error: container create failed (no logs from conmon): conmon bytes "": readObjectStart: expect { or n, but found , error found in #0 byte of ...||..., bigger context ...||...
Solution
This error is caused by the latest version of conmon having known issues, downgrade conmon to resolve this, like this:
wget https://download.opensuse.org/repositories/devel:/kubic:/libcontainers:/stable/xUbuntu_22.04/amd64/conmon_2.1.2~0_amd64.deb -O /tmp/conmon_2.1.2.deb
sudo apt install /tmp/conmon_2.1.2.debThen you can rerun the podman command
podman run --rm --device nvidia.com/gpu=all --security-opt=label=disable ubuntu nvidia-smi -L