Debugging the TensorFlow / CUDA error on AWS (ImportError: cannot open shared object file)

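Cause of error

This ImportError usually means that the installed tensorflow-gpu was built against a different CUDA version than the one /usr/local/cuda points to, so the versioned CUDA library it asks for cannot be found.
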
Check our versions

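# Show the installed tensorflow-gpu version
pip list | grep tensorflow-gpu
# Show which CUDA directory the /usr/local/cuda symlink points to
ls -l /usr/local/cuda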
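
The symlink check above can also be scripted. The following is a minimal sketch, assuming CUDA is installed under a directory named like /usr/local/cuda-9.0; cuda_version_from_path is a hypothetical helper written for this note, not part of any CUDA tool.

```shell
#!/usr/bin/env bash
# Sketch: report which CUDA version /usr/local/cuda resolves to.
# cuda_version_from_path is a hypothetical helper, not a CUDA command.

cuda_version_from_path() {
    # Drop a trailing slash, then keep what follows the last "cuda-",
    # e.g. /usr/local/cuda-9.0 -> 9.0
    local target="${1%/}"
    case "$target" in
        *cuda-*) printf '%s\n' "${target##*cuda-}" ;;
        *)       printf 'unknown\n' ;;
    esac
}

# Resolve the symlink; prints nothing if /usr/local/cuda does not exist.
if [ -e /usr/local/cuda ]; then
    cuda_version_from_path "$(readlink -f /usr/local/cuda)"
fi
```

If the directory does not follow the cuda-X.Y naming assumed here, the helper prints "unknown" rather than guessing.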

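Investigate issue

Either make the system CUDA match what tensorflow-gpu expects (Option 1) or install a tensorflow-gpu version that matches the system CUDA (Option 2).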

Option 1. Upgrading/Downgrading system CUDA

# Look at the current CUDA version (the symlink target)
ls -l /usr/local/cuda
# Check that the required CUDA version is installed
ls /usr/local/cuda-9.0
# Remove the symlink to the current CUDA version
sudo rm /usr/local/cuda
# Add a symlink to the new version (sudo is needed, as for the rm above)
sudo ln -s /usr/local/cuda-9.0 /usr/local/cuda
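
After switching the symlink, it may be worth confirming that the library TensorFlow asks for is actually present. The sketch below assumes the missing file named in the ImportError is libcublas.so.9.0 (the original error message is not quoted in full here, so substitute whatever name your traceback shows); has_cublas is a hypothetical helper.

```shell
#!/usr/bin/env bash
# Sketch: check that a CUDA tree ships the versioned cuBLAS library.
# has_cublas is a hypothetical helper; the library name is an assumption.

has_cublas() {
    # $1 = CUDA root directory, $2 = CUDA version such as 9.0
    [ -e "$1/lib64/libcublas.so.$2" ]
}

if has_cublas /usr/local/cuda 9.0; then
    echo "libcublas.so.9.0 found"
else
    echo "libcublas.so.9.0 missing under /usr/local/cuda/lib64"
fi
```

If the file is found but the import still fails, the dynamic loader may not be searching /usr/local/cuda/lib64; checking LD_LIBRARY_PATH is a reasonable next step.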

Option 2. Upgrading/Downgrading tensorflow-gpu

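# Remove the currently installed version
pip uninstall tensorflow-gpu
# Install a version that matches the system CUDA (1.4.0 expects CUDA 8.0)
pip install tensorflow-gpu==1.4.0
# Confirm the installed version
pip list | grep tensorflow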
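
Choosing which tensorflow-gpu version to pin depends on the CUDA version already on the machine. The sketch below encodes that lookup for 1.x releases only, assuming the official pip wheels for 1.0 to 1.4 target CUDA 8.0 and those for 1.5 to 1.12 target CUDA 9.0; required_cuda_for_tf is a hypothetical helper, and anything outside that range is reported as unknown rather than guessed.

```shell
#!/usr/bin/env bash
# Sketch: map a tensorflow-gpu 1.x version to the CUDA version its pip
# wheel is assumed to need. required_cuda_for_tf is a hypothetical helper.

required_cuda_for_tf() {
    case "$1" in
        1.[0-4]|1.[0-4].*)     echo "8.0" ;;      # 1.0 through 1.4
        1.[5-9].*|1.1[0-2].*)  echo "9.0" ;;      # 1.5 through 1.12
        *)                     echo "unknown" ;;  # not covered by this sketch
    esac
}

required_cuda_for_tf 1.4.0   # prints 8.0
```

Comparing this output with the version that /usr/local/cuda points to shows whether Option 1 or Option 2 is the smaller change.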

That's all.

