EMLI CONTAINER STACK - QSG
AS OF 9/14/21
Congratulations on purchasing a docker integrated system for Deep Learning. Below is a Quick Start Guide for Docker and Nvidia-docker.
Docker images available on this system:
- nvidia/cuda
- nvidia/caffe
- nvidia/digits
- portainer
- tensorflow/tensorflow:latest-gpu
- PyTorch
- RapidsAI
root@u105724:~# docker pull nvcr.io/nvidia/cuda:9.1-devel |
View pulled Images on system
root@u105724:~# docker images |
View all containers on the system (including running and stopped)
[root@c101086 ~]# docker ps -a |
Run command inside of the container (interactively)
# to execute a shell within the container |
For additional docker images, please go to: https://hub.docker.com/
NVIDIA Digits
DIGITS Quickstart Script (found in the root's home folder Directory and /usr/local/bin)
This is now also loaded in /usr/local/bin/startDigits so you may run #startDigits from anywhere to start a new unique container
|
Portainer
Portainer is a simple management solution for Docker. Easily manage your Docker hosts and Docker Swarm clusters via Portainer web user interface.
[root@c101086 ~]# docker images | grep portainer
|
Initial portainer container instance password configured on the system is : password@1
If the portainer container was removed, then the end user will have to supply a new password for the new container instance.
Initial Startup / Configure for a new instance of Portainer:
Type in a password for admin
Click on Create User to continue.
Dashboard View -
provides an overview of the container(s) running on the systems, along with the related volumes and network info.
Portainer Containers view -
Overview of loaded containers status, and control / manage of the containers
Portainer Images View -
Overview of pulled images on the system, or download (pull) additional images available at the DockerHub Registry
Rapids Container and Notebook Server
NOTE: This will run JupyterLab on port 8888 on your host machine.
Command:
- docker run --runtime=nvidia --rm -it -p 8888:8888 -p 8787:8787 -p 8786:8786 nvcr.io/nvidia/rapidsai/rapidsai:cuda10.1-runtime-ubuntu18.04
- utils/start-jupyter.sh
[root@c105017 ~]# docker run --runtime=nvidia --rm -it -p 8888:8888 -p 8787:8787 -p 8786:8786 nvcr.io/nvidia/rapidsai/rapidsai:cuda10.1-runtime-ubuntu18.04 ## Starting jupyter service (rapids) root@712e75ae4a0e:/rapids/notebooks# bash utils/start-jupyter.sh jupyter-lab --allow-root --ip=0.0.0.0 --no-browser --NotebookApp.token='' [I 19:26:58.713 LabApp] Writing notebook server cookie secret to /root/.local/share/jupyter/runtime/notebook_cookie_secret [W 19:26:58.951 LabApp] All authentication is disabled. Anyone who can connect to this server will be able to run code. [I 19:26:58.964 LabApp] JupyterLab extension loaded from /conda/envs/rapids/lib/python3.6/site-packages/jupyterlab [I 19:26:58.964 LabApp] JupyterLab application directory is /conda/envs/rapids/share/jupyter/lab [W 19:26:58.966 LabApp] JupyterLab server extension not enabled, manually loading... [I 19:26:58.968 LabApp] JupyterLab extension loaded from /conda/envs/rapids/lib/python3.6/site-packages/jupyterlab [I 19:26:58.968 LabApp] JupyterLab application directory is /conda/envs/rapids/share/jupyter/lab [I 19:26:58.969 LabApp] Serving notebooks from local directory: /rapids/notebooks [I 19:26:58.969 LabApp] The Jupyter Notebook is running at: [I 19:26:58.969 LabApp] http://(712e75ae4a0e or 127.0.0.1):8888/ [I 19:26:58.969 LabApp] Use Control-C to stop this server and shut down all kernels (twice to skip confirmation). [I 19:27:29.919 LabApp] 302 GET / (172.25.10.173) 1.71ms [W 19:27:30.730 LabApp] Could not determine jupyterlab build status without nodejs [W 19:27:30.925 LabApp] 404 GET /lab/api/workspaces/lab?1549654049120 (172.25.10.173): Workspace 'lab' ('lab-a511') not found [W 19:27:30.925 LabApp] Workspace 'lab' ('lab-a511') not found [W 19:27:30.926 LabApp] 404 GET /lab/api/workspaces/lab?1549654049120 (172.25.10.173) 1.45ms referer=http://172.25.10.206:8888/lab?
Screen shot - <host IP>:8888
To exit, select Shutdown from the File Menu:
Tensorflow:
NOTE: This will start Tensorflow container and switch to interactive console:
Command:
docker run --runtime=nvidia -it nvcr.io/nvidia/tensorflow:19.12-tf2-py3 bash
Please read the README.MD inside of the container for detail, or visit www.tensorflow.org for more information
Note:
Docker version earlier then 19.03 with nvidia-docker2 installed will need to use --runtime=nvidia flag for the NVIDIA GPU support in the container.
Docker version 19.03 and later with nvidia-container-toolkit installed will need to use ---gpus all flag for the NVIDIA GPU support in the container.
For Additional Technical Support, Please contact us at: www.exxactcorp.com/support