Installing JupyterHub¶
Now that we have a Kubernetes cluster and Helm setup, we can proceed by using Helm to install JupyterHub and related Kubernetes resources using a Helm chart.
Prepare configuration file¶
In this step we will prepare a YAML
configuration file that we will refer to as config.yaml
. It will contain the multiple
Helm values to be provided to a JupyterHub Helm chart developed
specifically together with this guide.
Helm charts contains templates that with provided values will render to Kubernetes resources to be installed in a Kubernetes cluster. This config file will provide the values to be used by our Helm chart.
Generate a random hex string representing 32 bytes to use as a security token. Run this command in a terminal and copy the output:
openssl rand -hex 32
Create and start editing a file called
config.yaml
. In the code snippet below we start the widely available nano editor, but any editor will do.nano config.yaml
Write the following into the
config.yaml
file but instead of writing<RANDOM-HEX>
paste the generated hex string you copied in step 1.proxy: secretToken: "<RANDOM_HEX>"
Note
It is common practice for Helm and Kubernetes YAML files to indent using two spaces.
Save the
config.yaml
file. In the nano editor this is done by pressing CTRL+X or CMD+X followed by a confirmation to save the changes.
Install JupyterHub¶
Make Helm aware of the JupyterHub Helm chart repository so you can install the JupyterHub chart from it without having to use a long URL name.
helm repo add jupyterhub https://jupyterhub.github.io/helm-chart/ helm repo update
This should show output like:
Hang tight while we grab the latest from your chart repositories... ...Skip local chart repository ...Successfully got an update from the "stable" chart repository ...Successfully got an update from the "jupyterhub" chart repository Update Complete. ⎈ Happy Helming!⎈
Now install the chart configured by your
config.yaml
by running this command from the directory that contains yourconfig.yaml
:# Suggested values: advanced users of Kubernetes and Helm should feel # free to use different values. RELEASE=jhub NAMESPACE=jhub helm upgrade --cleanup-on-fail \ --install $RELEASE jupyterhub/jupyterhub \ --namespace $NAMESPACE \ --create-namespace \ --version=0.9.0 \ --values config.yaml
where:
RELEASE
refers to a Helm release name, an identifier used to differentiate chart installations. You need it when you are changing or deleting the configuration of this chart installation. If your Kubernetes cluster will contain multiple JupyterHubs make sure to differentiate them. You can list your Helm releases withhelm list
.NAMESPACE
refers to a Kubernetes namespace, an identifier used to group Kubernetes resources, in this case all Kubernetes resources associated with the JupyterHub chart. You’ll need the namespace identifier for performing any commands withkubectl
.
Note
This step may take a moment, during which time there will be no output to your terminal. JupyterHub is being installed in the background.
If you get a
release named <YOUR-RELEASE-NAME> already exists
error, then you should delete the release by runninghelm delete <YOUR-RELEASE-NAME>
. Then reinstall by repeating this step. If it persists, also dokubectl delete namespace <YOUR-NAMESPACE>
and try again.In general, if something goes wrong with the install step, delete the Helm release by running
helm delete <YOUR-RELEASE-NAME>
before re-running the install command.If you’re pulling from a large Docker image you may get a
Error: timed out waiting for the condition
error, add a--timeout=<NUMBER-OF-MINUTES>m<NUMBER-OF-SECONDS>s
parameter to thehelm install
command.The
--version
parameter corresponds to the version of the Helm chart, not the version of JupyterHub. Each version of the JupyterHub Helm chart is paired with a specific version of JupyterHub. E.g.,0.7.0
of the Helm chart runs JupyterHub0.9.2
. For a list of which JupyterHub version is installed in each version of the Z2JH Helm Chart, see the Helm Chart repository.
While Step 2 is running, you can see the pods being created by entering in a different terminal:
kubectl get pod --namespace jhub
Note
To remain sane we recommend that you enable autocompletion for kubectl and set a default value for the
--namespace
flag:kubectl config set-context $(kubectl config current-context) --namespace ${NAMESPACE:-jhub}
Wait for the hub and proxy pod to enter the
Running
state.Find the IP we can use to access the JupyterHub. Run the following command until the
EXTERNAL-IP
of theproxy-public
service is available like in the example output.kubectl get service --namespace jhub
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE hub ClusterIP 10.51.243.14 <none> 8081/TCP 1m proxy-api ClusterIP 10.51.247.198 <none> 8001/TCP 1m proxy-public LoadBalancer 10.51.248.230 104.196.41.97 80:31916/TCP 1m
Note
If the IP for
proxy-public
is too long to fit into the window, you can find the longer version by calling:kubectl describe service proxy-public --namespace jhub
To use JupyterHub, enter the external IP for the
proxy-public
service in to a browser. JupyterHub is running with a default dummy authenticator so entering any username and password combination will let you enter the hub.
Congratulations! Now that you have basic JupyterHub running, you can extend it and optimize it in many ways to meet your needs.
Some examples of customisations are:
Configure the login to use the account that makes sense to you (Google, GitHub, etc.).
Use a suitable pre-built image for the user container or build your own.
Host it on https://your-domain.com.