Make an ephemeral hub

Make an ephemeral hub#

We can support users who want a mybinder.org type experience, but with better resources & faster startup. They get redirected to us when the public mybinder.org deployment can not support them (like this), or just because they want this experience.

The primary features offered would be:

No per-user authentication required.
A shared, systemwide password is present to protect against cryptobros abusing these resources.
No persistent storage
Pre-pulled images, for faster startup.

(1) and (3) also help reduce the amount of user data we store, reducing data privacy issues as well.

The limitations of this set up are:

No users means no admin users, so the JupyterHub configurator is unavailable. All config must be set in our config files, and deployed via GitHub.
No home page is visible, so our home page customizations do not work.
We do not cull users, because that would cause problems with counting active users. This is a trade-off, as if we end up with a huge list of users, it might slow down hub deployments.

Authentication with `tmpauthenticator`#

We will use tmpauthenticator to automatically create temporary users whenever any user comes to the hub. They will automatically get UUIDs assigned.

jupyterhub:
  hub:
    config:
      JupyterHub:
        authenticator_class: tmp
      Authenticator:
        allow_all: True

No persistent home directory#

As users are temporary and can not be accessed again, there is no reason to provide persistent storage. So we turn it all off - particularly the home directories.

# nfs functionality explicitly disabled in case a common.values.yaml
# file is used to enable it for all hubs in the cluster
nfs:
  enabled: false
  pv:
    enabled: false

jupyterhub-home-nfs:
  enabled: false

jupyterhub:
  custom:
    singleuserAdmin:
      # Turn off trying to mount shared-readwrite folder for admins
      extraVolumeMounts:
  singleuser:
    initContainers: []
    storage:
      # No persistent storage should be kept to reduce any potential data
      # retention & privacy issues.
      type: none
      extraVolumeMounts:
      extraVolumes:

(Optional) Sharing `shared` directories from another hub with an ephemeral hub#

In some specific cases, we may need to share a shared directory from another hub on the same cluster with the ephemeral hub. The ‘source’ hub whose shared directory we mount may be used to provide common data files, teaching materials, etc for the ephemeral hub’s users.

Setup the PersistentVolume in the ephemeral hub’s config to point to the same NFS share that the ‘source’ hub is pointing to, with the following config:
```
# nfs functionality enabled for this ephemeral hub to mount
# a shared folder from another hub in the cluster
nfs:
  enabled: true
  dirsizeReporter:
    enabled: false
  pv:
    enabled: true
    mountOptions: <copied-from-source-hub>
    serverIP: <copied-from-source-hub>
    baseShareName: <copied-from-source-hub>
    shareNameOverride: <name-of-source-hub>
```
A few options should copied from the config of the ‘source’ hub, and shareNameOverride should be set to whatever is the name of the ‘source’ hub in cluster.yaml.

When deployed, this should set up a new PersistentVolume for the ephemeral hub to use that references the same NFS share of the ‘source’ hub. You can validate this by comparing them:
```
# Get the source hub's NFS volume
kubectl get pv <source-hub-name>-home-nfs -o yaml
# Get the ephemeral hub's NFS volume
kubectl get pv <ephemeral-hub-name>-home-nfs -o yaml
```
The section under spec.nfs should match for both these PersistentVolume options.

Note

If you want to learn more about how this is setup, look into helm-charts/basehub/templates/nfs.yaml

Mount just the shared directory appropriately:

jupyterhub:
  singleuser:
    storage:
      # We still don't want to have per-user storage
      type: none
      extraVolumes:
        1-shared-dir-volume:
          name: shared-dir-pv
          persistentVolumeClaim:
            claimName: home-nfs
      extraVolumeMounts:
        1-shared-readonly-volumemount:
          name: shared-dir-pv
          mountPath: /home/jovyan/shared-readonly
          subPath: _shared
          readOnly: true

This will mount the shared directory from the ‘source’ hub under shared in the ephemeral hub - so admins can write stuff to the shared-readwrite directory in the ‘source’ hub and it’ll immediately show up here! It’s mounted to be read-only - since there are no real ‘users’ in an ephemeral hub, if we make it readwrite, it can be easily deleted (accidentally or intentionally) with no accountability.

Image configuration in chart#

The image needs to be specified in the chart directly and not via the JupyterHub configurator because with tmpauthenticator we can’t distinguish admin users to have such rights without providing it to every user.

jupyterhub:
  singleuser:
    # image could also be configured via singleuser.profileList configuration
    image:
      name: <image-name>
      tag: <tag>

Enable hook pre-puller & disable JupyterHub#

Startup time is very important in ephemeral hubs, so the hook pre-puller can be enabled.

jupyterhub:
  prePuller:
    hook:
      enabled: true

Disabling home page customizations#

tmpauthenticator doesn’t actually show the home page - it just launches users directly into the notebook server. This means our home page customizations are not applied anywhere. So we set them to empty strings.

jupyterhub:
  custom:
    homepage:
      # tmpauthenticator does *not* show a home page by default,
      # so these are not visible anywhere. But our schema requires we set
      # them to strings, so we specify empty strings here.
      templateVars:
        org:
          name: ""
          url: ""
          logo_url: ""
        designed_by:
          name: ""
          url: ""
        operated_by:
          name: ""
          url: ""
        funded_by:
          name: ""
          url: ""

Use `nbgitpuller` for distributing content#

We encourage users to use nbgitpuller for distributing content. This allows creation of a specific link that will put users who click it on a specific notebook with a specific UI (such as lab, classic notebook, RStudio, etc).

The nbgitpuller link generator supports mybinder.org style links, but for use with ephemeral hubs, just use the regular ‘JupyterHub’ link generator. Firefox and Google Chrome extensions are also available.