Introduction to containers

Overview

Teaching: 10 min
Exercises: 5 min
Questions
Objectives
  • Define the term “container”

  • Discuss when you would benefit from using containers in your workflow

Apps vs Containers

To use software on a computer traditionally you were required to install the software. This means placing files on the filesystem, libraries in folders, configuration files in the correct place etc. Applications that have a setup program help by placing the files in the right place. This can lead to problems with conflicting files. What if two packages need to install the Widget library but App A needs Widget version 1 and App B needs Widget version 2? On a HPC, managing the installed software for hundreds or thousands of users becomes a real challenge.

When people use packages like Python or R, they typically need a great deal of libraries to also be installed so their scripts can run.

A container is an entity providing an isolated software environment (or filesystem) for an application and its dependencies.

Apps vs Containers

The key difference here is a container will have all the libraries and dependent apps needed to run the package. It does this by bundling the App, libraries and operation system into a single image.

The benefits are:

Containers and your workflow

There are a number of reasons for using containers in your daily work:

A few examples of how containers are being used include:

Terminology

An image is a file (or set of files) that contains the application and all its dependencies, libraries, run-time systems, etc. required to run. You can copy images around, upload them, download them etc.

A container is an instantiation of an image. That is, the image in a running state. You can run multiple containers from the same image, much like you might run the same application with different options or arguments.

In abstract, an image corresponds to a file, a container corresponds to a process.

A registry is a place where images are stored and can be accessed by users. It can be public (e.g. Docker Hub) or private.

To build an image we need a recipe. A recipe file is called a Definition File, or def file, in the Singularity jargon and a Dockerfile in the Docker world.

Container engines

A number of tools are available to create, deploy and run containerised applications. We will only be using Singularity in this lesson:

Key Points

  • Containers enable you to package up an application and its dependencies.

  • By using containers, you can better enforce reproducibility, portability and share-ability of your computational workflows.