Skip to content

Using Julia on the OSPool

Overview

This guide provides an introduction to running Julia code on the Open Science Pool. The Quickstart Instructions provide an outline of job submission. The following sections provide more details about installing Julia packages (Install Julia Packages) and creating a complete job submission (Submit Julia Jobs). This guide assumes that you have a script written in Julia and can identify the additional Julia packages needed to run the script.

If you are using many Julia packages or have other software dependencies as part of your job, you may want to manage your software via a container instead of using the tar.gz file method described in this guide. The Research Computing Facilitation (RCF) team maintains a Julia container that can be used as a starting point for creating a customized container with added packages, or you can build your container from scratch.

Option 1: Build a Container

Containers are an efficient way of ensuring a consistent software environment across different machines.

Identify Components

  • A list of your Julia packages or the Project.toml and Manifest.toml
  • A list of other dependencies, such as.
  • System packages/libraries
  • Git repositories

Build Julia in Apptainer

To build an Apptainer image, follow our general guide and use one of the definition files listed below as a starting point. You can edit the definition file to create your custom software environment.

Example definition file: installing from a list of packages

Bootstrap: docker
From: julia:1.10

%post
    # Install any needed system packages first
    apt-get update -y
    apt-get install -y wget \
            git \
            cmake \
            g++

    # Install Julia packages
    export JULIA_DEPOT_PATH="/opt/julia"
    julia -e 'using Pkg; Pkg.add(["DifferentialEquations", "DataFrames", "StaticArrays"]); Pkg.instantiate(); Pkg.precompile()'

%environment
    export JULIA_DEPOT_PATH=":/opt/julia"

Example definition file: installing from a Project.toml and Manifest.toml

Bootstrap: docker
From: julia:1.10

%files
   Project.toml /opt/julia/environments/v1.10/
   Manifest.toml /opt/julia/environments/v1.10/

%post
    # Install any needed system packages first
    apt-get update -y
    apt-get install -y wget \
            git \
            cmake \
            g++

    # Install Julia packages
    export JULIA_DEPOT_PATH="/opt/julia"
    julia -e 'using Pkg; Pkg.instantiate(); Pkg.precompile()'

%environment
    export JULIA_DEPOT_PATH=":/opt/julia"

Build Julia in Docker

The OSPool uses Apptainer/Singularity to execute containers. It is recommended that if you are building your own custom container, you use the Apptainer/Singularity image defintion format, as Docker images will need to be converted to Apptainer/Singularity images. However, you can still build a Julia environment within a Docker image for use on the OSPool, following this guide. Edit the Dockerfile to create your custom software environment.

Example Dockerfile: installing a list of packages

FROM julia:1.10

RUN apt-get update -y
RUN apt-get install -y wget git cmake g++

ENV JULIA_DEPOT_PATH=":/opt/julia"

RUN julia -e 'using Pkg; Pkg.add(["DifferentialEquations", "DataFrames", "StaticArrays"]); Pkg.instantiate(); Pkg.precompile()'

Example Dockerfile: installing from a Project.toml or Manifest.toml

FROM julia:1.10

RUN apt-get update -y
RUN apt-get install -y wget git cmake g++

ENV JULIA_DEPOT_PATH=":/opt/julia"

ADD Project.toml Manifest.toml /opt/julia/environments/v1.10/

RUN julia -e 'using Pkg; Pkg.instantiate(); Pkg.precompile()'

Submit Jobs with Julia Container Images

To use a Julia container in a job, you'll need to move the container to your OSPool data directory, and specify the container image in your job's submit file, like below:

+SingularityImage = "osdf:///ospool/[AP]/data/username/julia.sif"

Read more about submitting container jobs.

HTCondor will launch your job inside the software environment you built. You can run Julia code normally without any special considerations.

Example executable script:

#!/bin/bash

# Execute Julia code
julia -e 'using DataFrames; x = DataFrame(a=1:4, b=["M", "F", "F", "M"]); show(x)'

Option 2: Build a Portable Project

You can opt for a one-time installation of packages within a Julia project. A copy of the project can then be saved for use in subsequent job submissions. For more details, please see Julia's documentation at Julia Pkg.jl.

Download Julia and set up a "project"

If you have not already downloaded a copy of Julia, download the precompiled Julia software from https://julialang.org/downloads/. You will need the 64-bit, tarball compiled for general use on a Linux x86 system. The file name will resemble something like julia-#.#.#-linux-x86_64.tar.gz.

We will need a copy of the original tar.gz file for running jobs, but to install packages, we also need an unpacked version of the software. Run the following commands to extract the Julia software and add Julia to your PATH:

$ tar -xzf julia-#.#.#-linux-x86_64.tar.gz
$ export PATH=$PWD/julia-#.#.#/bin:$PATH

After these steps, you should be able to run Julia from the command line, e.g.

$ julia --version

Now create a project directory to install your packages (we've called it my-project/ below) and tell Julia its name:

$ mkdir my-project
$ export JULIA_DEPOT_PATH=$PWD/my-project

If you already have a directory with Julia packages on the login node, you can add to it by skipping the mkdir step above and going straight to setting the JULIA_DEPOT_PATH variable.

You can choose whatever name to use for this directory -- if you have different projects that you use for different jobs, you could use a more descriptive name than "my-project".

Install Packages

We will now use Julia to install any needed packages to the project directory we created in the previous step.

Open Julia with the --project option set to the project directory:

$ julia --project=my-project

Once you've started up the Julia REPL (interpreter), start the Pkg REPL, used to install packages, by typing ]. Then install and test packages by using Julia's add Package syntax.

               _
   _       _ _(_)_     |  Documentation: https://docs.julialang.org
  (_)     | (_) (_)    |
   _ _   _| |_  __ _   |  Type "?" for help, "]?" for Pkg help.
  | | | | | | |/ _` |  |
  | | |_| | | | (_| |  |  Version 1.0.5 (2019-09-09)
 _/ |\__'_|_|_|\__'_|  |  Official https://julialang.org/ release
|__/                   |

julia> ]
(my-project) pkg> add Package
(my-project) pkg> test Package

If you have multiple packages to install they can be combined into a single command, e.g. (my-project) pkg> add Package1 Package2 Package3.

If you encounter issues getting packages to install successfully, please contact us at [email protected]

Once you are done, you can exit the Pkg REPL by typing the DELETE key and then typing exit()

(my-project) pkg> 
julia> exit()

Your packages will have been installed to the my_project directory; we want to compress this folder so that it is easier to copy to jobs.

$ tar -czf my-project.tar.gz my-project/

Submit Julia Jobs

To submit a job that runs a Julia script, create a bash script and HTCondor submit file following the examples in this section. These example assume that you have downloaded a copy of Julia for Linux as a tar.gz file and if using packages, you have gone through the steps above to install them and create an additional tar.gz file of the installed packages.

Create Executable Bash Script

Your job will use a bash script as the HTCondor executable. This script will contain all the steps needed to unpack the Julia binaries and execute your Julia script (script.jl below). What follows are two example bash scripts, one which can be used to execute a script with base Julia only, and one that will use packages you installed to a project directory (see Install Julia Packages).

Example Bash Script For Base Julia Only

If your Julia script can run without additional packages (other than base Julia and the Julia Standard library) use the example script directly below.

#!/bin/bash

# julia-job.sh

# extract Julia tar.gz file
tar -xzf julia-#.#.#-linux-x86_64.tar.gz

# add Julia binary to PATH
export PATH=$_CONDOR_SCRATCH_DIR/julia-#.#.#/bin:$PATH

# run Julia script
julia script.jl

Example Bash Script For Julia With Installed Packages

#!/bin/bash

# julia-job.sh

# extract Julia tar.gz file and project tar.gz file
tar -xzf julia-#.#.#-linux-x86_64.tar.gz
tar -xzf my-project.tar.gz

# add Julia binary to PATH
export PATH=$_CONDOR_SCRATCH_DIR/julia-#.#.#/bin:$PATH
# add Julia packages to DEPOT variable
export JULIA_DEPOT_PATH=$_CONDOR_SCRATCH_DIR/my-project

# run Julia script
julia --project=my-project script.jl

Create HTCondor Submit File

After creating a bash script named julia-job.sh to run Julia, then create a submit file to submit the job.

More details about setting up a submit file, including a submit file template, can be found in our quickstart guide: Quickstart Tutorial

# File Name = julia-job.sub

executable = julia-job.sh

transfer_input_files = julia-#.#.#-linux-x86_64.tar.gz, script.jl
should_transfer_files   = Yes
when_to_transfer_output = ON_EXIT

output        = job.$(Cluster).$(Process).out
error         = job.$(Cluster).$(Process).error
log           = job.$(Cluster).$(Process).log

+JobDurationCategory = "Medium"

requirements   = (OSGVO_OS_STRING == "RHEL 9")
request_cpus   = 1
request_memory = 2GB
request_disk   = 2GB

queue 1

If your Julia script needs to use packages installed for a project, be sure to include my-project.tar.gz as an input file in julia-job.sub. For project tarballs that are <1 GB, you can follow the below example:

transfer_input_files = julia-#.#.#-linux-x86_64.tar.gz, script.jl, my-project.tar.gz

Modify the CPU/memory request lines to match what is needed by the job. Test a few jobs for disk space/memory usage in order to make sure your requests for a large batch are accurate! Disk space and memory usage can be found in the log file after the job completes.