Getting started on NeSI¶
Install RJM¶
On Windows you could download the executables from the latest release on the GitHub repository.
Otherwise, install the rjm python package, for example using a virtual environment and installing the main branch from the git repo:
python -m venv venv
source venv/bin/activate
python -m pip install git+https://github.com/chrisdjscott/RemoteJobManager
Prerequisites for running the setup script¶
In order to proceed with the setup you will need the following:
NeSI credentials (username, first and second factor passwords)
NeSI project code that you belong to (e.g. uoa00106)
Globus account (https://app.globus.org/)
You can usually sign up with your institutional credentials (e.g. University of Auckland) for convenience but it is not essential
Note: during the setup you may be asked to authenticate multiple times and in some cases a browser window may be opened automatically for you to authenticate in and in other cases you may need to copy a link to a browser manually and authenticate there. Please follow the instructions that show up when running the setup script.
Linking Globus and NeSI accounts¶
You may be asked for a linked identity with the NeSI Wellington OIDC Server. If you already have a linked identity it should appear in the list like <username>@wlg-dtn-oidc.nesi.org.nz (where <username> is your NeSI username). Otherwise, you can follow the instructions to Link an identity from NeSI Wellington OIDC Server.
Run the NeSI setup script (once per machine)¶
Running the NeSI setup script with no arguments (rjm_config
) will
do the following:
configure and start a "funcX endpoint" on NeSI, which RJM will use to execute commands on NeSI
setup a periodic task (via scrontab) to check that the funcX endpoint is running and restart it if it has stopped
create a "Globus Guest Collection" in your NeSI nobackup directory, which will be used to transfer files to and from NeSI
write configutation values from the above steps into the RJM config file on your local machine
obtain the required authentication tokens so that you can start using RJM (tokens are cached on the local machine so you should not need to reauthenticate on the same machine)
When you run the setup script you will need to input values and authenticate multiple times, which can't be avoided at this time unfortunately. In some cases a browser window will open where you will need to do the authentication, in other cases you will enter details directly at the terminal. You will need to authenticate with NeSI (first and sector factor) and with Globus, which can usually be done using your institutional credentials, e.g. University of Auckland.
Note: you only need to run this script once per machine.
Run a simple test¶
Health check script¶
The health check script tests that basic functionality is working by uploading a file to NeSI and executing a simple command.
rjm_health_check
If should exit successfully with no errors if things are working.
Example simulation¶
Clone the repository and run the example simulation using the steps below:
git clone https://github.com/chrisdjscott/RemoteJobManager.git
cd RemoteJobManager/examples/nonmem
rjm_batch_submit -f localdirs.txt -ll info
rjm_batch_wait -f localdirs.txt -ll info
Note: for the above to work you need to either make sure the RJM executables are in your PATH or provide the full path to them.
If the above worked there should be a file created at files/output.zip.
You should also notice the file files/remote_job.json. This file records the
progress of the remote job. If you try to run a simulation again in the same
directory, RJM should detect that it already ran before and won't repeat the
same steps again. You can override this behaviour by passing the --force
option to rjm_batch_submit
or delete the remote_job.json file.