July 28, 2015

HPC Made Easy

Updated on 01.03.2018

The Problem

Let's consider the problem of performing a lot of numerical experiments using some third-party solver to generate the data to build an approximation model. Each solver’s run may be time-consuming and performing multiple calculations on a single computer can take plenty of time.

The calculations can be speeded up by using computer clusters. pSeven provides integration with the following types of cluster resource manager software:

  • TORQUE
  • SLURM
  • LSF / OpenLava

Integration with computer clusters is performed by Program block which also supports Job Array mechanism simplifying management of collections of similar tasks.

The Workflow

To solve the problem we use 2.04 General - Approximation model training from pSeven General Examples. The workflow generates a data sample which is used both as the solver input sample and the input part of the model training sample.

The workflow:

The Solver is a Composite block containing the workflow to process a single vector of input data.

The Solver block expanded:

The option “Parallel execution” of Solver block is enabled and this allows the block to automatically process the input sample matrix as a list of input vectors. The option also defines how many instances of the internal workflow will be run at the same moment.

The Launcher block command:

How to Speed Up the Workflow

If the solver requires many CPU/memory resources then running multiple solvers on one computer will be not very effective.

Let’s modify the settings of Launcher block:

  • batch manager: “slurm”
  • ssh connection information (“Host”, “Port”, “Username”, “Password”)
  • synchronization: (“local to remote”)

Now Launcher block will run the solver on the cluster automatically uploading input files and downloading output results. Due to parallel execution of Launcher instances, the solver on a cluster will be run in parallel too.

Conclusion

Using parallel execution of composite blocks along with running the solver on computer cluster is an easy way to speed up the workflow by just a few clicks. You may increase performance even further, by making the workflow prepare all input files before submitting the job to the cluster and making Program block run a whole bunch of calculations via a mechanism called Job Array.

 

By Stanislav Markevich, Senior Software Engineer, DATADVANCE

LinkedIn
VK

Interested in the solution?

Click to request a free 30-day demo.

Request demo