Batch Experiments for mlr3 • mlr3batchmark

A connector between mlr3 and batchtools. This allows to run large-scale benchmark experiments on scheduled high-performance computing clusters.

The package comes with two core functions for switching between mlr3 and batchtools to perform a benchmark:

After creating a design object (as required for mlr3’s benchmark() function), instead of benchmark() call batchmark() which populates an ExperimentRegistry for the computational jobs of the benchmark. You are now in the world of batchtools where you can selectively submit jobs with different resources, monitor the progress or resubmit as needed.
After the computations are finished, collect the results with reduceResultsBatchmark() to return to mlr3. The resulting object is a regular BenchmarkResult.

Example

library("mlr3")
library("batchtools")
library("mlr3batchmark")
tasks = tsks(c("iris", "sonar"))
learners = lrns(c("classif.featureless", "classif.rpart"))
resamplings = rsmp("cv", folds = 3)

design = benchmark_grid(
  tasks = tasks,
  learners = learners,
  resamplings = resamplings
)

reg = makeExperimentRegistry(NA)

## No readable configuration file found

## Created registry in '/tmp/RtmpS8hWnq/registry73a7b42e20d' using cluster functions 'Interactive'

ids = batchmark(design, reg = reg)

## Adding algorithm 'run_learner'

## Adding problem 'abc694dd29a7a8ce'

## Exporting new objects: '9e46aff6e4cf00b1' ...

## Exporting new objects: 'c555f9dfec9c1e4f' ...

## Exporting new objects: '02253ecc9afd614a' ...

## Exporting new objects: 'ecf8ee265ec56766' ...

## Overwriting previously exported object: 'ecf8ee265ec56766'

## Adding 6 experiments ('abc694dd29a7a8ce'[1] x 'run_learner'[2] x repls[3]) ...

## Adding problem 'f9791e97f9813150'

## Exporting new objects: '2c33cdf2caba8316' ...

## Adding 6 experiments ('f9791e97f9813150'[1] x 'run_learner'[2] x repls[3]) ...

submitJobs()

## Submitting 12 jobs in 12 chunks using cluster functions 'Interactive' ...

getStatus()

## Status for 12 jobs at 2026-07-02 11:49:27:
##   Submitted    : 12 (100.0%)
##   -- Queued    :  0 (  0.0%)
##   -- Started   : 12 (100.0%)
##   ---- Running :  0 (  0.0%)
##   ---- Done    : 12 (100.0%)
##   ---- Error   :  0 (  0.0%)
##   ---- Expired :  0 (  0.0%)

reduceResultsBatchmark()

## 
## ── <BenchmarkResult> of 12 rows with 4 resampling run ──────────────────────────
##  nr task_id          learner_id resampling_id iters warnings errors
##   1    iris classif.featureless            cv     3        0      0
##   2    iris       classif.rpart            cv     3        0      0
##   3   sonar classif.featureless            cv     3        0      0
##   4   sonar       classif.rpart            cv     3        0      0

Resources

The Large-Scale Benchmarking chapter of the mlr3 book

mlr3batchmark

Example

Resources

Links

License

Citation

Developers