See the discussion in https://github.com/TabArena/tabarena_benchmarking_examples/issues/6 Add documentation or work towards this goal.