Add a `diff` subcommand to the evobench evaluation tool that takes two result datasets and prints the percentage differences between them. Needs further refinement.
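As a rough illustration of what "percentage differences" could mean here, the following is a minimal sketch and not the evobench implementation. It assumes each result dataset can be reduced to a mapping from a measurement name to a single numeric value, and that the difference is reported relative to the first dataset as `(candidate - baseline) / baseline * 100`. The function names, the dataset shape, and the output format are all hypothetical.

```python
from typing import Dict, List


def percentage_difference(baseline: float, candidate: float) -> float:
    """Relative change of candidate against baseline, in percent.

    Assumed formula: (candidate - baseline) / baseline * 100.
    """
    if baseline == 0:
        raise ValueError("baseline must be non-zero to compute a relative change")
    return (candidate - baseline) / baseline * 100.0


def diff_datasets(
    baseline: Dict[str, float], candidate: Dict[str, float]
) -> Dict[str, float]:
    """Map each key present in both (hypothetical) datasets to its percentage difference."""
    return {
        key: percentage_difference(baseline[key], candidate[key])
        for key in baseline
        if key in candidate
    }


def format_diff(diffs: Dict[str, float]) -> List[str]:
    """Render one signed line per key, sorted by key for stable output."""
    return [f"{key}: {value:+.2f}%" for key, value in sorted(diffs.items())]


if __name__ == "__main__":
    # Made-up numbers, for illustration only.
    old = {"parse": 200.0, "render": 50.0}
    new = {"parse": 220.0, "render": 45.0}
    for line in format_diff(diff_datasets(old, new)):
        print(line)
```

Keys that appear in only one dataset are skipped silently in this sketch. A real subcommand would probably want to report them, which may be one of the points still needing refinement.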