The processing of CellProfiler output is written inefficiently. It uses find to identify the files to move from all the group subdirectories of a project, which can take 40 min on a larger, 16k-image dataset.
Quick suggestion in code/noncode, lol:
- cat */{model_name}.csv > unpublished file
- sed "remove header" lines from unpublished file > publish final .csv
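
A rough sketch of the two steps above, in case it helps. The function name `merge_group_csvs`, its arguments, and the `.unpublished` temp-file suffix are made up for illustration. It also assumes every group subdirectory holds a `{model_name}.csv` with the same one-line header. It swaps the cat + sed pair for a single awk pass, which keeps the first header and drops the repeats while concatenating:

```shell
# Hypothetical helper (not existing project code): merge per-group CSVs
# into one published file that has a single header row.
# Assumes the layout <project_dir>/<group>/<model_name>.csv and that all
# files share the same one-line header.
merge_group_csvs() {
    local project_dir="$1"
    local model_name="$2"
    local out_file="$3"
    local tmp_file="${out_file}.unpublished"

    # A shell glob lists the files directly, so no recursive find is needed.
    # awk prints everything except line 1 of every file after the first,
    # i.e. the repeated headers.
    awk 'FNR == 1 && NR != 1 { next } { print }' \
        "$project_dir"/*/"$model_name".csv > "$tmp_file" || {
        rm -f "$tmp_file"
        return 1
    }

    # Publish only once the merge has finished successfully.
    mv "$tmp_file" "$out_file"
}
```

Usage would be something like `merge_group_csvs /path/to/project my_model /path/to/project/my_model.csv`. The glob only looks one directory level down, which is what makes it cheaper than walking the whole tree with find.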