-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Labels
Description
Enhancement suggestion
Currently, you would need to create a wrapper script with a high array count to easily parallelise the peak comparison tool. Peak comparing multiple regions is embarrassingly parallel but the tool doesn't lend well to this. The main problem is that each run through reads the exact same data (and data reading is the bottleneck here).
It would be good to allow a vector of start and end values instead of individual values. From here, you'd only need to read in files once (and if they are all on one chromosome, you can continue to use grep and other faster tools to reduce file size).
If you were to allow vectors of start and end values then you can create arrays of BedBase objects which should allow for parallelisation within python.
Reactions are currently unavailable