1. as the size of vcf increases with more strains, may need to create a version of vcf without annotation. 2. bcsq needs to be done after each time subsetting strains 3. channel definition could be simplified in workflow