Releases: assemblerflow/flowcraft
Releases · assemblerflow/flowcraft
Spring cleaning
New features
- Added support for the report system to:
maxbin2
- Added new
manifest.configwith the pipeline metadata
New components
Kraken2: Taxonomic identification on FastQ files
Bug fixes
- Fix bug in
mompscomponent related to added in the introduction of the clear input parameter - Fixed bug with the
-ftparameters not retrieving the dockerhub tags for
all the components. - Fixed bug in the
megahitprocess where the fastg mode would break the process - Fix inspect and report mode to fetch the nextflow file independently of its
position in thenextflow runcommand inside the .nextflow.log file. - Fix parsing of .nextflow.log file when searching for
nextflow runcommand. - Fixed bug between mash_sketch_fasta and mash_dist.
Minor/Other changes
- Added option to
dengue_typingto retrieve closest reference sequence and link it
with a secondary channel intomafft - New version of DEN-IM recipe
- Now prints an ordered list of components
- Moved taxonomy results from
results/annotation/to `results/taxonomy/
Now prints an ordered list of components
Moved taxonomy results from results/annotation/ to results/taxonomy/
Grandma's recipes
New features
- Added new
recipesystem to flowcraft along with 6 starting recipes.
Recipes are pre-made and curated pipelines that address specific questions.
To create a recipe, the-r <recipe_name>can be used. To list available
recipes, the--recipe-listand--recipe-list-shortoptions were added. - Added
-ftor--fetch-tagswhich allows to retrieve all DockerHub
container tags. - Added function to collect all the components from the components classes,
replacing the current process_map dictionary implementation. Now, it will be
generated from the engine rather than hardcoded into the dict.
Components changes
- Added new
disableRRparam in thespadescomponent that disables repeat
resolution - The
abyssandspadescomponents emit GFA in a secondary channel. - The new
bandagecomponent can accept either FASTA from a primary channel
or GFA from a secondary channel. - Updated skesa to version 2.3.0.
- Updated mash based components for the latest version - 1.6.0-1.
New components
- Added component
abyss. - Added component
bandage. - Added component
unicycler. - Added component
prokka. - Added component
bcalm. - Added component
diamond.
Minor/Other changes
- Added removal of duplicate IDs from
reads_downloadcomponent input. - Added seed parameter to
downsample_fastqcomponent. - Added bacmet database to
abricatecomponent. - Added default docker option to avoid docker permission errors.
- Changed the default URL generated by inspect and report commands.
- Added directives to
-Lparameter of build module.
Bug fixes
- Fixed forks with same source process name.
- Fixed
inspectissue when tasks took more than a day in duration. - Added hardware address to
inpsectandreporthash.
Clear!
Features
- Added a new
clearInputparameter to components that change their input.
The aim of this option is to allow the controlled removal of temporary files,
which is particularly useful in very large workflows.
Components changes
- Updated images for components
mash_dist,mash_screenand
mapping_patlas.
New components
- Added component
fast_ani.
Minor/Other changes
- Added
--export-directivesoption tobuildmode to export component's
directives in JSON format to standard output. - Added more date information in
inspectmode, including the year and the
locale of the executing system.
Report!!
Introducing the new Report system that is hosted by the Flowcraft web application. With the new report mode, it is now possible to visualize the reports of a flowcraft pipeline in the browser, either at the end of the pipeline run or in real-time while the pipeline is in execution.
Features
- Added
reportrun mode to Flowcraft that displays the report of any given
pipeline in the Flowcraft's web application. Thereportmode can be executed
after a pipeline ended or during the pipeline execution using the--watch
option. - Added standalone report HTML at the end of the pipeline execution.
- Components with support for the new report system:
abricateassembly_mappingcheck_coveragechewbbacadengue_typingfastqcfastqc_trimmomaticintegrity_coveragemlstpatho_typingpilonprocess_mappingprocess_newickprocess_skesaprocess_spadesprocess_viral_assemblyseq_typingtrimmomatictrue_coverage
Minor/Other changes
- Refactored report json for components
mash_dist,mash_screenand
mapping_patlas
Bug fixes
- Fixed issue where
seq_typingandpatho_typingprocesses were not feeding
report data to report compiler. - Fixed fail messages for
process_assemblyandprocess_viral_assembly
components
Indriya
Components changes
mapping_patlas: refactored to remove temporary files used to create
sam and bam files and added data to .report.json. Updated databases to pATLAS
version 1.5.2.mash_screenandmash_dist: added data to .report.json. Updated databases
to pATLAS version 1.5.2.
New components
- Added component
fasterq_dump - Added component
mash_sketch_fasta - Added component
mash_sketch_fastq - Added component
downsample_fastqfor FastQ read sub sampling using seqtk - Added component
mompsfor typing of Legionella pneumophila - Added component
split_assembly - Added component
mafft - Added component
raxml - Added component
viral_assembly - Added component
progressive_mauve - Added component
dengue_typing
Minor/Other changes
- Added check for
params.accessionsthat enables to report a proper
error when it is set tonull. - Added
buildoption to export component parameters information in JSON format. - Fixed minor issue preventing the
maxbin2andsplit_assemblycomponents
from being used multiples times in a pipeline - Added a catch to the
filter_polyprocess for cases where the input file is empty. - spades template now reports the exit code of spades' execution
Bug fixes
- Removed the need for the nf process templates to have an empty line
at the beginning of the template files. - Fixed issue when the
inspectmode was executed on a pipeline directory
with failed processes but with the work directory removed (the log files
where no longer available). - Fixed issue when the
inspectmode was executed on a pipeline without the
memory directory defined. - Fixed issue in the
inspectmode, where there is a rare race condition between
tags in the log and trace files. - Fixed bug on
midas_speciesprocess where the output file was not being
linked correctly, causing the process to fail - Fixed bug on
bowtiewhere the reference parameter was missing the pid - Fixed bug on
filter_polywhere the tag was missing
Param Param!
Improvements
- The parameter system has been revamped, and parameters are now component-specific
and independent by default. This allows a better fine-tuning of the parameters
and also the execution of the same component multiple times (for instance in a fork)
with different parameters. The old parameter system that merged identical parameters
is still available by using the--merge-paramsflag when building the pipeline. - Added a global
--clearAtCheckpointparameter that, when set to true, will remove
temporary files that are no longer necessary for downstream steps of the pipeline
from the work directory. This option is currently supported for thetrimmomatic,
fastqc_trimmomatic,skesaandspadescomponents.
New components
maxbin2: An automatic tool for binning metagenomic sequences.bowtie2: Align short paired-end sequencing reads to long reference
sequences.retrieve_mapped: Retrieves the mapped reads of a previous bowtie2 mapping process.
New recipes
plasmids: A recipe to perform mapping, mash screen on reads
and also mash dist for assembly based approaches (all to detect
plasmids). This also includes annotation with abricate for the assembly.plasmids_mapping: A recipe to perform mapping for plasmids.plasmids_mash: A recipe to perform mash screen for plasmids.plasmids_assembly: A recipe to perform mash dist for plasmid
assemblies.
Minor/Other changes
- Added "smart" check when the user provides a typo in pipeline string
for a given process, outputting some "educated" guesses to the
terminal.http://192.92.149.169:80/inspect/5b585070ed1cb0dbb9a696c724b7f9ed7844e1afe5188ef3aa44c7c3cc5b539d - Added "-cr" option to show current recipe
pipeline_string. - Changed the way recipes were being parsed by
proc_collectorfor the
usage of-land-Loptions. - Added check for non-ascii characters in colored_print.
- Fixed log when a file with the pipeline is provided to -t option
instead of a string.
Bug fixes
- Fixed pipeline names that contain new line characters.
- Fixed pipeline generation when automatic dependencies were added right after a fork
- Template: sistr.nf: Fixed comparison that determined process status.
- Fixed issue with
--versionoption.
Inspector gadget
New components
card_rgi: Anti-microbial gene screening for assembliesfilter_poly: Runs PrinSeq on paired-end FastQ files to remove low complexity sequenceskraken: Taxonomical identification of FastQ filesmegahit: Metagenomic assembler for paired-end FastQ filesmetaprob: Preforms read binning on metagenomic FastQ filesmetamlst: Checks the Sequence Type of metagenomic FastQ reads using Multilocus Sequence Typingmetaspades: Metagenomic assembler for paired-end FastQ filesmidas_species: Taxonomical identification of FastQ files on the species levelremove host: Read mapping with Bowtie2 against the host genome (default hg19) and removes the mapping readssistr: Salmonella in silico typing component for assemblies.
Features
- Added
inspectrun mode to flowcraft for displaying the progress overview
during a nextflow run. This run mode has anoverviewandbroadcastoptions
for viewing the progress of a pipeline.
Minor/Other changes
- Changed
mapping_patlasdocker container tag and variable
(PR #76). - The
envscope of nextflow.config now extends thePYTHONPATH
environmental variable. - Updated indexes for both
mapping_patlasandmashbased processes. - New logo!
Bug Fixes
- Template: fastqc_report.py: Added fix to trim range evaluation.
- Script: merge_json.py: Fixed chewbbaca JSON merge function.
Press Start!
First official release of assemblerflow.