Skip to content

Output VCF is not valid due to symbolic allele used as reference allele #121

@vlshesketh

Description

@vlshesketh

The VCFs generated by QDNAseq do not conform to the specification for the reference allele field, as they contain symbolic alleles e.g. <DIP>. This results in errors when trying to either validate the VCF using VCF validator:

Error: Reference is not a string of bases. This occurs 3 time(s), first time in line 14.

Or when loading the VCF into IGV:

The provided VCF file is malformed at approximately line number 14: Symbolic alleles not allowed as reference allele

The specification requires that the reference should a string of one or more bases (A,C,G,T or N, from https://samtools.github.io/hts-specs/VCFv4.4.pdf). Page 31 of the guidelines gives some examples of how to represent copy number events, or alternatively other structural variant/CNV callers use N as the reference base.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions