Skip to content

Additional annotation check whether the sequence ID exists in a given genome

Workum, Dirk-Jan van requested to merge fix_incorrect_seqid_annotation into master

We discovered that if a gene annotation file contains a sequence ID (first column gff file) that does not exist for the given genome, PanTools will give a NullPointerException with a stack trace instead of a more meaningful error.

Here I suggest to add features with an impossible sequence ID to the list of invalid features. This list of invalid features will contain all correctly parsed features that do not correspond with the given genome, so I believe it makes sense to add them to this list.

Merge request reports