TagOtherFileBySequence #
Marks reads based on wether sequences are present in another file.
[[step]]
action = "TagOtherFileBySequence"
out_label = "present_in_other_file"
filename = "names.fastq" # Can read fastq (also compressed), or SAM/BAM, or fasta files
segment = "read1" # Any of your input segments
false_positive_rate = 0.01 # false positive rate (0..1)
seed = 42 # seed for randomness
include_mapped = true # in case of BAM/SAM, whether to include aligned reads
include_unmapped = true # in case of BAM/SAM, whether to include unaligned reads
This step annotates reads by comparing their sequences against sequences from another file.
Please note our remarks about cuckoo filters.