Tag Other File by Sequence

TagOtherFileBySequence #

Marks reads based on wether sequences are present in another file.

[[step]]
    action = "TagOtherFileBySequence"
    out_label = "present_in_other_file"
    filename = "names.fastq" # Can read fastq (also compressed), or SAM/BAM, or fasta files
    segment = "read1" # Any of your input segments
    false_positive_rate = 0.01 # false positive rate (0..1)
    seed = 42 # seed for randomness
    include_mapped = true # in case of BAM/SAM, whether to include aligned reads
    include_unmapped = true # in case of BAM/SAM, whether to include unaligned reads

This step annotates reads by comparing their sequences against sequences from another file.

Please note our remarks about cuckoo filters.