ExtractLongestPolyX
Find the longest homopolymer stretch anywhere in the read (unlike ExtractPolyTail, which only considers suffixes).
[[step]]
action = "ExtractLongestPolyX"
out_label = "my_tag"
segment = "read1"
min_length = 5
base = '.' # search for any homopolymer (A/C/G/T/N)
max_mismatch_rate = 0.15
max_consecutive_mismatches = 2
- `base` accepts a concrete nucleotide (`A`, `C`, `G`, `T`, `N`) or `.` to search all of `ACGT` and report the longest hit.
- `max_mismatch_rate` and `max_consecutive_mismatches` mirror `ExtractPolyTail`; they control how permissive the run detection is.
- When no run satisfies `min_length`, the tag is reported as missing.
- Only one run is reported, even if multiple runs of the same length exist; in that case, the first run found is reported.
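
To make the interaction of the parameters concrete, here is a minimal Python sketch of one way such a search could work. It is an illustration only, not the tool's actual implementation: the function name `longest_poly_x`, the `(start, length, base)` return value, the brute-force scan, and the rule that a run must start and end on the target base are all assumptions made for this example. Ties are resolved in favour of the first run encountered, which here means candidate bases are tried in `ACGT` order and start positions from left to right.

```python
from typing import Optional, Tuple


def longest_poly_x(
    seq: str,
    base: str = ".",
    min_length: int = 5,
    max_mismatch_rate: float = 0.15,
    max_consecutive_mismatches: int = 2,
) -> Optional[Tuple[int, int, str]]:
    """Return (start, length, base) of the longest tolerated run, or None.

    Hypothetical sketch: the real step may scan and break ties differently.
    """
    # '.' means: try every base in ACGT and keep the longest hit.
    candidates = "ACGT" if base == "." else base
    best: Optional[Tuple[int, int, str]] = None

    for b in candidates:
        for start in range(len(seq)):
            if seq[start] != b:
                continue  # assumption: a run starts on a matching base
            mismatches = 0
            consecutive = 0
            for end in range(start, len(seq)):
                if seq[end] == b:
                    consecutive = 0
                    length = end - start + 1
                    # Only record runs that end on a matching base and
                    # stay within the allowed mismatch rate.
                    if (
                        length >= min_length
                        and mismatches / length <= max_mismatch_rate
                        and (best is None or length > best[1])
                    ):
                        best = (start, length, b)
                else:
                    mismatches += 1
                    consecutive += 1
                    if consecutive > max_consecutive_mismatches:
                        break  # too many mismatches in a row; stop extending
    return best


print(longest_poly_x("ACGTAAAAAAAACGT"))            # (4, 8, 'A')
print(longest_poly_x("ACGTACGT"))                   # None: nothing reaches min_length
print(longest_poly_x("TTTTTTGGGGGGGG", base="G"))   # (6, 8, 'G')
```

With the default `max_mismatch_rate` of 0.15, a single `C` inside a stretch of 14 `A`s is tolerated and the whole 15-base span counts as one run; setting the rate to 0 splits it into two 7-base runs, of which the first is returned.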