Extract Regions of Low Quality

ExtractRegionsOfLowQuality #

Extract regions (min size 1 bp) where bases have quality scores below threshold.

[[step]]
    action = "ExtractRegionsOfLowQuality"
    segment = "read1" # Any of your input segments
    min_quality = 60  # Quality threshold (Phred+33)
    label = "low_quality_regions"

This transformation scans through quality scores of the specified segment and identifies contiguous regions where quality scores are below the specified threshold. Each low-quality region becomes a tagged region with location information (start position and length).

Parameters #

  • segment: Which read to analyze for low-quality regions
  • min_quality: Quality score threshold using Phred+33 encoding. See Phred quality score for ASCII character mapping
  • label: Tag name to store the extracted regions

Example #

With min_quality = 60 (ASCII ‘<’), any bases with quality scores below ‘<’ will be identified as low-quality regions. This is useful for masking or filtering poor-quality sequences.