<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Region Tag Generation on mbf-fastq-processor documentation</title><link>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/</link><description>Recent content in Region Tag Generation on mbf-fastq-processor documentation</description><generator>Hugo</generator><language>en-us</language><atom:link href="https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/index.xml" rel="self" type="application/rss+xml"/><item><title/><link>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractanchor/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractanchor/</guid><description>&lt;h1 id="extractanchor">
 ExtractAnchor
 &lt;a class="anchor" href="#extractanchor">#&lt;/a>
&lt;/h1>
&lt;p>Extract regions relative to a previously tagged anchor position.&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>&lt;span style="color:#75715e"># First create an anchor tag. Iupac, regex, ExtractRegion, your choice.&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ExtractIUPAC&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">search&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;CAYA&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">out_label&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;anchor_tag&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">segment&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;read1&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">anchor&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;Anywhere&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">max_mismatches&lt;/span> = &lt;span style="color:#ae81ff">0&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span>&lt;span style="color:#75715e"># Then extract relative to that anchor&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ExtractAnchor&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">out_label&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;mytag&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">in_label&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;anchor_tag&amp;#34;&lt;/span> &lt;span style="color:#75715e"># tag that provides the anchor position&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">regions&lt;/span> = [[&lt;span style="color:#ae81ff">-2&lt;/span>, &lt;span style="color:#ae81ff">4&lt;/span>], [&lt;span style="color:#ae81ff">4&lt;/span>, &lt;span style="color:#ae81ff">1&lt;/span>]] &lt;span style="color:#75715e"># [start, length] pairs relative to anchor&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">region_separator&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;_&amp;#34;&lt;/span> &lt;span style="color:#75715e"># (optional) separator between regions&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>This transformation uses the leftmost position of a previously established tag as the anchor point and extracts specified regions relative to that position.&lt;/p></description></item><item><title/><link>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractregex/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractregex/</guid><description>&lt;h1 id="extractregex">
 ExtractRegex
 &lt;a class="anchor" href="#extractregex">#&lt;/a>
&lt;/h1>
&lt;p>Extract a regexp result. Stores an empty string if not found.&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ExtractRegex&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">out_label&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;mytag&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">search&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;^CT(..)CT&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">replacement&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;$1&amp;#34;&lt;/span> &lt;span style="color:#75715e"># standard regex replacement syntax&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">source&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;read1&amp;#34;&lt;/span> &lt;span style="color:#75715e"># An input segment (to read from sequence), or name:&amp;lt;segment&amp;gt; to read from a tag&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>This transformation searches for a regular expression pattern in the specified read and extracts the matching portion as a tag.&lt;/p>
&lt;p>The value actually &amp;rsquo;extracted&amp;rsquo; is after replacement has been performed.&lt;/p></description></item><item><title/><link>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractregion/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractregion/</guid><description>&lt;h1 id="extractregion">
 ExtractRegion
 &lt;a class="anchor" href="#extractregion">#&lt;/a>
&lt;/h1>
&lt;p>Extract a fixed position region.&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ExtractRegion&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">start&lt;/span> = &lt;span style="color:#ae81ff">5&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">length&lt;/span> = &lt;span style="color:#ae81ff">8&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">segment&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;read1&amp;#34;&lt;/span> &lt;span style="color:#75715e"># Any of your input segments&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">out_label&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;umi&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>This transformation extracts a fixed-length region from the specified read at a given position and stores it as a tag.&lt;/p>
&lt;p>Use &lt;a href="https://tyberiusprime.github.io/fastqrab/v0.8.0/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractregions/">ExtractRegions&lt;/a> if your region is actually multiple regions (possibly from different segments).&lt;/p>
&lt;p>If the read is shorter than requested, the region will be shorter (and might be of varying length).&lt;/p></description></item><item><title/><link>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractregions/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractregions/</guid><description>&lt;h1 id="extractregions">
 ExtractRegions
 &lt;a class="anchor" href="#extractregions">#&lt;/a>
&lt;/h1>
&lt;p>Extract from multiple fixed position regions.&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ExtractRegions&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">regions&lt;/span> = [
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> {&lt;span style="color:#a6e22e">segment&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;read1&amp;#34;&lt;/span>, &lt;span style="color:#a6e22e">start&lt;/span> = &lt;span style="color:#ae81ff">0&lt;/span>, &lt;span style="color:#a6e22e">length&lt;/span> = &lt;span style="color:#ae81ff">8&lt;/span>},
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> {&lt;span style="color:#a6e22e">segment&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;read1&amp;#34;&lt;/span>, &lt;span style="color:#a6e22e">start&lt;/span> = &lt;span style="color:#ae81ff">12&lt;/span>, &lt;span style="color:#a6e22e">length&lt;/span> = &lt;span style="color:#ae81ff">4&lt;/span>},
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> ]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">out_label&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;barcode&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>This transformation extracts multiple fixed-length regions from reads and concatenates them into a single tag.&lt;/p>
&lt;p>ExtractRegions with only one region are exactly equivalent to &lt;a href="https://tyberiusprime.github.io/fastqrab/v0.8.0/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractregion/">ExtractRegion&lt;/a>.&lt;/p>
&lt;p>If the read is shorter than requested, the region will be shorter (and might be of varying length).&lt;/p></description></item><item><title/><link>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractregionsoflowquality/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractregionsoflowquality/</guid><description>&lt;h1 id="extractregionsoflowquality">
 ExtractRegionsOfLowQuality
 &lt;a class="anchor" href="#extractregionsoflowquality">#&lt;/a>
&lt;/h1>
&lt;p>Extract regions (min size 1 bp) where bases have quality scores below threshold.&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ExtractRegionsOfLowQuality&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">segment&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;read1&amp;#34;&lt;/span> &lt;span style="color:#75715e"># Any of your input segments&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">min_quality&lt;/span> = &lt;span style="color:#ae81ff">60&lt;/span> &lt;span style="color:#75715e"># Quality threshold (Phred+33)&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">out_label&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;low_quality_regions&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>This transformation scans through quality scores of the specified segment and identifies contiguous regions where quality scores are below the specified threshold. Each low-quality region becomes a tagged region with location information (start position and length).&lt;/p>
&lt;h2 id="parameters">
 Parameters
 &lt;a class="anchor" href="#parameters">#&lt;/a>
&lt;/h2>
&lt;ul>
&lt;li>&lt;code>segment&lt;/code>: Which read to analyze for low-quality regions&lt;/li>
&lt;li>&lt;code>min_quality&lt;/code>: Quality score threshold using Phred+33 encoding. See &lt;a href="https://en.wikipedia.org/wiki/Phred_quality_score#Symbols">Phred quality score&lt;/a> for ASCII character mapping&lt;/li>
&lt;li>&lt;code>out_label&lt;/code>: Tag name to store the extracted regions&lt;/li>
&lt;/ul>
&lt;h2 id="example">
 Example
 &lt;a class="anchor" href="#example">#&lt;/a>
&lt;/h2>
&lt;p>With &lt;code>min_quality = 60&lt;/code> (ASCII &amp;lsquo;&amp;lt;&amp;rsquo;), any bases with quality scores below &amp;lsquo;&amp;lt;&amp;rsquo; will be identified as low-quality regions. This is useful for masking or filtering poor-quality sequences.&lt;/p></description></item><item><title>Extract IUPAC</title><link>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractiupac/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractiupac/</guid><description>&lt;h1 id="extractiupac">
 ExtractIUPAC
 &lt;a class="anchor" href="#extractiupac">#&lt;/a>
&lt;/h1>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ExtractIUPAC&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">out_label&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;mytag&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">anchor&lt;/span> = &lt;span style="color:#e6db74">&amp;#39;Left&amp;#39;&lt;/span> &lt;span style="color:#75715e"># Left | Right | Anywhere&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">search&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;CTN&amp;#34;&lt;/span> &lt;span style="color:#75715e"># what we are searching&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">segment&lt;/span> = &lt;span style="color:#e6db74">&amp;#39;read1&amp;#39;&lt;/span> &lt;span style="color:#75715e"># Any of your input segments&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Search and extract a sequence from the read, defined by a &lt;a href="https://doi.org/10.1093%2Fnar%2F13.9.3021">IUPAC string&lt;/a>.&lt;/p>
&lt;p>If anchor = &amp;lsquo;Anywhere&amp;rsquo;, ExtractIUPAC will find the left most occurance.&lt;/p></description></item><item><title>Extract IUPAC suffix</title><link>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractiupacsuffix/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractiupacsuffix/</guid><description>&lt;h1 id="extractiupacsuffix">
 ExtractIUPACSuffix
 &lt;a class="anchor" href="#extractiupacsuffix">#&lt;/a>
&lt;/h1>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ExtractIUPACSuffix&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">out_label&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;mytag&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">query&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;AGTCA&amp;#34;&lt;/span> &lt;span style="color:#75715e"># the adapter to trim. Straigth bases only, no IUPAC.&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">segment&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;read1&amp;#34;&lt;/span> &lt;span style="color:#75715e"># Any of your input segments (default: read1)&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">min_length&lt;/span> = &lt;span style="color:#ae81ff">3&lt;/span> &lt;span style="color:#75715e"># uint, the minimum length of match between the end of the read and&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#75715e"># the start of the adapter&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">max_mismatches&lt;/span> = &lt;span style="color:#ae81ff">1&lt;/span> &lt;span style="color:#75715e"># How many mismatches to accept&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Find a potentially truncated &lt;a href="https://doi.org/10.1093%2Fnar%2F13.9.3021">IUPAC string&lt;/a> sequence at the end of a read.&lt;/p>
&lt;p>Simple comparison with a max mismatch hamming distance, requiring only the first min length
bases of the query to match at the end of the read.&lt;/p></description></item><item><title>Extract IUPAC with Indels</title><link>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractiupacwithindel/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractiupacwithindel/</guid><description>&lt;h1 id="extractiupacwithindel">
 ExtractIUPACWithIndel
 &lt;a class="anchor" href="#extractiupacwithindel">#&lt;/a>
&lt;/h1>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ExtractIUPACWithIndel&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">out_label&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;adapter&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">search&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;AGTC&amp;#34;&lt;/span> &lt;span style="color:#75715e"># IUPAC pattern to align against&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">max_mismatches&lt;/span> = &lt;span style="color:#ae81ff">1&lt;/span> &lt;span style="color:#75715e"># allowed substitutions (IUPAC-aware)&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">max_indel_bases&lt;/span> = &lt;span style="color:#ae81ff">1&lt;/span> &lt;span style="color:#75715e"># total insertions + deletions allowed&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">max_total_edits&lt;/span> = &lt;span style="color:#ae81ff">2&lt;/span> &lt;span style="color:#75715e"># optional overall edit ceiling&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">anchor&lt;/span> = &lt;span style="color:#e6db74">&amp;#39;Anywhere&amp;#39;&lt;/span> &lt;span style="color:#75715e"># Left | Right | Anywhere&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">segment&lt;/span> = &lt;span style="color:#e6db74">&amp;#39;read1&amp;#39;&lt;/span> &lt;span style="color:#75715e"># defaults to read1&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Locate an &lt;a href="https://doi.org/10.1093%2Fnar%2F13.9.3021">IUPAC&lt;/a> pattern even when the read contains small insertions or deletions relative to the pattern. The extractor performs a semiglobal alignment (pattern vs. read segment) using IUPAC-aware scoring and returns the aligned span as a location tag.&lt;/p></description></item><item><title/><link>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractlongestpolyx/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractlongestpolyx/</guid><description>&lt;h1 id="extractlongestpolyx">
 ExtractLongestPolyX
 &lt;a class="anchor" href="#extractlongestpolyx">#&lt;/a>
&lt;/h1>
&lt;p>Find the longest homopolymer stretch anywhere in the read (unlike &lt;code>ExtractPolyTail&lt;/code>, which only considers suffixes).&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ExtractLongestPolyX&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">out_label&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;my_tag&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">segment&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;read1&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">min_length&lt;/span> = &lt;span style="color:#ae81ff">5&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">base&lt;/span> = &lt;span style="color:#e6db74">&amp;#39;.&amp;#39;&lt;/span> &lt;span style="color:#75715e"># search for any homopolymer (A/C/G/T/N)&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">max_mismatch_rate&lt;/span> = &lt;span style="color:#ae81ff">0.15&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">max_consecutive_mismatches&lt;/span> = &lt;span style="color:#ae81ff">2&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;ul>
&lt;li>&lt;code>base&lt;/code> accepts a concrete nucleotide (&lt;code>A&lt;/code>, &lt;code>C&lt;/code>, &lt;code>G&lt;/code>, &lt;code>T&lt;/code>, &lt;code>N&lt;/code>) or &lt;code>.&lt;/code> to search all of &lt;code>ACGT&lt;/code> and report the longest hit.&lt;/li>
&lt;li>&lt;code>max_mismatch_rate&lt;/code> and &lt;code>max_consecutive_mismatches&lt;/code> mirror &lt;code>ExtractPolyTail&lt;/code>; they control how permissive the run detection is.&lt;/li>
&lt;li>When no run satisfies &lt;code>min_length&lt;/code>, the tag is reported as missing.&lt;/li>
&lt;li>only one run is reported, even if multiple runs of the same length exist; in this case, the first run found is reported.&lt;/li>
&lt;/ul></description></item><item><title/><link>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractlowqualityend/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractlowqualityend/</guid><description>&lt;h1 id="trimqualityend">
 TrimQualityEnd
 &lt;a class="anchor" href="#trimqualityend">#&lt;/a>
&lt;/h1>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ExtractLowQualityEnd&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">out_label&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;low_quality_ends&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">min_qual&lt;/span> = &lt;span style="color:#ae81ff">20&lt;/span> &lt;span style="color:#75715e"># u8, minimum quality to keep (in whatever your score is encoded in)&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#75715e"># either a char like &amp;#39;A&amp;#39; or a number 0..128 (typical phred score is 33..75)&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">segment&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;read1&amp;#34;&lt;/span> &lt;span style="color:#75715e"># Any of your input segments&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Define a region of low quality bases at the end of reads.&lt;/p>
&lt;h2 id="corresponding-options-in-other-software">
 Corresponding options in other software
 &lt;a class="anchor" href="#corresponding-options-in-other-software">#&lt;/a>
&lt;/h2>
&lt;ul>
&lt;li>Trimmomatic: TRAILING (if paired with &lt;a href="https://tyberiusprime.github.io/fastqrab/v0.8.0/fastqrab/v0.8.0/docs/reference/modification-steps/trimattag/">TrimAtTag&lt;/a>)&lt;/li>
&lt;/ul></description></item><item><title/><link>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractlowqualitystart/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractlowqualitystart/</guid><description>&lt;h1 id="trimqualitystart">
 TrimQualityStart
 &lt;a class="anchor" href="#trimqualitystart">#&lt;/a>
&lt;/h1>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ExtractLowQualityStart&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">min_qual&lt;/span> = &lt;span style="color:#ae81ff">20&lt;/span> &lt;span style="color:#75715e"># u8, minimum quality to keep (in whatever your score is encoded in)&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#75715e"># either a char like &amp;#39;A&amp;#39; or a number 0..128 (typical phred score is 33..75)&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">segment&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;read1&amp;#34;&lt;/span> &lt;span style="color:#75715e"># Any of your input segments&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">out_label&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;bad_starts&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Define a region with low quality bases (below threshold) at steart of read.&lt;/p>
&lt;h2 id="corresponding-options-in-other-software">
 Corresponding options in other software
 &lt;a class="anchor" href="#corresponding-options-in-other-software">#&lt;/a>
&lt;/h2>
&lt;ul>
&lt;li>Trimmomatic: LEADING (if combined with &lt;a href="https://tyberiusprime.github.io/fastqrab/v0.8.0/fastqrab/v0.8.0/docs/reference/modification-steps/trimattag/">TrimAtTag&lt;/a>)&lt;/li>
&lt;/ul></description></item><item><title/><link>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractpolytail/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/v0.8.0/docs/reference/tag-steps/extract/extractpolytail/</guid><description>&lt;h1 id="extractpolytail">
 ExtractPolyTail
 &lt;a class="anchor" href="#extractpolytail">#&lt;/a>
&lt;/h1>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ExtractPolyTail&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">out_label&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;tag_label&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">segment&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;read1&amp;#34;&lt;/span> &lt;span style="color:#75715e"># Any of your input segments (default: read1)&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">min_length&lt;/span> = &lt;span style="color:#ae81ff">5&lt;/span> &lt;span style="color:#75715e"># positive integer, the minimum number of repeats of the base&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">base&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;A&amp;#34;&lt;/span> &lt;span style="color:#75715e"># one of AGTCN., the &amp;#39;base&amp;#39; to trim (or . for any repeated base)&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">max_mismatch_rate&lt;/span> = &lt;span style="color:#ae81ff">0.1&lt;/span> &lt;span style="color:#75715e"># float 0.0..=1.0, how many mismatches are allowed in the repeat&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">max_consecutive_mismatches&lt;/span> = &lt;span style="color:#ae81ff">3&lt;/span> &lt;span style="color:#75715e"># how many consecutive mismatches are allowed&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Identify either a specific letter (AGTC or N) repetition,
or any base repetition (base = &amp;lsquo;.&amp;rsquo;) at the end of the read.&lt;/p></description></item></channel></rss>