<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>fastqrab documentation</title><link>https://tyberiusprime.github.io/fastqrab/main/docs/reference/validation-steps/</link><description>Recent content on fastqrab documentation</description><generator>Hugo</generator><language>en-us</language><atom:link href="https://tyberiusprime.github.io/fastqrab/main/docs/reference/validation-steps/index.xml" rel="self" type="application/rss+xml"/><item><title/><link>https://tyberiusprime.github.io/fastqrab/main/docs/reference/validation-steps/ValidateAllReadsSameLength/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/main/docs/reference/validation-steps/ValidateAllReadsSameLength/</guid><description>&lt;h1 id="validateallreadssamelength">
 ValidateAllReadsSameLength
 &lt;a class="anchor" href="#validateallreadssamelength">#&lt;/a>
&lt;/h1>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ValidateAllReadsSameLength&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">source&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;read1&amp;#34;&lt;/span> &lt;span style="color:#75715e"># Any segment, All, tag:&amp;lt;name&amp;gt; or &amp;#39;name:segment&amp;gt;&amp;#39;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Validates that all reads have the same sequence/tag/name length.&lt;/p>
&lt;p>Useful when you want to verify read length consistency in your pipeline.&lt;/p>
&lt;p>(For names, the names without comments -
that is up to the first input.options.read_comment_character are used).&lt;/p></description></item><item><title/><link>https://tyberiusprime.github.io/fastqrab/main/docs/reference/validation-steps/ValidateName/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/main/docs/reference/validation-steps/ValidateName/</guid><description>&lt;h1 id="validatename">
 ValidateName
 &lt;a class="anchor" href="#validatename">#&lt;/a>
&lt;/h1>
&lt;p>Verify that all segments have the same read name (or a shared prefix).&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ValidateName&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#75715e"># Optional separator character; the comparison stops at the first match&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">readname_end_char&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;_&amp;#34;&lt;/span> &lt;span style="color:#75715e"># Optional. Do not set for exact matching. Otherwise, a byte character&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">sample_stride&lt;/span> = &lt;span style="color:#ae81ff">1000&lt;/span> &lt;span style="color:#75715e"># Check every nth fragment, default 1000. Must be &amp;gt; 0. Starts with first read&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>When no separator character (readname_end_char) is provided the
entire name must match exactly across all segments.&lt;/p></description></item><item><title/><link>https://tyberiusprime.github.io/fastqrab/main/docs/reference/validation-steps/ValidateQuality/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/main/docs/reference/validation-steps/ValidateQuality/</guid><description>&lt;h1 id="validatequality">
 ValidateQuality
 &lt;a class="anchor" href="#validatequality">#&lt;/a>
&lt;/h1>
&lt;p>Validate that all scores are between 33..=41&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ValidateQuality&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">segment&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;read1&amp;#34;&lt;/span> &lt;span style="color:#75715e"># Any of your input segments, or &amp;#39;All&amp;#39;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">encoding&lt;/span> = &lt;span style="color:#e6db74">&amp;#39;Illumina1.8&amp;#39;&lt;/span> &lt;span style="color:#75715e"># &amp;#39;Illumina1.8|Illumina1.3|Sanger|Solexa&amp;#39;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#75715e"># Illumina1.8 is an alias for Sanger.&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>The encoding defines the accepted range of values.&lt;/p>
&lt;p>If you want to convert quality codes, use &lt;a href="https://tyberiusprime.github.io/fastqrab/main/fastqrab/main/docs/reference/modification-steps/ConvertQuality/">ConvertQuality&lt;/a>.&lt;/p>
&lt;p>See &lt;a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC2847217/">https://pmc.ncbi.nlm.nih.gov/articles/PMC2847217/&lt;/a> , table 1&lt;/p></description></item><item><title/><link>https://tyberiusprime.github.io/fastqrab/main/docs/reference/validation-steps/ValidateReadNamesPrintable/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/main/docs/reference/validation-steps/ValidateReadNamesPrintable/</guid><description>&lt;h1 id="validatereadnamesprintable">
 ValidateReadNamesPrintable
 &lt;a class="anchor" href="#validatereadnamesprintable">#&lt;/a>
&lt;/h1>
&lt;p>Validate that every read name conforms to the SAM/BAM specification.&lt;/p>
&lt;p>The SAM specification requires that query names (QNAME) match &lt;code>[!-?A-~]{1,254}&lt;/code>:
printable ASCII characters excluding &lt;code>@&lt;/code> and space, with a maximum length of 254.&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ValidateReadNamesPrintable&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>No additional parameters are needed — the allowed character set is fixed by the SAM spec.&lt;/p>
&lt;h2 id="when-to-use">
 When to use
 &lt;a class="anchor" href="#when-to-use">#&lt;/a>
&lt;/h2>
&lt;p>Add this step when your pipeline writes non-BAM output that will become BAM eventually
and you suspect read names may contain invalid characters.&lt;/p></description></item><item><title/><link>https://tyberiusprime.github.io/fastqrab/main/docs/reference/validation-steps/ValidateReadPairing/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/main/docs/reference/validation-steps/ValidateReadPairing/</guid><description>&lt;h1 id="validatereadpairing">
 ValidateReadPairing
 &lt;a class="anchor" href="#validatereadpairing">#&lt;/a>
&lt;/h1>
&lt;p>Confirms for every &lt;code>sample_stride&lt;/code>th read &amp;lsquo;pair&amp;rsquo; that the names are
identical but for one letter.&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ValidateReadPairing&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">sample_stride&lt;/span> = &lt;span style="color:#ae81ff">1000&lt;/span> &lt;span style="color:#75715e"># Check every nth fragment, default 1000. Must be &amp;gt; 0. Starts with first read&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Ensures&lt;/p>
&lt;ul>
&lt;li>read names between segments have the same length&lt;/li>
&lt;li>read names between segments have a hamming distance of at most one.&lt;/li>
&lt;/ul>
&lt;p>Note that this validation requires at least two input segments.&lt;/p>
&lt;p>(See also: &lt;a href="https://tyberiusprime.github.io/fastqrab/main/fastqrab/main/docs/reference/validation-steps/ValidateName/">&lt;code>ValidateName&lt;/code>&lt;/a>,
which validates after truncating on a character occurance).&lt;/p></description></item><item><title/><link>https://tyberiusprime.github.io/fastqrab/main/docs/reference/validation-steps/ValidateSeq/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://tyberiusprime.github.io/fastqrab/main/docs/reference/validation-steps/ValidateSeq/</guid><description>&lt;h1 id="validateseq">
 ValidateSeq
 &lt;a class="anchor" href="#validateseq">#&lt;/a>
&lt;/h1>
&lt;p>Validate that only allowed characters are in the sequence.&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" style="color:#f8f8f2;background-color:#272822;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-toml" data-lang="toml">&lt;span style="display:flex;">&lt;span>[[&lt;span style="color:#a6e22e">step&lt;/span>]]
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">action&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;ValidateSeq&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">allowed&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;AGTC&amp;#34;&lt;/span> &lt;span style="color:#75715e"># String. Example &amp;#39;ACGTN&amp;#39;, the allowed characters&lt;/span>
&lt;/span>&lt;/span>&lt;span style="display:flex;">&lt;span> &lt;span style="color:#a6e22e">segment&lt;/span> = &lt;span style="color:#e6db74">&amp;#34;read1&amp;#34;&lt;/span> &lt;span style="color:#75715e"># Any of your input segments, or &amp;#39;All&amp;#39;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div></description></item></channel></rss>