DupRadar

graphical results

Figure 1 :a case with no duplication problems to the left and a case with duplication problems to the right.

source : nf-core RNAseq description

In figure 1, the graphs have the rate of duplicated reads on the ordinate and the expression level expressed in reads/kbp (1000 base pairs) on the abscissa. These graphs of regression lines perfectly illustrate the 2 possible cases. In the case of conventional sequencing, the greater the sequencing depth, the higher the number of duplicates. Dupradar allows you to visualize the levels of duplicates obtained as a function of the expression found for each sample. In an ideal case, the duplicate rate increases suddenly, but remains low (less than 10%) at low expression levels. On the other hand, in a case where there is a duplication problem, the points tend to represent a uniform egg shape, resulting in a continuously increasing regression curve. In the second case, it is not the sequencing depth but the level of duplication of the sequence that is responsible for the number of duplicates.

Figure 2 : generalized linear Dupradar models modeling the expression of duplicate data for each sample.

source : nf-core RNAseq MultiQC

The values shown in Figure 2 are taken from RNAseq data. A very high level of duplication is observed, particularly for the four samples at the top of the graph. This may be typical for RNAseq data, given that some genes, such as mitochondrial genes, may be much more expressed than others. There is also ribosomal RNA, which is a highly duplicated gene in the genome. As a result, this sequence can be found many times in the transcriptome, giving rise to multimapping and duplications. It is therefore very difficult to measure the differential expression of ribosomal RNA.

results matching ""

    No results matching ""