Instructions

Welcome to the metaseqR2 report! If you are familiar with the metaseqR report, then you will find that there are not many differences with respect to the presented information. Some diagnostic and exploration plots were added. The most notable difference is that all plots are interactive. This helps a lot with exploration and interpretation but also adds a lot of computational burden. However, relatively modern systems with recent browser versions should be capable of rendering all the graphics. The metaseqR2 report has been tested with Google Chrome, Mozilla Firefox and Microsoft Edge. It has not been tested with Internet Explorer, Opera and Safari and most probably will not be. Other Chromium browsers (e.g. Brave) should also be fine.

All plots in the metaseqR2 report are interactive. This is achieved by passing the data underlying the standard graphics to libraries including Highcharts, Plotly and jvenn, which create more user-friendly and directly explorable plots. Instructions on the usage of these plots follow:

  • All plots are interactively explorable. This means that if you move your mouse inside the plot area (a move called mouse-over), you can retrieve information on each single data point. This applies to all plots. More specifically:
    • In scatterplots, if you mouse-over each point, information about this point is presented, depending on the type of the plot. The data series from which the point comes is also presented. For example, in a Volcano plot, fold change and significance, as well as the name of the gene and the data category (e.g. up-regulated) will be presented.
    • In barplots, if you mouse-over each bar, information about this bar is presented, such as the value it represents and the data series from which it comes. If the barplot contains groups of bars, then information about each group is displayed. For example, in a Biodetection plot, each bar group presents the percentage of a biotype in the examined genome, the percentage in the sample and the detected percentage according to read counts.
    • In boxplots, if you mouse-over the boxes, the information about the underlying distribution is displayed (maximum, upper quartile, median, lower quartile and minimum) as well as the data series. If you mouse-over an outlier, then information on this single point is presented (e.g. value).
    • Some barplots have a double y-axis system corresponding to different measurements or scales. For example in Biodetection barplots, the left y-axis presents abundant features while the right y-axis presents non-abundant features. In the Filtered barplot, y-axes present different values (numbers and fractions).
    • Line plots can be moused-over too. Depending on the plot type and on how important the exact values are, these values may or may not be shown, in order to avoid over-crowding the plots. For example, in the Reads noise plot, we are interested in the trend and not so much in exact values.
    • Heatmaps can be moused-over too. Information on each heatmap cell will be displayed.
  • All scatterplots and heatmaps are zoomable. Press and hold the left mouse button inside the plot area and drag to draw a rectangle around the region you want to zoom in on. A button for resetting the zoom appears once you have zoomed in.
  • Data series in scatterplots, barplots and boxplots can be toggled on or off by clicking on the legend name of each data series which is placed below each plot. For example, in Volcano plots, if you click on the name “Unregulated”, then the respective data series will stop appearing in the plot. You can bring it back by clicking the legend again.
  • All plots are exportable. On the top right corner of each scatterplot, barplot and boxplot, there is a menu button with several functionalities, including exporting in various formats and presenting the plot in full-screen mode. For heatmaps, this functionality is offered by a set of small buttons that appear if you mouse-over at the top of the heatmap.
  • In Venn diagrams, if you click on the number for each category, the respective gene/transcript names will appear in the box on the right of the diagram.
  • All plots can be downloaded in static formats (those requested in the metaseqr2 call) from the Results section.

The metaseqR2 report contains the sections described below, depending on which diagnostic and exploration plots were requested in the run command. Plots are grouped in categories, and a category from which no plot was requested does not appear in the report. The categories are:

Summary

The Summary section is further categorized in several subsections. Specifically:

  • Analysis summary: This section contains an auto-generated text that analytically describes the computational process followed and summarizes the results of each step. This text can be used as is or with slight modifications in the Methods section of an article.
  • Input options: This section provides a list of the input arguments to the pipeline in a more human-readable format.
  • Filtering: This section reports in detail the number of filtered genes decomposed according to the number of genes removed by each applied filter.
  • Differential expression: This section reports in detail the number of differentially expressed genes for each contrast, both when using only a p-value cutoff as well as an FDR cutoff (numbers in parentheses), that is, genes passing the multiple testing correction procedure selected. These numbers are also calculated based on a simple fold change cutoff in log2 scale. Finally, when multiple algorithms are used with p-value combination, this section reports all the findings analytically per algorithm.
  • Command: This section contains the command used to run the metaseqr2 pipeline, for users who want to experiment, as well as the critical messages displayed within the R session running metaseqr2, presented as a log. Finally, if a targets file has been used to perform the analysis, a table depicting the parameters in the targets file is created, along with a link to download the actual targets file. Any relative paths to BAM files are stripped, and the user is responsible for prepending them if the targets file has to be reused in another location, e.g. locally.
  • Tracks: This section contains a link which opens a new window to the UCSC Genome Browser where normalized tracks based on the input BAM files are displayed. If stranded tracks have been requested (according to the sequencing protocol or technology), then a track hub is created to display the stranded tracks. From this tab, you can also download bigWig files as well as copy track lines for manual input to the UCSC Genome Browser.

Quality control

The Quality control section contains several interactive plots concerning the overall quality control of each sample provided as well as overall assessments. The quality control plots are the Multidimensional Scaling (MDS) plot, the Biotypes detection (Biodetection) plot, the Biotype abundance (Countsbio) plot, the Read saturation (Saturation) plot, the Read noise (ReadNoise) plot, the Correlation heatmap (Correlation), the Pairwise sample scatterplots (Pairwise) and the Filtered entities (Filtered) plot. Each plot is accompanied by a detailed description of what it depicts. Where multiple plots are available (e.g. one for each sample), a selection list on the top of the respective section allows the selection of the sample to be displayed.

Normalization

The Normalization section contains several interactive plots that can be used to inspect and assess the normalization procedure. Therefore, normalization plots are usually paired, showing the same data instance normalized and not normalized. The normalization plots are the Expression boxplots (Boxplots) plots, the GC content bias (GC bias) plots, the Gene length bias (Length bias) plots, the Within condition mean-difference (Mean-Difference) plots, the Mean-variance relationship (Mean-Variance) plot and the RNA composition (Rna composition) plot. Each plot is accompanied by a detailed description of what it depicts. Where multiple plots are available (e.g. one for each sample), a selection list on the top of the respective section allows the selection of the sample to be displayed.

Statistics

The Statistics section contains several interactive plots that can be used to inspect and explore the outcome of statistical testing procedures. The statistics plots are the Volcano plot (Volcano), the MA or Mean-Difference across conditions (MA) plot, the Expression heatmap (Heatmap) plot, the Chromosome and biotype distributions (Biodist) plot, the Venn diagram across statistical tests (StatVenn), the Venn diagram across contrasts (FoldVenn) and the Deregulogram. Each plot is accompanied by a detailed description of what it depicts. Please note that the heatmap plots only show the top percentage of differentially expressed genes, as controlled by the reportTop parameter of the metaseqr2 pipeline. Where multiple plots are available (e.g. one for each contrast), a selection list on the top of the respective section allows the selection of the plot to be displayed.

Results

The Results section contains a snapshot of differentially expressed genes in table format with basic information about each gene and links to external resources. Certain columns of the table are colored according to significance. Larger bars and more intense colors indicate higher significance. For example, the bar in the p_value column is larger if the gene has higher statistical significance and the fold change cell background is bright red if the gene is highly up-regulated. From the Results section, full gene lists can be downloaded in text tab-delimited format and viewed with a spreadsheet application such as MS Excel. A selector on the top of the section above the table allows the display of different contrasts.

References

The References section contains bibliographical references regarding the algorithms used by the metaseqr2 pipeline and is adjusted according to the algorithms selected.

Summary

Analysis summary

The raw BAM files, one for each RNA-Seq sample, were summarized to an exon read counts table, using the Bioconductor package GenomicRanges. In the final read counts table, each row represented one exon, each column one RNA-Seq sample and each cell, the corresponding read counts associated with each row and column. The exon read counts were filtered for artifacts that could affect the subsequent normalization and statistical testing procedures as follows: if an annotated gene had up to 5 exons, read presence was required in at least 2 of the exons, else if an annotated gene had more than 5 exons, then read presence was required in at least ⌈0.2×E⌉ exons, where E is the number of exons of the gene and ⌈.⌉ is the ceiling mathematical function. The application of this filter resulted in the exclusion of 9464 genes from further analysis. The total number of genes excluded due to the application of exon filters was 9464. The final read counts for each gene model were calculated as the sums of their exon reads, creating a gene counts table where each row corresponded to an Ensembl gene model and each column corresponded to an RNA-Seq sample. The gene counts table was normalized for inherent systematic or experimental biases (e.g. sequencing depth, gene length, GC content bias etc.) using the Bioconductor package DESeq after removing genes that had zero counts over all the RNA-Seq samples (6412 genes). The output of the normalization algorithm was a table with normalized counts, which can be used for differential expression analysis with statistical algorithms developed specifically for count data. Prior to the statistical testing procedure, the gene read counts were filtered for possible artifacts that may affect the subsequent statistical testing procedures.
Genes/transcripts presenting any of the following were excluded from further analysis: i) genes with length less than 100 bp (793 genes), ii) genes whose average number of reads per 100 bp was less than the 0.25 quantile of the total normalized distribution of average reads per 100 bp (0 genes with cutoff value 0.10379 average reads per 100 bp), iii) genes with read counts below the median read counts of the total normalized count distribution (12559 genes with cutoff value 14 normalized read counts), iv) genes whose biotype matched the following: rRNA (130 genes). The total number of genes excluded due to the application of gene filters was 2628. The total (unified) number of genes excluded due to the application of all filters was 19360. The resulting gene counts table was subjected to differential expression analysis for the contrasts DEN_Smyd3KO versus DEN_WT, DEN_WT versus WT, DEN_Smyd3KO versus Smyd3KO, Smyd3KO versus WT using the Bioconductor packages DESeq, DESeq2, edgeR, NOISeq, limma, NBPSeq, ABSSeq, DSS. In order to combine the statistical significance from multiple algorithms and perform meta-analysis, the PANDORA weighted p-value across results method was applied.
The final numbers of differentially expressed genes were (per contrast): for the contrast DEN_Smyd3KO versus DEN_WT, 8733 (6100) statistically significant genes were found with a p-value (FDR or adjusted p-value) threshold of 0.05 and of these, 2279 (1937) were up-regulated, 1910 (1611) were down-regulated and 4544 (2552) were not differentially expressed according to an absolute fold change cutoff value of 1 in log2 scale; for the contrast DEN_WT versus WT, 10043 (8016) statistically significant genes were found with a p-value (FDR or adjusted p-value) threshold of 0.05 and of these, 2302 (2156) were up-regulated, 2961 (2638) were down-regulated and 4780 (3222) were not differentially expressed according to an absolute fold change cutoff value of 1 in log2 scale; for the contrast DEN_Smyd3KO versus Smyd3KO, 4524 (1752) statistically significant genes were found with a p-value (FDR or adjusted p-value) threshold of 0.05 and of these, 1060 (512) were up-regulated, 1478 (889) were down-regulated and 1986 (351) were not differentially expressed according to an absolute fold change cutoff value of 1 in log2 scale; for the contrast Smyd3KO versus WT, 9669 (7513) statistically significant genes were found with a p-value (FDR or adjusted p-value) threshold of 0.05 and of these, 2298 (2106) were up-regulated, 2522 (2239) were down-regulated and 4849 (3168) were not differentially expressed according to an absolute fold change cutoff value of 1 in log2 scale. Literature references for all the algorithms used can be found at the end of this report.
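
As a quick sanity check on the figures quoted in this summary, the up-regulated, down-regulated and not differentially expressed counts of each contrast should add up to the reported number of statistically significant genes. The following is a minimal Python sketch of that arithmetic; the dictionary layout and the helper name are illustrative and are not part of metaseqR2.

```python
# Counts copied from the summary text as (p-value based, FDR based) pairs.
# The data layout and the helper name are illustrative, not metaseqR2 output.
counts = {
    "DEN_Smyd3KO_vs_DEN_WT": {
        "total": (8733, 6100), "up": (2279, 1937),
        "down": (1910, 1611), "not_de": (4544, 2552),
    },
    "DEN_WT_vs_WT": {
        "total": (10043, 8016), "up": (2302, 2156),
        "down": (2961, 2638), "not_de": (4780, 3222),
    },
    "DEN_Smyd3KO_vs_Smyd3KO": {
        "total": (4524, 1752), "up": (1060, 512),
        "down": (1478, 889), "not_de": (1986, 351),
    },
    "Smyd3KO_vs_WT": {
        "total": (9669, 7513), "up": (2298, 2106),
        "down": (2522, 2239), "not_de": (4849, 3168),
    },
}


def is_consistent(entry):
    """True if up + down + not_de equals total for both cutoffs."""
    return all(
        entry["up"][i] + entry["down"][i] + entry["not_de"][i]
        == entry["total"][i]
        for i in range(2)
    )


for contrast, entry in counts.items():
    print(contrast, is_consistent(entry))
```

All four contrasts pass this check for both the p-value and the FDR based numbers.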

Input options

Read counts file: imported sam/bam/bed files

Conditions: WT, DEN_WT, Smyd3KO, DEN_Smyd3KO

Samples included: WT_BR1, WT_BR2, WT_BR3, DEN_WT_BR1, DEN_WT_BR2, DEN_WT_BR3, Smyd3KO_BR2, Smyd3KO_BR3, Smyd3KO_POOL, DEN_Smyd3KO_BR1, DEN_Smyd3KO_BR2, DEN_Smyd3KO_BR3

Samples excluded: none

Requested contrasts: DEN_Smyd3KO_vs_DEN_WT, DEN_WT_vs_WT, DEN_Smyd3KO_vs_Smyd3KO, Smyd3KO_vs_WT

Library sizes:
  • WT_BR1: 44072344
  • WT_BR2: 39630072
  • WT_BR3: 36126282
  • DEN_WT_BR1: 49725302
  • DEN_WT_BR2: 42656958
  • DEN_WT_BR3: 46112564
  • Smyd3KO_BR2: 35935169
  • Smyd3KO_BR3: 42407487
  • Smyd3KO_POOL: 40546769
  • DEN_Smyd3KO_BR1: 35900330
  • DEN_Smyd3KO_BR2: 42948349
  • DEN_Smyd3KO_BR3: 44376053

Organism: mouse (Mus musculus), genome version alias mm9

Annotation source: Ensembl genomes

Count type: exon

Exon filters: minActiveExons
  • minActiveExons
    • exonsPerGene: 5
    • minExons: 2
    • frac: 0.2
Gene filters: length, avgReads, expression, biotype
  • length
    • length: 100
  • avgReads
    • averagePerBp: 100
    • quantile: 0.25
  • expression
    • median: TRUE
    • mean: FALSE
    • quantile: NA
    • known: NA
    • custom: NA
  • biotype
    • pseudogene: FALSE
    • snRNA: FALSE
    • protein_coding: FALSE
    • antisense: FALSE
    • miRNA: FALSE
    • lincRNA: FALSE
    • snoRNA: FALSE
    • processed_transcript: FALSE
    • misc_RNA: FALSE
    • rRNA: TRUE
    • sense_overlapping: FALSE
    • sense_intronic: FALSE
    • polymorphic_pseudogene: FALSE
    • non_coding: FALSE
    • three_prime_overlapping_ncrna: FALSE
    • IG_C_gene: FALSE
    • IG_J_gene: FALSE
    • IG_D_gene: FALSE
    • IG_V_gene: FALSE
    • ncrna_host: FALSE
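
The minActiveExons settings listed above (exonsPerGene, minExons, frac) can be read as a rule on the number of exons that carry reads. The sketch below is a minimal Python illustration of that rule as it is worded in the Analysis summary, not the metaseqR2 implementation. It assumes that "read presence" means a count greater than zero and that E in the summary is the number of exons of the gene; the function name is hypothetical.

```python
import math


def passes_min_active_exons(exon_counts, exons_per_gene=5, min_exons=2,
                            frac=0.2):
    """Apply the exon filter rule literally as described in the summary.

    exon_counts holds the read counts of the exons of ONE gene.
    Assumption: an exon is "active" when its read count is > 0.
    Note: applied literally, a gene with fewer than min_exons exons can
    never pass; how such genes are really treated is not stated in the text.
    """
    n_exons = len(exon_counts)
    n_active = sum(1 for count in exon_counts if count > 0)
    if n_exons <= exons_per_gene:
        required = min_exons
    else:
        required = math.ceil(frac * n_exons)
    return n_active >= required


# A 4-exon gene with reads in 2 exons passes (2 required).
print(passes_min_active_exons([3, 0, 0, 7]))
# A 12-exon gene needs ceil(0.2 * 12) = 3 active exons, so 1 is not enough.
print(passes_min_active_exons([1] + [0] * 11))
```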

Filter application: after normalization

Normalization algorithm: DESeq

Normalization arguments: locfunc
  • locfunc: function (x, na.rm = FALSE, ...) UseMethod("median")

Statistical algorithm(s): DESeq, DESeq2, edgeR, NOISeq, limma, NBPSeq, ABSSeq, DSS

Statistical arguments for DESeq: method, sharingMode, fitType
  • method: blind
  • sharingMode: fit-only
  • fitType: local
Statistical arguments for DESeq2: tidy, fitType, maxit, quiet, modelMatrix, betaPrior, betaTol, useOptim, useT, useQR, lfcThreshold, altHypothesis, independentFiltering, alpha, pAdjustMethod, format, addMLE, parallel
  • tidy: FALSE
  • fitType: parametric
  • maxit: 100
  • quiet: FALSE
  • betaPrior: FALSE
  • betaTol: 1e-08
  • useOptim: TRUE
  • useT: FALSE
  • useQR: TRUE
  • lfcThreshold: 0
  • altHypothesis: greaterAbs
  • independentFiltering: TRUE
  • alpha: 0.1
  • pAdjustMethod: BH
  • format: DataFrame
  • addMLE: FALSE
  • parallel: FALSE
Statistical arguments for edgeR: main.method, rowsum.filter, prior.df, trend, span, tag.method, grid.length, grid.range, offset, glm.method, subset, AveLogCPM, trend.method, dispersion, offset, weights, lib.size, prior.count, start, method, test, abundance.trend, robust, winsor.tail.p
  • main.method: classic
  • rowsum.filter: 5
  • prior.df: 10
  • trend: movingave
  • tag.method: grid
  • grid.length: 11
  • grid.range: -6, 6
  • glm.method: CoxReid
  • subset: 10000
  • trend.method: auto
  • prior.count: 0.125
  • method: auto
  • test: chisq
  • abundance.trend: TRUE
  • robust: FALSE
  • winsor.tail.p: 0.05, 0.1
Statistical arguments for NOISeq: k, norm, replicates, factor, conditions, pnr, nss, v, lc, nclust, r, adj, a0per, filter, depth, cv.cutoff, cpm
  • k: 0.5
  • norm: n
  • replicates: biological
  • factor: class
  • pnr: 0.2
  • nss: 5
  • v: 0.02
  • lc: 1
  • nclust: 15
  • r: 100
  • adj: 1.5
  • a0per: 0.9
  • filter: 0
  • cv.cutoff: 500
  • cpm: 1
Statistical arguments for limma: normalize.method
  • normalize.method: none
Statistical arguments for NBPSeq: main.method, model, tests, alternative
  • main.method: nbsmyth
  • model: log-linear-rel-mean, NBP
  • tests: HOA
  • alternative: two.sided
Statistical arguments for ABSSeq: paired, minDispersion, minRates, maxRates, LevelstoNormFC, adjmethod, replaceOutliers, useaFold, quiet, lmodel, preval, qforkappa, scale
  • paired: FALSE
  • minRates: 0.1
  • maxRates: 0.3
  • LevelstoNormFC: 100
  • adjmethod: BH
  • replaceOutliers: TRUE
  • useaFold: FALSE
  • quiet: FALSE
  • lmodel: TRUE
  • preval: 0.05
  • qforkappa: 0
  • scale: FALSE
Statistical arguments for DSS: trend, equal.var
  • trend: FALSE
  • equal.var: FALSE

Meta-analysis method: PANDORA weighted p-value across results

Multiple testing correction: Benjamini-Hochberg FDR
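
For readers unfamiliar with the Benjamini-Hochberg procedure named above, the following is a minimal pure-Python sketch of how adjusted p-values are derived from raw p-values. It illustrates the procedure only and is not the code metaseqR2 runs; the function name is hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    # Indices of the p-values sorted in ascending order.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest to the smallest p-value so that the adjusted
    # values never increase as the raw p-value decreases.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


print(benjamini_hochberg([0.01, 0.04, 0.03, 0.005]))
```

A gene counts towards the numbers in parentheses in this report when its adjusted p-value is below the p-value threshold.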

p-value threshold: 0.05

Logarithmic transformation offset: 1
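
An offset of this kind is commonly added to values before taking logarithms, so that zero counts do not produce minus infinity. The Python sketch below shows the idea under the assumption that the offset is simply added to each value before the log2 transformation; exactly where metaseqR2 applies it is not stated here, and the function name is hypothetical.

```python
import math


def log2_with_offset(value, offset=1):
    """Return log2(value + offset); the offset keeps zero counts finite."""
    return math.log2(value + offset)


print(log2_with_offset(0))  # 0.0
print(log2_with_offset(7))  # 3.0
```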

Analysis preset: not available

Quality control plots: multidimensional scaling, biotype detection, biotype counts, sample and biotype saturation, filtered biotypes, correlation heatmap and correlogram, boxplots, GC-content bias, transcript length bias, mean-difference plot, mean-variance plot, RNA composition, DEG heatmap, volcano plot, DEG biotype detection

Figure format: png, pdf

Output directory: /home/panos/public_html/metaseqR2_showcase/metaseqR2_Smyd3_PANDORA

Output data: Annotation, p-value, Adjusted p-value (FDR), Combined p-value, Adjusted combined p-value (FDR), Fold change, Statistics, Read counts

Output scale(s): Natural scale, log2 scale, Reads per Gene Model

Output values: Normalized values

Output statistics: Mean

Total run time: 02 hours 21 minutes 54 seconds

Filtering

Filtered genes

Number of filtered genes: 19360 which is the union of

  • Filtered because of zero reads: 6412
  • Filtered because of exon filters: 9464 which is the union of
    • minActiveExons : 9464
  • Filtered because of gene filters: 12789 which is the union of
    • length: 793 genes with filter cutoff value 100
    • avgReads: 1704 genes with filter cutoff value 0.1037928
    • expression: 12559 genes further decomposed to (filter name, filtered genes, filter cutoff):
      • median: 12559 genes with filter cutoff value 14
    • biotype: 130 genes with filter cutoff value rRNA
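
Because the totals above are unions, a gene removed by more than one filter is counted only once. This is why 6412 + 9464 + 12789 = 28665 is larger than the reported 19360, and why 793 + 1704 + 12559 + 130 = 15186 is larger than 12789. A minimal Python sketch with made-up gene identifiers illustrates the difference between a union and a plain sum:

```python
# Hypothetical gene identifiers, for illustration only.
zero_reads = {"geneA", "geneB"}
exon_filtered = {"geneB", "geneC"}
gene_filtered = {"geneC", "geneD", "geneE"}

# A gene caught by several filters appears once in the union.
filtered = zero_reads | exon_filtered | gene_filtered
naive_sum = len(zero_reads) + len(exon_filtered) + len(gene_filtered)

print(len(filtered), naive_sum)  # 5 7
```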

Differential expression

Differentially expressed genes

Number of differentially expressed genes per contrast:
  • DEN_Smyd3KO_vs_DEN_WT: 8733 (6100) statistically significant genes of which 2279 (1937) up regulated, 1910 (1611) down regulated and 4544 (2552) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale. These numbers refer to the combined analysis performed by metaseqR2. Per statistical algorithm, the differentially expressed genes are:
    • DESeq: 2343 (660) statistically significant genes of which 1307 (404) up regulated, 943 (256) down regulated and 93 (0) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • DESeq2: 8334 (6904) statistically significant genes of which 2236 (2033) up regulated, 1880 (1708) down regulated and 4218 (3163) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • edgeR: 7474 (5810) statistically significant genes of which 2388 (2193) up regulated, 1866 (1627) down regulated and 3220 (1990) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • NOISeq: 11731 (8444) statistically significant genes of which 2557 (2340) up regulated, 2202 (2100) down regulated and 6972 (4004) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • limma: 7954 (5602) statistically significant genes of which 2341 (2008) up regulated, 1724 (1351) down regulated and 3889 (2243) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • NBPSeq: 7219 (5604) statistically significant genes of which 2396 (2198) up regulated, 2116 (1965) down regulated and 2707 (1441) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • ABSSeq: 5418 (4017) statistically significant genes of which 1839 (1556) up regulated, 1630 (1282) down regulated and 1949 (1179) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • DSS: 10779 (9923) statistically significant genes of which 1516 (1396) up regulated, 1300 (1182) down regulated and 7963 (7345) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
  • DEN_WT_vs_WT: 10043 (8016) statistically significant genes of which 2302 (2156) up regulated, 2961 (2638) down regulated and 4780 (3222) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale. These numbers refer to the combined analysis performed by metaseqR2. Per statistical algorithm, the differentially expressed genes are:
    • DESeq: 3165 (1581) statistically significant genes of which 1316 (639) up regulated, 1781 (942) down regulated and 68 (0) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • DESeq2: 9219 (8032) statistically significant genes of which 2230 (2097) up regulated, 2919 (2682) down regulated and 4070 (3253) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • edgeR: 9017 (7678) statistically significant genes of which 2372 (2300) up regulated, 2744 (2474) down regulated and 3901 (2904) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • NOISeq: 12683 (11164) statistically significant genes of which 2443 (2427) up regulated, 3298 (3290) down regulated and 6942 (5447) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • limma: 8468 (6477) statistically significant genes of which 2245 (2101) up regulated, 2409 (1890) down regulated and 3814 (2486) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • NBPSeq: 8189 (6834) statistically significant genes of which 2412 (2337) up regulated, 3149 (2939) down regulated and 2628 (1558) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • ABSSeq: 6024 (4661) statistically significant genes of which 1928 (1624) up regulated, 2368 (1962) down regulated and 1728 (1075) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • DSS: 11601 (10890) statistically significant genes of which 1540 (1464) up regulated, 2171 (1976) down regulated and 7890 (7450) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
  • DEN_Smyd3KO_vs_Smyd3KO: 4524 (1752) statistically significant genes of which 1060 (512) up regulated, 1478 (889) down regulated and 1986 (351) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale. These numbers refer to the combined analysis performed by metaseqR2. Per statistical algorithm, the differentially expressed genes are:
    • DESeq: 1175 (219) statistically significant genes of which 551 (164) up regulated, 580 (55) down regulated and 44 (0) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • DESeq2: 4914 (2899) statistically significant genes of which 1190 (781) up regulated, 1529 (1194) down regulated and 2195 (924) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • edgeR: 4324 (2451) statistically significant genes of which 1197 (800) up regulated, 1630 (1210) down regulated and 1497 (441) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • NOISeq: 7333 (489) statistically significant genes of which 1553 (202) up regulated, 1762 (197) down regulated and 4018 (90) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • limma: 4698 (1920) statistically significant genes of which 1031 (453) up regulated, 1532 (1009) down regulated and 2135 (458) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • NBPSeq: 4512 (2933) statistically significant genes of which 1368 (1047) up regulated, 1733 (1454) down regulated and 1411 (432) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • ABSSeq: 2897 (1667) statistically significant genes of which 653 (373) up regulated, 1337 (957) down regulated and 907 (337) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • DSS: 7521 (5756) statistically significant genes of which 670 (521) up regulated, 674 (527) down regulated and 6177 (4708) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
  • Smyd3KO_vs_WT: 9669 (7513) statistically significant genes of which 2298 (2106) up regulated, 2522 (2239) down regulated and 4849 (3168) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale. These numbers refer to the combined analysis performed by metaseqR2. Per statistical algorithm, the differentially expressed genes are:
    • DESeq: 2858 (1313) statistically significant genes of which 1145 (449) up regulated, 1600 (864) down regulated and 113 (0) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • DESeq2: 8628 (7234) statistically significant genes of which 2217 (2053) up regulated, 2526 (2323) down regulated and 3885 (2858) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • edgeR: 9653 (8522) statistically significant genes of which 2388 (2344) up regulated, 2193 (2001) down regulated and 5072 (4177) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • NOISeq: 12230 (10933) statistically significant genes of which 2413 (2403) up regulated, 2863 (2846) down regulated and 6954 (5684) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • limma: 9721 (8190) statistically significant genes of which 2343 (2260) up regulated, 2171 (1907) down regulated and 5207 (4023) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • NBPSeq: 7521 (6121) statistically significant genes of which 2371 (2257) up regulated, 2757 (2591) down regulated and 2393 (1273) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • ABSSeq: 5754 (4403) statistically significant genes of which 1946 (1636) up regulated, 2076 (1745) down regulated and 1732 (1022) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
    • DSS: 11019 (10203) statistically significant genes of which 1421 (1313) up regulated, 1732 (1594) down regulated and 7866 (7296) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
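
The three categories used throughout the list above follow from two cutoffs: the p-value (or FDR) threshold of 0.05 and the absolute fold change cutoff of 1 in log2 scale. The following minimal Python sketch shows one way to express that classification. Whether the boundaries are inclusive or exclusive is not stated in the report, so the comparisons below are an assumption, and the function name is hypothetical.

```python
def classify(p_value, log2_fold_change, p_cut=0.05, fc_cut=1.0):
    """Assign a gene to one of the categories used in the report.

    Assumptions: significance means p_value < p_cut, and a gene is up- or
    down-regulated when |log2 fold change| >= fc_cut.
    """
    if p_value >= p_cut:
        return "not significant"
    if log2_fold_change >= fc_cut:
        return "up regulated"
    if log2_fold_change <= -fc_cut:
        return "down regulated"
    return "not differentially expressed"


print(classify(0.001, 2.3))   # up regulated
print(classify(0.001, -1.7))  # down regulated
print(classify(0.001, 0.4))   # not differentially expressed
print(classify(0.2, 3.0))     # not significant
```

Passing adjusted p-values instead of raw p-values gives the classification behind the numbers in parentheses.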

Command

The differential expression analysis and this report were generated using the following command:

metaseqr2(
    sampleList = file.path(path2, "targets_show.txt"),
    contrast = c("DEN_Smyd3KO_vs_DEN_WT", "DEN_WT_vs_WT",
        "DEN_Smyd3KO_vs_Smyd3KO", "Smyd3KO_vs_WT"),
    org = "mm9",
    countType = "exon",
    normalization = "deseq",
    statistics = c("deseq", "deseq2", "edger", "noiseq", "limma",
        "nbpseq", "absseq", "dss"),
    metaP = "pandora",
    weight = weights,
    figFormat = c("png", "pdf"),
    exportWhere = file.path(exportPath, "metaseqR2_Smyd3_PANDORA"),
    restrictCores = 0.5,
    qcPlots = c("mds", "biodetection", "countsbio", "saturation",
        "readnoise", "filtered", "correl", "pairwise", "boxplot",
        "gcbias", "lengthbias", "meandiff", "meanvar", "rnacomp",
        "deheatmap", "volcano", "biodist", "mastat", "statvenn",
        "foldvenn", "deregulogram"),
    exonFilters = list(
        minActiveExons = list(exonsPerGene = 5, minExons = 2, frac = 1/5)
    ),
    geneFilters = list(
        length = list(length = 100),
        avgReads = list(averagePerBp = 100, quantile = 0.25),
        expression = list(median = TRUE, mean = FALSE, quantile = NA,
            known = NA, custom = NA),
        biotype = getDefaults("biotypeFilter", "mm9")
    ),
    pcut = 0.05,
    exportWhat = c("annotation", "p_value", "adj_p_value", "meta_p_value",
        "adj_meta_p_value", "fold_change", "stats", "counts", "flags"),
    exportScale = c("natural", "log2", "rpgm"),
    exportValues = "normalized",
    exportStats = "mean",
    exportCountsTable = TRUE,
    reportTop = 0.05,
    createTracks = TRUE,
    overwrite = TRUE,
    trackInfo = list(
        stranded = TRUE,
        normTo = 1e+09,
        urlBase = "http://epigenomics.fleming.gr/~panos/metaseqR2_showcase/metaseqR2_Smyd3_PANDORA/tracks",
        hubInfo = list(name = "Smyd3", shortLabel = "Smyd3 comparisons",
            longLabel = "Data from Sarris et al., 2016, PMID: 26908355",
            email = "moulos@fleming.gr")
    )
)
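
The command refers to three objects that are not defined anywhere in this report: `path2`, `exportPath` and `weights`. A minimal sketch of how they could be set up before the call is given below. The directory names are hypothetical placeholders, and the equal weights are an assumption, because the weights actually used for this run are not recorded here.

```r
# Hypothetical locations; replace with your own directories.
path2 <- file.path("path", "to", "targets")      # folder holding targets_show.txt
exportPath <- file.path("path", "to", "output")  # parent of the report folder

# The eight algorithms passed to the 'statistics' argument of the command.
statistics <- c("deseq", "deseq2", "edger", "noiseq", "limma",
    "nbpseq", "absseq", "dss")

# Assumption: equal PANDORA weights, one per algorithm, summing to 1.
weights <- rep(1 / length(statistics), length(statistics))
print(weights)
```

Weights derived from your own data or simulations would replace the equal weights in the last step.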


You can download the targets file from here

The following table summarizes the targets file used for the analysis. Do not forget to prepend the path to your BAM files in the filename column (also in the file that can be downloaded above).

samplename       filename              condition    paired  stranded
WT_BR1           WT_P90_BR1.bam        WT           single  no
WT_BR2           WT_P90_BR2.bam        WT           single  no
WT_BR3           WT_P90_BR3.bam        WT           single  no
DEN_WT_BR1       DEN_WT_BR1.bam        DEN_WT       single  forward
DEN_WT_BR2       DEN_WT_BR2.bam        DEN_WT       single  forward
DEN_WT_BR3       DEN_WT_BR3.bam        DEN_WT       single  forward
Smyd3KO_BR2      Smyd3KO_P90_BR2.bam   Smyd3KO      single  forward
Smyd3KO_BR3      Smyd3KO_P90_BR3.bam   Smyd3KO      single  forward
Smyd3KO_POOL     Smyd3KO_P90_POOL.bam  Smyd3KO      single  forward
DEN_Smyd3KO_BR1  DEN_Smyd3KO_BR1.bam   DEN_Smyd3KO  single  forward
DEN_Smyd3KO_BR2  DEN_Smyd3KO_BR2.bam   DEN_Smyd3KO  single  forward
DEN_Smyd3KO_BR3  DEN_Smyd3KO_BR3.bam   DEN_Smyd3KO  single  forward
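
Prepending the BAM directory to the filename column can be done in a few lines of base R. The sketch below assumes a tab-delimited targets file with the five columns shown above. It uses two rows copied from the table as inline text so that it runs on its own; `bamDir` and the output file name are hypothetical.

```r
# Two rows copied from the table above, tab-separated for illustration.
targetsText <- paste(
    "samplename\tfilename\tcondition\tpaired\tstranded",
    "WT_BR1\tWT_P90_BR1.bam\tWT\tsingle\tno",
    "DEN_WT_BR1\tDEN_WT_BR1.bam\tDEN_WT\tsingle\tforward",
    sep = "\n"
)
targets <- read.delim(text = targetsText, stringsAsFactors = FALSE)

# Hypothetical directory holding the BAM files.
bamDir <- "/data/bam"
targets$filename <- file.path(bamDir, targets$filename)

# Write the edited targets file to a temporary location.
outFile <- file.path(tempdir(), "targets_with_paths.txt")
write.table(targets, file = outFile, sep = "\t", quote = FALSE,
    row.names = FALSE)
print(targets$filename)
```

With a downloaded targets file, `read.delim(text = targetsText, ...)` would be replaced by `read.delim("targets_show.txt", ...)`, and the written file passed to `sampleList`.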

The above command generated the following log output:

INFO [2020-04-03 14:39:33] 2020-04-03 14:39:33: Data processing started…
INFO [2020-04-03 14:39:33] Read counts file: imported sam/bam/bed files
INFO [2020-04-03 14:39:33] Conditions: WT, DEN_WT, Smyd3KO, DEN_Smyd3KO
INFO [2020-04-03 14:39:33] Samples to include: WT_BR1, WT_BR2, WT_BR3, DEN_WT_BR1, DEN_WT_BR2, DEN_WT_BR3, Smyd3KO_BR2, Smyd3KO_BR3, Smyd3KO_POOL, DEN_Smyd3KO_BR1, DEN_Smyd3KO_BR2, DEN_Smyd3KO_BR3
INFO [2020-04-03 14:39:33] Samples to exclude: none
INFO [2020-04-03 14:39:33] Requested contrasts: DEN_Smyd3KO_vs_DEN_WT, DEN_WT_vs_WT, DEN_Smyd3KO_vs_Smyd3KO, Smyd3KO_vs_WT
INFO [2020-04-03 14:39:33] Organism: mm9
INFO [2020-04-03 14:39:33] Reference source: ensembl
INFO [2020-04-03 14:39:33] Count type: exon
INFO [2020-04-03 14:39:33] Transcriptional level: gene
INFO [2020-04-03 14:39:33] Exon filters: minActiveExons
INFO [2020-04-03 14:39:33] minActiveExons:
INFO [2020-04-03 14:39:33] exonsPerGene: 5
INFO [2020-04-03 14:39:33] minExons: 2
INFO [2020-04-03 14:39:33] frac: 0.2
INFO [2020-04-03 14:39:33] Gene filters: length, avgReads, expression, biotype
INFO [2020-04-03 14:39:33] length:
INFO [2020-04-03 14:39:33] length: 100
INFO [2020-04-03 14:39:33] avgReads:
INFO [2020-04-03 14:39:33] averagePerBp: 100
INFO [2020-04-03 14:39:33] quantile: 0.25
INFO [2020-04-03 14:39:33] expression:
INFO [2020-04-03 14:39:33] median: TRUE
INFO [2020-04-03 14:39:33] mean: FALSE
INFO [2020-04-03 14:39:33] quantile: NA
INFO [2020-04-03 14:39:33] known: NA
INFO [2020-04-03 14:39:33] custom: NA
INFO [2020-04-03 14:39:33] biotype:
INFO [2020-04-03 14:39:33] pseudogene: FALSE
INFO [2020-04-03 14:39:33] snRNA: FALSE
INFO [2020-04-03 14:39:33] protein_coding: FALSE
INFO [2020-04-03 14:39:33] antisense: FALSE
INFO [2020-04-03 14:39:33] miRNA: FALSE
INFO [2020-04-03 14:39:33] lincRNA: FALSE
INFO [2020-04-03 14:39:33] snoRNA: FALSE
INFO [2020-04-03 14:39:33] processed_transcript: FALSE
INFO [2020-04-03 14:39:33] misc_RNA: FALSE
INFO [2020-04-03 14:39:33] rRNA: TRUE
INFO [2020-04-03 14:39:33] sense_overlapping: FALSE
INFO [2020-04-03 14:39:33] sense_intronic: FALSE
INFO [2020-04-03 14:39:33] polymorphic_pseudogene: FALSE
INFO [2020-04-03 14:39:33] non_coding: FALSE
INFO [2020-04-03 14:39:33] three_prime_overlapping_ncrna: FALSE
INFO [2020-04-03 14:39:33] IG_C_gene: FALSE
INFO [2020-04-03 14:39:33] IG_J_gene: FALSE
INFO [2020-04-03 14:39:33] IG_D_gene: FALSE
INFO [2020-04-03 14:39:33] IG_V_gene: FALSE
INFO [2020-04-03 14:39:33] ncrna_host: FALSE
INFO [2020-04-03 14:39:33] Filter application: postnorm
INFO [2020-04-03 14:39:33] Normalization algorithm: deseq
INFO [2020-04-03 14:39:33] Normalization arguments:
INFO [2020-04-03 14:39:33] locfunc:
INFO [2020-04-03 14:39:33] [[list(function (x, na.rm = FALSE, ...) UseMethod("median"))locfunc
INFO [2020-04-03 14:39:33] Statistical algorithm: deseq, deseq2, edger, noiseq, limma, nbpseq, absseq, dss
INFO [2020-04-03 14:39:33] Statistical arguments:
INFO [2020-04-03 14:39:33] deseq: blind, fit-only, local
INFO [2020-04-03 14:39:33] deseq2: FALSE, parametric, 100, FALSE, NULL, FALSE, 1e-08, TRUE, FALSE, TRUE, 0, greaterAbs, TRUE, 0.1, BH, DataFrame, FALSE, FALSE
INFO [2020-04-03 14:39:33] edger: classic, 5, 10, movingave, NULL, grid, 11, c(-6, 6), NULL, CoxReid, 10000, NULL, auto, NULL, NULL, NULL, NULL, 0.125, NULL, auto, chisq, TRUE, FALSE, c(0.05, 0.1)
INFO [2020-04-03 14:39:33] noiseq: 0.5, n, biological, class, NULL, 0.2, 5, 0.02, 1, 15, 100, 1.5, 0.9, 0, NULL, 500, 1
INFO [2020-04-03 14:39:33] limma: none
INFO [2020-04-03 14:39:33] nbpseq: nbsmyth, list(nbpseq = "log-linear-rel-mean", nbsmyth = "NBP"), HOA, two.sided
INFO [2020-04-03 14:39:33] absseq: FALSE, NULL, 0.1, 0.3, 100, BH, TRUE, FALSE, FALSE, TRUE, 0.05, 0, FALSE
INFO [2020-04-03 14:39:33] dss: FALSE, FALSE
INFO [2020-04-03 14:39:33] Meta-analysis method: pandora
INFO [2020-04-03 14:39:33] Multiple testing correction: BH
INFO [2020-04-03 14:39:33] p-value threshold: 0.05
INFO [2020-04-03 14:39:33] Logarithmic transformation offset: 1
INFO [2020-04-03 14:39:33] Quality control plots: mds, biodetection, countsbio, saturation, readnoise, filtered, correl, pairwise, boxplot, gcbias, lengthbias, meandiff, meanvar, rnacomp, deheatmap, volcano, biodist, mastat, statvenn, foldvenn, deregulogram
INFO [2020-04-03 14:39:33] Figure format: png, pdf
INFO [2020-04-03 14:39:33] Output directory: /home/panos/public_html/metaseqR2_showcase/metaseqR2_Smyd3_PANDORA
INFO [2020-04-03 14:39:33] Output data: annotation, p_value, adj_p_value, meta_p_value, adj_meta_p_value, fold_change, stats, counts, flags
INFO [2020-04-03 14:39:33] Output scale(s): natural, log2, rpgm
INFO [2020-04-03 14:39:33] Output values: normalized
INFO [2020-04-03 14:39:33] Output statistics: mean
INFO [2020-04-03 14:39:33] Loading gene annotation…
INFO [2020-04-03 14:39:34] Loading exon annotation…
INFO [2020-04-03 14:39:40] Reading bam file WT_P90_BR1.bam for sample with name WT_BR1. This might take some time…
INFO [2020-04-03 14:39:40] Reading bam file WT_P90_BR2.bam for sample with name WT_BR2. This might take some time…
INFO [2020-04-03 14:39:40] Reading bam file WT_P90_BR3.bam for sample with name WT_BR3. This might take some time…
INFO [2020-04-03 14:39:40] Reading bam file DEN_WT_BR1.bam for sample with name DEN_WT_BR1. This might take some time…
INFO [2020-04-03 14:39:40] Reading bam file DEN_WT_BR2.bam for sample with name DEN_WT_BR2. This might take some time…
INFO [2020-04-03 14:39:40] Reading bam file DEN_WT_BR3.bam for sample with name DEN_WT_BR3. This might take some time…
INFO [2020-04-03 14:39:40] Reading bam file Smyd3KO_P90_BR2.bam for sample with name Smyd3KO_BR2. This might take some time…
INFO [2020-04-03 14:39:40] Reading bam file Smyd3KO_P90_BR3.bam for sample with name Smyd3KO_BR3. This might take some time…
INFO [2020-04-03 14:44:21] Counting reads overlapping with given annotation…
INFO [2020-04-03 14:44:21] …for single-end reads…
INFO [2020-04-03 14:44:21] …ignoring strandedness…
INFO [2020-04-03 14:44:35] Counting reads overlapping with given annotation…
INFO [2020-04-03 14:44:35] …for single-end reads…
INFO [2020-04-03 14:44:35] …ignoring strandedness…
INFO [2020-04-03 14:44:41] Counting reads overlapping with given annotation…
INFO [2020-04-03 14:44:41] …for single-end reads…
INFO [2020-04-03 14:44:41] …assuming forward sequenced reads…
INFO [2020-04-03 14:45:21] Counting reads overlapping with given annotation…
INFO [2020-04-03 14:45:21] …for single-end reads…
INFO [2020-04-03 14:45:21] …assuming forward sequenced reads…
INFO [2020-04-03 14:45:40] Counting reads overlapping with given annotation…
INFO [2020-04-03 14:45:40] …for single-end reads…
INFO [2020-04-03 14:45:40] …assuming forward sequenced reads…
INFO [2020-04-03 14:45:41] Counting reads overlapping with given annotation…
INFO [2020-04-03 14:45:41] …for single-end reads…
INFO [2020-04-03 14:45:41] …assuming forward sequenced reads…
INFO [2020-04-03 14:45:45] Counting reads overlapping with given annotation…
INFO [2020-04-03 14:45:45] …for single-end reads…
INFO [2020-04-03 14:45:45] …ignoring strandedness…
INFO [2020-04-03 14:46:15] Counting reads overlapping with given annotation…
INFO [2020-04-03 14:46:15] …for single-end reads…
INFO [2020-04-03 14:46:15] …assuming forward sequenced reads…
INFO [2020-04-03 14:52:49] Reading bam file DEN_Smyd3KO_BR1.bam for sample with name DEN_Smyd3KO_BR1. This might take some time…
INFO [2020-04-03 14:52:56] Reading bam file DEN_Smyd3KO_BR2.bam for sample with name DEN_Smyd3KO_BR2. This might take some time…
INFO [2020-04-03 14:54:37] Counting reads overlapping with given annotation…
INFO [2020-04-03 14:54:37] …for single-end reads…
INFO [2020-04-03 14:54:37] …assuming forward sequenced reads…
INFO [2020-04-03 14:55:01] Counting reads overlapping with given annotation…
INFO [2020-04-03 14:55:01] …for single-end reads…
INFO [2020-04-03 14:55:01] …assuming forward sequenced reads…
INFO [2020-04-03 14:58:22] Reading bam file Smyd3KO_P90_POOL.bam for sample with name Smyd3KO_POOL. This might take some time…
INFO [2020-04-03 14:59:12] Reading bam file DEN_Smyd3KO_BR3.bam for sample with name DEN_Smyd3KO_BR3. This might take some time…
INFO [2020-04-03 14:59:54] Counting reads overlapping with given annotation…
INFO [2020-04-03 14:59:54] …for single-end reads…
INFO [2020-04-03 14:59:54] …assuming forward sequenced reads…
INFO [2020-04-03 15:01:32] Counting reads overlapping with given annotation…
INFO [2020-04-03 15:01:32] …for single-end reads…
INFO [2020-04-03 15:01:32] …assuming forward sequenced reads…
INFO [2020-04-03 15:07:58] Finished counting!
INFO [2020-04-03 15:08:00] Exporting raw read counts table to /home/panos/public_html/metaseqR2_showcase/metaseqR2_Smyd3_PANDORA/lists/raw_counts_table.txt.gz
INFO [2020-04-03 15:08:12] Checking chromosomes in exon counts and gene annotation…
INFO [2020-04-03 15:08:13] Processing exons…
INFO [2020-04-03 15:08:13] Separating exons per gene for WT_BR1…
INFO [2020-04-03 15:08:13] Separating exons per gene for WT_BR2…
INFO [2020-04-03 15:08:13] Separating exons per gene for WT_BR3…
INFO [2020-04-03 15:08:13] Separating exons per gene for DEN_WT_BR1…
INFO [2020-04-03 15:08:13] Separating exons per gene for Smyd3KO_POOL…
INFO [2020-04-03 15:08:13] Separating exons per gene for DEN_WT_BR2…
INFO [2020-04-03 15:08:13] Separating exons per gene for DEN_WT_BR3…
INFO [2020-04-03 15:08:13] Separating exons per gene for DEN_Smyd3KO_BR1…
INFO [2020-04-03 15:08:13] Separating exons per gene for DEN_Smyd3KO_BR2…
INFO [2020-04-03 15:08:13] Separating exons per gene for Smyd3KO_BR2…
INFO [2020-04-03 15:08:14] Separating exons per gene for Smyd3KO_BR3…
INFO [2020-04-03 15:08:14] Separating exons per gene for DEN_Smyd3KO_BR3…
INFO [2020-04-03 15:08:19] Saving gene model to /home/panos/public_html/metaseqR2_showcase/metaseqR2_Smyd3_PANDORA/data/gene_model.RData
INFO [2020-04-03 15:08:31] Applying exon filter minActiveExons…
INFO [2020-04-03 15:08:31] Checking read presence in exons for WT_BR1…
INFO [2020-04-03 15:08:32] Checking read presence in exons for WT_BR2…
INFO [2020-04-03 15:08:33] Checking read presence in exons for WT_BR3…
INFO [2020-04-03 15:08:34] Checking read presence in exons for DEN_WT_BR1…
INFO [2020-04-03 15:08:35] Checking read presence in exons for DEN_WT_BR2…
INFO [2020-04-03 15:08:36] Checking read presence in exons for DEN_WT_BR3…
INFO [2020-04-03 15:08:37] Checking read presence in exons for Smyd3KO_BR2…
INFO [2020-04-03 15:08:38] Checking read presence in exons for Smyd3KO_BR3…
INFO [2020-04-03 15:08:39] Checking read presence in exons for Smyd3KO_POOL…
INFO [2020-04-03 15:08:40] Checking read presence in exons for DEN_Smyd3KO_BR1…
INFO [2020-04-03 15:08:41] Checking read presence in exons for DEN_Smyd3KO_BR2…
INFO [2020-04-03 15:08:42] Checking read presence in exons for DEN_Smyd3KO_BR3…
INFO [2020-04-03 15:08:43] Summarizing count data…
INFO [2020-04-03 15:08:44] Removing genes with zero counts in all samples…
INFO [2020-04-03 15:08:45] Normalizing with: deseq
INFO [2020-04-03 15:08:45] Applying gene filter length…
INFO [2020-04-03 15:08:45] Threshold below which ignored: 100
INFO [2020-04-03 15:08:45] Applying gene filter avgReads…
INFO [2020-04-03 15:08:46] Threshold below which ignored: 0.103792765308053
INFO [2020-04-03 15:08:46] Applying gene filter expression…
INFO [2020-04-03 15:08:46] Threshold below which ignored: 14
INFO [2020-04-03 15:08:46] Applying gene filter biotype…
INFO [2020-04-03 15:08:46] Biotypes ignored: rRNA
INFO [2020-04-03 15:08:47] 19360 genes filtered out
INFO [2020-04-03 15:08:47] 18223 genes remain after filtering
INFO [2020-04-03 15:08:47] Running statistical tests with: deseq
INFO [2020-04-03 15:08:50] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 15:16:08] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 15:23:44] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 15:28:49] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 15:34:23] Contrast DEN_Smyd3KO_vs_DEN_WT: found 2343 genes
INFO [2020-04-03 15:34:23] Contrast DEN_WT_vs_WT: found 3165 genes
INFO [2020-04-03 15:34:23] Contrast DEN_Smyd3KO_vs_Smyd3KO: found 1175 genes
INFO [2020-04-03 15:34:23] Contrast Smyd3KO_vs_WT: found 2858 genes
INFO [2020-04-03 15:34:23] Running statistical tests with: deseq2
INFO [2020-04-03 15:34:34] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 15:34:49] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 15:35:03] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 15:35:17] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 15:35:32] Contrast DEN_Smyd3KO_vs_DEN_WT: found 8334 genes
INFO [2020-04-03 15:35:32] Contrast DEN_WT_vs_WT: found 9219 genes
INFO [2020-04-03 15:35:32] Contrast DEN_Smyd3KO_vs_Smyd3KO: found 4914 genes
INFO [2020-04-03 15:35:32] Contrast Smyd3KO_vs_WT: found 8628 genes
INFO [2020-04-03 15:35:32] Running statistical tests with: edger
INFO [2020-04-03 15:35:42] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 15:35:46] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 15:35:49] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 15:35:52] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 15:35:55] Contrast DEN_Smyd3KO_vs_DEN_WT: found 7474 genes
INFO [2020-04-03 15:35:55] Contrast DEN_WT_vs_WT: found 9017 genes
INFO [2020-04-03 15:35:55] Contrast DEN_Smyd3KO_vs_Smyd3KO: found 4324 genes
INFO [2020-04-03 15:35:55] Contrast Smyd3KO_vs_WT: found 9653 genes
INFO [2020-04-03 15:35:55] Running statistical tests with: noiseq
INFO [2020-04-03 15:35:56] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 15:37:06] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 15:38:21] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 15:39:41] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 15:40:55] Contrast DEN_Smyd3KO_vs_DEN_WT: found 11731 genes
INFO [2020-04-03 15:40:55] Contrast DEN_WT_vs_WT: found 12683 genes
INFO [2020-04-03 15:40:55] Contrast DEN_Smyd3KO_vs_Smyd3KO: found 7333 genes
INFO [2020-04-03 15:40:55] Contrast Smyd3KO_vs_WT: found 12230 genes
INFO [2020-04-03 15:40:55] Running statistical tests with: limma
INFO [2020-04-03 15:40:55] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 15:40:59] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 15:41:02] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 15:41:05] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 15:41:08] Contrast DEN_Smyd3KO_vs_DEN_WT: found 7954 genes
INFO [2020-04-03 15:41:08] Contrast DEN_WT_vs_WT: found 8468 genes
INFO [2020-04-03 15:41:08] Contrast DEN_Smyd3KO_vs_Smyd3KO: found 4698 genes
INFO [2020-04-03 15:41:08] Contrast Smyd3KO_vs_WT: found 9721 genes
INFO [2020-04-03 15:41:08] Running statistical tests with: nbpseq
INFO [2020-04-03 15:41:08] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 15:45:28] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 15:50:30] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 15:54:41] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 15:59:27] Contrast DEN_Smyd3KO_vs_DEN_WT: found 7219 genes
INFO [2020-04-03 15:59:27] Contrast DEN_WT_vs_WT: found 8189 genes
INFO [2020-04-03 15:59:27] Contrast DEN_Smyd3KO_vs_Smyd3KO: found 4512 genes
INFO [2020-04-03 15:59:27] Contrast Smyd3KO_vs_WT: found 7521 genes
INFO [2020-04-03 15:59:27] Running statistical tests with: absseq
INFO [2020-04-03 15:59:27] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 15:59:38] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 15:59:49] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 16:00:01] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 16:00:12] Contrast DEN_Smyd3KO_vs_DEN_WT: found 5418 genes
INFO [2020-04-03 16:00:12] Contrast DEN_WT_vs_WT: found 6024 genes
INFO [2020-04-03 16:00:12] Contrast DEN_Smyd3KO_vs_Smyd3KO: found 2897 genes
INFO [2020-04-03 16:00:12] Contrast Smyd3KO_vs_WT: found 5754 genes
INFO [2020-04-03 16:00:12] Running statistical tests with: dss
INFO [2020-04-03 16:00:20] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 16:00:21] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 16:00:21] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 16:00:22] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 16:00:22] Contrast DEN_Smyd3KO_vs_DEN_WT: found 10779 genes
INFO [2020-04-03 16:00:22] Contrast DEN_WT_vs_WT: found 11601 genes
INFO [2020-04-03 16:00:22] Contrast DEN_Smyd3KO_vs_Smyd3KO: found 7521 genes
INFO [2020-04-03 16:00:22] Contrast Smyd3KO_vs_WT: found 11019 genes
INFO [2020-04-03 16:00:23] Exporting and compressing normalized read counts table to /home/panos/public_html/metaseqR2_showcase/metaseqR2_Smyd3_PANDORA/lists/normalized_counts_table.txt
INFO [2020-04-03 16:00:25] Performing meta-analysis with pandora
INFO [2020-04-03 16:00:30] Building output files…
INFO [2020-04-03 16:00:30] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 16:00:30] Adding non-filtered data…
INFO [2020-04-03 16:00:30] binding annotation…
INFO [2020-04-03 16:00:30] binding p-values…
INFO [2020-04-03 16:00:30] binding FDRs…
INFO [2020-04-03 16:00:30] binding meta p-values…
INFO [2020-04-03 16:00:30] binding adjusted meta p-values…
INFO [2020-04-03 16:00:30] binding natural normalized fold changes…
INFO [2020-04-03 16:00:30] binding log2 normalized fold changes…
INFO [2020-04-03 16:00:30] binding normalized mean counts…
INFO [2020-04-03 16:00:32] binding normalized mean counts…
INFO [2020-04-03 16:00:33] binding all normalized counts for DEN_Smyd3KO…
INFO [2020-04-03 16:00:33] binding all normalized counts for DEN_WT…
INFO [2020-04-03 16:00:33] binding filtering flags…
INFO [2020-04-03 16:00:33] Writing output…
INFO [2020-04-03 16:00:35] Adding filtered data…
INFO [2020-04-03 16:00:35] binding annotation…
INFO [2020-04-03 16:00:35] binding p-values…
INFO [2020-04-03 16:00:35] binding FDRs…
INFO [2020-04-03 16:00:35] binding meta p-values…
INFO [2020-04-03 16:00:35] binding adjusted meta p-values…
INFO [2020-04-03 16:00:36] binding natural normalized fold changes…
INFO [2020-04-03 16:00:36] binding log2 normalized fold changes…
INFO [2020-04-03 16:00:36] binding normalized mean counts…
INFO [2020-04-03 16:00:37] binding normalized mean counts…
INFO [2020-04-03 16:00:38] binding all normalized counts for DEN_Smyd3KO…
INFO [2020-04-03 16:00:38] binding all normalized counts for DEN_WT…
INFO [2020-04-03 16:00:38] binding filtering flags…
INFO [2020-04-03 16:00:40] Writing output…
INFO [2020-04-03 16:00:46] Adding report data…
INFO [2020-04-03 16:00:46] binding annotation…
INFO [2020-04-03 16:00:46] binding meta p-values…
INFO [2020-04-03 16:00:46] binding adjusted meta p-values…
INFO [2020-04-03 16:00:47] binding log2 normalized fold changes…
INFO [2020-04-03 16:00:47] binding normalized mean counts…
INFO [2020-04-03 16:00:47] binding normalized mean counts…
INFO [2020-04-03 16:00:48] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 16:00:48] Adding non-filtered data…
INFO [2020-04-03 16:00:48] binding annotation…
INFO [2020-04-03 16:00:48] binding p-values…
INFO [2020-04-03 16:00:48] binding FDRs…
INFO [2020-04-03 16:00:48] binding meta p-values…
INFO [2020-04-03 16:00:48] binding adjusted meta p-values…
INFO [2020-04-03 16:00:48] binding natural normalized fold changes…
INFO [2020-04-03 16:00:48] binding log2 normalized fold changes…
INFO [2020-04-03 16:00:48] binding normalized mean counts…
INFO [2020-04-03 16:00:49] binding normalized mean counts…
INFO [2020-04-03 16:00:50] binding all normalized counts for DEN_WT…
INFO [2020-04-03 16:00:50] binding all normalized counts for WT…
INFO [2020-04-03 16:00:50] binding filtering flags…
INFO [2020-04-03 16:00:50] Writing output…
INFO [2020-04-03 16:00:53] Adding filtered data…
INFO [2020-04-03 16:00:53] binding annotation…
INFO [2020-04-03 16:00:53] binding p-values…
INFO [2020-04-03 16:00:53] binding FDRs…
INFO [2020-04-03 16:00:53] binding meta p-values…
INFO [2020-04-03 16:00:53] binding adjusted meta p-values…
INFO [2020-04-03 16:00:53] binding natural normalized fold changes…
INFO [2020-04-03 16:00:53] binding log2 normalized fold changes…
INFO [2020-04-03 16:00:53] binding normalized mean counts…
INFO [2020-04-03 16:00:54] binding normalized mean counts…
INFO [2020-04-03 16:00:55] binding all normalized counts for DEN_WT…
INFO [2020-04-03 16:00:56] binding all normalized counts for WT…
INFO [2020-04-03 16:00:56] binding filtering flags…
INFO [2020-04-03 16:00:57] Writing output…
INFO [2020-04-03 16:01:04] Adding report data…
INFO [2020-04-03 16:01:04] binding annotation…
INFO [2020-04-03 16:01:04] binding meta p-values…
INFO [2020-04-03 16:01:04] binding adjusted meta p-values…
INFO [2020-04-03 16:01:04] binding log2 normalized fold changes…
INFO [2020-04-03 16:01:04] binding normalized mean counts…
INFO [2020-04-03 16:01:05] binding normalized mean counts…
INFO [2020-04-03 16:01:05] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 16:01:05] Adding non-filtered data…
INFO [2020-04-03 16:01:05] binding annotation…
INFO [2020-04-03 16:01:05] binding p-values…
INFO [2020-04-03 16:01:05] binding FDRs…
INFO [2020-04-03 16:01:05] binding meta p-values…
INFO [2020-04-03 16:01:05] binding adjusted meta p-values…
INFO [2020-04-03 16:01:05] binding natural normalized fold changes…
INFO [2020-04-03 16:01:05] binding log2 normalized fold changes…
INFO [2020-04-03 16:01:05] binding normalized mean counts…
INFO [2020-04-03 16:01:07] binding normalized mean counts…
INFO [2020-04-03 16:01:08] binding all normalized counts for DEN_Smyd3KO…
INFO [2020-04-03 16:01:08] binding all normalized counts for Smyd3KO…
INFO [2020-04-03 16:01:08] binding filtering flags…
INFO [2020-04-03 16:01:08] Writing output…
INFO [2020-04-03 16:01:09] Adding filtered data…
INFO [2020-04-03 16:01:09] binding annotation…
INFO [2020-04-03 16:01:09] binding p-values…
INFO [2020-04-03 16:01:09] binding FDRs…
INFO [2020-04-03 16:01:09] binding meta p-values…
INFO [2020-04-03 16:01:09] binding adjusted meta p-values…
INFO [2020-04-03 16:01:09] binding natural normalized fold changes…
INFO [2020-04-03 16:01:09] binding log2 normalized fold changes…
INFO [2020-04-03 16:01:09] binding normalized mean counts…
INFO [2020-04-03 16:01:10] binding normalized mean counts…
INFO [2020-04-03 16:01:12] binding all normalized counts for DEN_Smyd3KO…
INFO [2020-04-03 16:01:12] binding all normalized counts for Smyd3KO…
INFO [2020-04-03 16:01:12] binding filtering flags…
INFO [2020-04-03 16:01:13] Writing output…
INFO [2020-04-03 16:01:19] Adding report data…
INFO [2020-04-03 16:01:19] binding annotation…
INFO [2020-04-03 16:01:19] binding meta p-values…
INFO [2020-04-03 16:01:20] binding adjusted meta p-values…
INFO [2020-04-03 16:01:20] binding log2 normalized fold changes…
INFO [2020-04-03 16:01:20] binding normalized mean counts…
INFO [2020-04-03 16:01:20] binding normalized mean counts…
INFO [2020-04-03 16:01:21] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 16:01:21] Adding non-filtered data…
INFO [2020-04-03 16:01:21] binding annotation…
INFO [2020-04-03 16:01:21] binding p-values…
INFO [2020-04-03 16:01:21] binding FDRs…
INFO [2020-04-03 16:01:21] binding meta p-values…
INFO [2020-04-03 16:01:21] binding adjusted meta p-values…
INFO [2020-04-03 16:01:21] binding natural normalized fold changes…
INFO [2020-04-03 16:01:21] binding log2 normalized fold changes…
INFO [2020-04-03 16:01:21] binding normalized mean counts…
INFO [2020-04-03 16:01:22] binding normalized mean counts…
INFO [2020-04-03 16:01:23] binding all normalized counts for Smyd3KO…
INFO [2020-04-03 16:01:23] binding all normalized counts for WT…
INFO [2020-04-03 16:01:23] binding filtering flags…
INFO [2020-04-03 16:01:23] Writing output…
INFO [2020-04-03 16:01:26] Adding filtered data…
INFO [2020-04-03 16:01:26] binding annotation…
INFO [2020-04-03 16:01:26] binding p-values…
INFO [2020-04-03 16:01:26] binding FDRs…
INFO [2020-04-03 16:01:26] binding meta p-values…
INFO [2020-04-03 16:01:26] binding adjusted meta p-values…
INFO [2020-04-03 16:01:26] binding natural normalized fold changes…
INFO [2020-04-03 16:01:26] binding log2 normalized fold changes…
INFO [2020-04-03 16:01:26] binding normalized mean counts…
INFO [2020-04-03 16:01:27] binding normalized mean counts…
INFO [2020-04-03 16:01:28] binding all normalized counts for Smyd3KO…
INFO [2020-04-03 16:01:28] binding all normalized counts for WT…
INFO [2020-04-03 16:01:28] binding filtering flags…
INFO [2020-04-03 16:01:30] Writing output…
INFO [2020-04-03 16:01:36] Adding report data…
INFO [2020-04-03 16:01:36] binding annotation…
INFO [2020-04-03 16:01:36] binding meta p-values…
INFO [2020-04-03 16:01:36] binding adjusted meta p-values…
INFO [2020-04-03 16:01:37] binding log2 normalized fold changes…
INFO [2020-04-03 16:01:37] binding normalized mean counts…
INFO [2020-04-03 16:01:37] binding normalized mean counts…
WARN [2020-04-03 16:01:37] Pairwise sample comparison plot becomes indistinguishable for more than 6 samples! Removing from plots…
INFO [2020-04-03 16:01:37] Creating quality control graphs…
INFO [2020-04-03 16:01:37] Plotting in png format…
INFO [2020-04-03 16:01:37] Plotting mds…
INFO [2020-04-03 16:01:40] Plotting biodetection…
INFO [2020-04-03 16:01:43] Plotting countsbio…
INFO [2020-04-03 16:01:47] Plotting saturation…
INFO [2020-04-03 16:02:07] Plotting readnoise…
INFO [2020-04-03 16:02:11] Plotting correl…
INFO [2020-04-03 16:02:11] Plotting boxplot…
INFO [2020-04-03 16:02:12] Plotting gcbias…
INFO [2020-04-03 16:02:14] Plotting lengthbias…
INFO [2020-04-03 16:02:16] Plotting meandiff…
INFO [2020-04-03 16:02:21] Plotting meanvar…
INFO [2020-04-03 16:02:23] Plotting rnacomp…
INFO [2020-04-03 16:03:25] Plotting boxplot…
INFO [2020-04-03 16:03:26] Plotting gcbias…
INFO [2020-04-03 16:03:29] Plotting lengthbias…
INFO [2020-04-03 16:03:30] Plotting meandiff…
INFO [2020-04-03 16:03:35] Plotting meanvar…
INFO [2020-04-03 16:03:37] Plotting rnacomp…
INFO [2020-04-03 16:04:38] Plotting deheatmap…
INFO [2020-04-03 16:04:38] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 16:07:25] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 16:10:04] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 16:12:46] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 16:15:42] Plotting volcano…
INFO [2020-04-03 16:15:42] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 16:15:43] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 16:15:44] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 16:15:45] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 16:16:05] Plotting biodist…
INFO [2020-04-03 16:16:05] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 16:16:05] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 16:16:06] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 16:16:06] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 16:16:25] Plotting mastat…
INFO [2020-04-03 16:16:25] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 16:16:27] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 16:16:28] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 16:16:30] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 16:16:51] Plotting deregulogram…
INFO [2020-04-03 16:16:51] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 16:16:51] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 16:16:51] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 16:16:51] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 16:17:09] Plotting filtered…
INFO [2020-04-03 16:17:09] Plotting statvenn…
INFO [2020-04-03 16:17:09] Contrast: DEN_Smyd3KO_vs_DEN_WT
WARN [2020-04-03 16:17:09] Cannot create a Venn diagram for more than 5 result sets! 8 found, only the first 5 will be used…
INFO [2020-04-03 16:17:11] Contrast: DEN_WT_vs_WT
WARN [2020-04-03 16:17:11] Cannot create a Venn diagram for more than 5 result sets! 8 found, only the first 5 will be used…
INFO [2020-04-03 16:17:12] Contrast: DEN_Smyd3KO_vs_Smyd3KO
WARN [2020-04-03 16:17:12] Cannot create a Venn diagram for more than 5 result sets! 8 found, only the first 5 will be used…
INFO [2020-04-03 16:17:13] Contrast: Smyd3KO_vs_WT
WARN [2020-04-03 16:17:13] Cannot create a Venn diagram for more than 5 result sets! 8 found, only the first 5 will be used…
INFO [2020-04-03 16:17:15] Plotting foldvenn…
INFO [2020-04-03 16:17:15] Plotting in pdf format…
INFO [2020-04-03 16:17:15] Plotting mds…
INFO [2020-04-03 16:17:15] Plotting biodetection…
INFO [2020-04-03 16:17:17] Plotting countsbio…
INFO [2020-04-03 16:17:19] Plotting saturation…
INFO [2020-04-03 16:17:35] Plotting readnoise…
INFO [2020-04-03 16:17:40] Plotting correl…
INFO [2020-04-03 16:17:40] Plotting boxplot…
INFO [2020-04-03 16:17:40] Plotting gcbias…
INFO [2020-04-03 16:17:44] Plotting lengthbias…
INFO [2020-04-03 16:17:46] Plotting meandiff…
INFO [2020-04-03 16:17:52] Plotting meanvar…
INFO [2020-04-03 16:17:54] Plotting rnacomp…
INFO [2020-04-03 16:18:53] Plotting boxplot…
INFO [2020-04-03 16:18:54] Plotting gcbias…
INFO [2020-04-03 16:18:57] Plotting lengthbias…
INFO [2020-04-03 16:19:00] Plotting meandiff…
INFO [2020-04-03 16:19:06] Plotting meanvar…
INFO [2020-04-03 16:19:08] Plotting rnacomp…
INFO [2020-04-03 16:20:06] Plotting deheatmap…
INFO [2020-04-03 16:20:06] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 16:22:44] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 16:25:23] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 16:28:08] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 16:31:07] Plotting volcano…
INFO [2020-04-03 16:31:07] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 16:31:07] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 16:31:08] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 16:31:09] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 16:31:27] Plotting biodist…
INFO [2020-04-03 16:31:27] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 16:31:28] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 16:31:28] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 16:31:28] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 16:31:47] Plotting mastat…
INFO [2020-04-03 16:31:47] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 16:31:48] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 16:31:49] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 16:31:50] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 16:32:06] Plotting deregulogram…
INFO [2020-04-03 16:32:06] Contrast: DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 16:32:06] Contrast: DEN_WT_vs_WT
INFO [2020-04-03 16:32:06] Contrast: DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 16:32:06] Contrast: Smyd3KO_vs_WT
INFO [2020-04-03 16:32:21] Plotting filtered…
INFO [2020-04-03 16:32:21] Plotting statvenn…
INFO [2020-04-03 16:32:21] Contrast: DEN_Smyd3KO_vs_DEN_WT
WARN [2020-04-03 16:32:21] Cannot create a Venn diagram for more than 5 result sets! 8 found, only the first 5 will be used…
INFO [2020-04-03 16:32:22] Contrast: DEN_WT_vs_WT
WARN [2020-04-03 16:32:22] Cannot create a Venn diagram for more than 5 result sets! 8 found, only the first 5 will be used…
INFO [2020-04-03 16:32:24] Contrast: DEN_Smyd3KO_vs_Smyd3KO
WARN [2020-04-03 16:32:24] Cannot create a Venn diagram for more than 5 result sets! 8 found, only the first 5 will be used…
INFO [2020-04-03 16:32:25] Contrast: Smyd3KO_vs_WT
WARN [2020-04-03 16:32:25] Cannot create a Venn diagram for more than 5 result sets! 8 found, only the first 5 will be used…
INFO [2020-04-03 16:32:26] Plotting foldvenn…
INFO [2020-04-03 16:54:48] Importing mds…
INFO [2020-04-03 16:54:49] Importing biodetection…
INFO [2020-04-03 16:54:50] Importing countsbio…
INFO [2020-04-03 16:55:25] Importing saturation…
INFO [2020-04-03 16:55:42] Importing readnoise…
INFO [2020-04-03 16:55:46] Importing filtered…
INFO [2020-04-03 16:55:47] Importing boxplot…
INFO [2020-04-03 16:55:47] Importing gcbias…
INFO [2020-04-03 16:55:54] Importing lengthbias…
INFO [2020-04-03 16:56:01] Importing meandif…
INFO [2020-04-03 16:57:07] Importing meanvar…
INFO [2020-04-03 16:57:15] Importing rnacomp…
INFO [2020-04-03 16:59:20] Importing volcano
INFO [2020-04-03 16:59:21] DEN_Smyd3KO_vs_DEN_WT DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 16:59:27] DEN_WT_vs_WT DEN_WT_vs_WT
INFO [2020-04-03 16:59:34] DEN_Smyd3KO_vs_Smyd3KO DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 16:59:41] Smyd3KO_vs_WT Smyd3KO_vs_WT
INFO [2020-04-03 16:59:48] Importing mastat
INFO [2020-04-03 16:59:49] DEN_Smyd3KO_vs_DEN_WT DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 16:59:58] DEN_WT_vs_WT DEN_WT_vs_WT
INFO [2020-04-03 17:00:07] DEN_Smyd3KO_vs_Smyd3KO DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 17:00:16] Smyd3KO_vs_WT Smyd3KO_vs_WT
INFO [2020-04-03 17:00:26] Importing biodist
INFO [2020-04-03 17:00:26] DEN_Smyd3KO_vs_DEN_WT
INFO [2020-04-03 17:00:26] DEN_WT_vs_WT
INFO [2020-04-03 17:00:26] DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 17:00:27] Smyd3KO_vs_WT
INFO [2020-04-03 17:00:27] Importing statvenn
INFO [2020-04-03 17:00:27] DEN_Smyd3KO_vs_DEN_WT
WARN [2020-04-03 17:00:28] Cannot create a JVenn diagram for more than 6 result sets! 8 found, only the first 6 will be used…
WARN [2020-04-03 17:00:28] Cannot create a JVenn diagram for more than 6 result sets! 8 found, only the first 6 will be used…
WARN [2020-04-03 17:00:28] Cannot create a JVenn diagram for more than 6 result sets! 8 found, only the first 6 will be used…
INFO [2020-04-03 17:00:28] DEN_WT_vs_WT
WARN [2020-04-03 17:00:28] Cannot create a JVenn diagram for more than 6 result sets! 8 found, only the first 6 will be used…
WARN [2020-04-03 17:00:28] Cannot create a JVenn diagram for more than 6 result sets! 8 found, only the first 6 will be used…
WARN [2020-04-03 17:00:28] Cannot create a JVenn diagram for more than 6 result sets! 8 found, only the first 6 will be used…
INFO [2020-04-03 17:00:28] DEN_Smyd3KO_vs_Smyd3KO
WARN [2020-04-03 17:00:29] Cannot create a JVenn diagram for more than 6 result sets! 8 found, only the first 6 will be used…
WARN [2020-04-03 17:00:29] Cannot create a JVenn diagram for more than 6 result sets! 8 found, only the first 6 will be used…
WARN [2020-04-03 17:00:29] Cannot create a JVenn diagram for more than 6 result sets! 8 found, only the first 6 will be used…
INFO [2020-04-03 17:00:29] Smyd3KO_vs_WT
WARN [2020-04-03 17:00:29] Cannot create a JVenn diagram for more than 6 result sets! 8 found, only the first 6 will be used…
WARN [2020-04-03 17:00:29] Cannot create a JVenn diagram for more than 6 result sets! 8 found, only the first 6 will be used…
WARN [2020-04-03 17:00:30] Cannot create a JVenn diagram for more than 6 result sets! 8 found, only the first 6 will be used…
INFO [2020-04-03 17:00:30] Importing foldvenn
INFO [2020-04-03 17:00:32] deseq
INFO [2020-04-03 17:00:32] deseq2
INFO [2020-04-03 17:00:32] edger
INFO [2020-04-03 17:00:32] noiseq
INFO [2020-04-03 17:00:32] limma
INFO [2020-04-03 17:00:32] nbpseq
INFO [2020-04-03 17:00:32] absseq
INFO [2020-04-03 17:00:32] dss
INFO [2020-04-03 17:00:32] pandora
INFO [2020-04-03 17:00:32] Importing deregulogram
INFO [2020-04-03 17:00:32] DEN_Smyd3KO_vs_DEN_WT and DEN_WT_vs_WT
INFO [2020-04-03 17:00:39] DEN_Smyd3KO_vs_DEN_WT and DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 17:00:45] DEN_Smyd3KO_vs_DEN_WT and Smyd3KO_vs_WT
INFO [2020-04-03 17:00:55] DEN_WT_vs_WT and DEN_Smyd3KO_vs_Smyd3KO
INFO [2020-04-03 17:01:02] DEN_WT_vs_WT and Smyd3KO_vs_WT
INFO [2020-04-03 17:01:09] DEN_Smyd3KO_vs_Smyd3KO and Smyd3KO_vs_WT
INFO [2020-04-03 17:01:15] Writing plot database in /home/panos/public_html/metaseqR2_showcase/metaseqR2_Smyd3_PANDORA/data/reportdb.js
INFO [2020-04-03 17:01:24] Creating HTML report…
INFO [2020-04-03 17:01:24] Compressing figures…
INFO [2020-04-03 17:01:28] Downloading required JavaScript libraries…


Tracks

You can use this link to load a UCSC Genome Browser session with the tracks derived from this analysis. If stranded mode was chosen, a trackhub will be loaded, otherwise, simple tracks will be loaded.

You can download individual bigWig files, one for each sample, using the following list:

Plus (+) strand
WT_BR1
WT_BR2
WT_BR3
DEN_WT_BR1
DEN_WT_BR2
DEN_WT_BR3
Smyd3KO_BR2
Smyd3KO_BR3
Smyd3KO_POOL
DEN_Smyd3KO_BR1
DEN_Smyd3KO_BR2
DEN_Smyd3KO_BR3

Minus (-) strand
WT_BR1
WT_BR2
WT_BR3
DEN_WT_BR1
DEN_WT_BR2
DEN_WT_BR3
Smyd3KO_BR2
Smyd3KO_BR3
Smyd3KO_POOL
DEN_Smyd3KO_BR1
DEN_Smyd3KO_BR2
DEN_Smyd3KO_BR3

Quality control

Quality control figures

The following figures summarize the quality control steps and assessment performed by the metaseqr2 pipeline. Each figure category is accompanied by an explanatory text. All figures are interactive with additional controls on the top right of the figure.

MDS

Multidimensional scaling

Multidimensional Scaling (MDS) plots constitute a means of visualizing the level of similarity of individual cases of a dataset. It is similar to Principal Component Analysis (PCA), but instead of using the covariance matrix to find similarities between cases, MDS uses absolute distance metrics such as the classical Euclidean distance. Because of the relative linear relations between sequencing samples, it provides a more realistic clustering of samples. MDS serves quality control and it can be interpreted as follows: when the distance between samples of the same biological condition in the MDS space is small, this is an indication of high correlation and reproducibility between them. When this distance is larger or heterogeneous (e.g. the 3rd sample of a triplicate set is further from the other 2), this constitutes an indication of low correlation and reproducibility between samples. It can help exclude poor samples from further analysis.
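As a minimal sketch of the distance computation that underlies an MDS plot (not the metaseqR2 implementation), the following Python snippet computes pairwise Euclidean distances between samples from log2-transformed counts. The sample names, the counts and the pseudocount of 1 are assumptions made for illustration only.

```python
import math

def euclidean_distances(samples):
    """Return {(a, b): distance} for every unordered pair of samples.

    `samples` maps a sample name to a list of raw read counts (one per gene).
    Counts are log2-transformed with a pseudocount of 1 before measuring distance.
    """
    logged = {name: [math.log2(c + 1) for c in counts]
              for name, counts in samples.items()}
    names = sorted(logged)
    dist = {}
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            dist[(a, b)] = math.dist(logged[a], logged[b])
    return dist

# Hypothetical toy counts: two replicates of condition A, one sample of condition B.
toy = {
    "A_rep1": [10, 200, 35, 0],
    "A_rep2": [12, 190, 40, 1],
    "B_rep1": [90, 20, 300, 15],
}
distances = euclidean_distances(toy)
```

In this toy example the two replicates of condition A end up closer to each other than to the sample of condition B, which is the pattern one hopes to see in the MDS space.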

Biodetection

Biotype detection

The biotype detection bar diagrams are a set of quality control charts that show the percentage of each biotype in the genome (i.e. in the whole set of features provided, for example, protein coding genes, non coding RNAs or pseudogenes) in red bars, the proportion of which has been detected in a sample before normalization and after basic filtering by removing features with zero counts in green bars, and the percentage of each biotype within the sample in blue bars. The difference between red bars and blue bars is that red bars show the percentage of a feature in the genome while blue bars show the percentage in the sample. Thus, the blue bars may sometimes be higher than the green bars because certain features (e.g. protein coding genes) may be detected within a sample with a higher proportion relative to their presence in the genome, as compared with other features. For example, while the percentage of protein coding genes in the whole genome is already higher than other biotypes, this percentage is expected to be even higher in an RNA-Seq experiment where one expects protein-coding genes to exhibit greater abundance. The vertical line separates the most abundant (yellow band) biotypes (on the left-hand side, corresponding to the left axis scale) from the rest (on the right-hand side, corresponding to the right axis scale, red band). Otherwise, lower abundance biotypes would be indistinguishable. Unexpected outcomes in this quality control chart (e.g. very low detection of protein coding genes) would signify possible low quality of a sample.

Select a sample to display plot for

Biocounts

Biotype representation

Biotype detection counts boxplots are a set of quality control charts that depict both the biological classification for the detected features and the actual distribution of the read counts for each biological type. The boxplot comprises a means of summarizing the read counts distribution of a sample in the form of a bar with extending lines, as a commonly used way of graphically presenting groups of numerical data. A boxplot also indicates which observations, if any, might be considered outliers and is able to visually show different types of populations, without making any assumptions about the underlying statistical distribution. The spacing between the different parts of the box helps indicate variance and skewness and identify outliers. The thick bar inside the colored box is the median of the observations while the box extends over the Interquartile Range (IQR) of the observations. The whiskers extend up (down) to +/-1.5xIQR. Unexpected outcomes (e.g. protein coding read count distribution similar to pseudogene read count distribution) indicate poor sample quality.

Biotypes within samples

Select a sample to display plot for

Biotype representation across samples

Select a biotype to display plot for

Saturation

Biotype representation

Read and biotype saturation plots are a set of quality control charts that depict the read count saturation levels at several sequencing depths. Thus, they comprise a means of assessing whether the sequencing depth of an RNA-Seq experiment is sufficient in order to detect the biological features under investigation. These quality control charts are separated into two subgroups: the first (read saturation per biotype for all samples) is a set of plots, one for each biological feature (e.g. protein coding, pseudogene, lincRNA, etc.), that depict the number of detected features at different sequencing depths and for all samples in the same plot. The second subgroup (read saturation per sample for all biotypes) is a set of plots similar to the above, but with one pair of plots with two panels for each sample, presenting all biological features. The left panel depicts the saturation levels for the less abundant features, while the right panel depicts the saturation for the more abundant features, as placing them all together would make the less abundant features indistinguishable. All the saturation plots should be interpreted as follows: if the read counts for a biotype tend to be saturated, the respective curve should tend to reach a plateau at higher depths. Otherwise, more sequencing is needed for the specific biotype.
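The idea behind a saturation curve can be sketched in a few lines of Python. This is an illustrative simulation and not the procedure used by the pipeline: the counts are made up, and depths are simulated by shuffling the reads once and taking nested prefixes, so that every depth contains the reads of the previous one.

```python
import random

def saturation_curve(counts, fractions, seed=0):
    """Number of detected features (>= 1 read) at increasing sequencing depths.

    `counts` maps feature name -> read count at full depth. Reads are shuffled
    once and nested prefixes are taken, so each depth contains the previous one.
    """
    reads = [feature for feature, n in counts.items() for _ in range(n)]
    random.Random(seed).shuffle(reads)
    curve = []
    for frac in fractions:
        depth = int(round(frac * len(reads)))
        curve.append((depth, len(set(reads[:depth]))))
    return curve

# Hypothetical counts for the features of one biotype in one sample.
toy_counts = {"g1": 500, "g2": 120, "g3": 30, "g4": 5, "g5": 1, "g6": 0}
curve = saturation_curve(toy_counts, [0.1, 0.25, 0.5, 0.75, 1.0])
```

Each element of `curve` is a (depth, detected features) pair. A curve that stops rising well before full depth suggests saturation, whereas a curve that is still rising at full depth suggests that deeper sequencing would detect more features.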

Read saturation per biotype for all samples

Select a sample to display plot for

Read saturation per sample for all biotypes

Select a biotype to display plot for

Reads noise

RNA-Seq reads noise

The read noise plots depict the percentage of biological features detected when subsampling the total number of reads. Very steep curves in read noise plots indicate that although the sequencing depth reaches its maximum, a relatively small percentage of total features is detected, indicating that the level of background noise is relatively high. Less steep read noise curves indicate less noise. When a sample’s curve deviates from the rest, it may indicate lower or higher quality, depending on the curves of the rest of the samples.

Correlation

Pairwise sample correlations

Sample correlation plots depict the accordance of RNA-Seq samples, as this is manifested through the read counts table used with the metaseqR2 pipeline, through a representation based on the correlation matrix (a matrix holding the pairwise correlations between all pairs of samples) of the read counts matrix. The correlation representation is a clustered heatmap which depicts the correlations of samples as color-scaled images, and the hierarchical clustering tree depicts the grouping of the samples according to their correlation. If samples from the same group are not clustered together, this provides an indication that there might be a quality problem with the dataset.
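A correlation matrix of the kind described above can be sketched as follows. The report does not state which correlation coefficient is used, so Pearson correlation is an assumption here, and the sample names and counts are hypothetical. The hierarchical clustering step is not shown.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equally long numeric lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def correlation_matrix(samples):
    """Nested dict holding the correlation of every sample with every other."""
    names = list(samples)
    return {a: {b: pearson(samples[a], samples[b]) for b in names} for a in names}

# Hypothetical read counts for five genes in three samples.
toy_samples = {
    "A_rep1": [10, 200, 35, 0, 50],
    "A_rep2": [12, 190, 40, 1, 55],
    "B_rep1": [90, 20, 300, 15, 5],
}
cm = correlation_matrix(toy_samples)
```

In this toy matrix the two replicates of condition A correlate more strongly with each other than with the sample of condition B, so a clustering based on it would group them together.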

Filtered

Chromosome and biotype distribution of filtered genes

The chromosome and biotype distribution of filtered genes is a quality control chart with two rows and four panels: on the left panel of the first row, the bar chart depicts the numbers of filtered genes per chromosome (actual numbers shown above the bars). On the right panel of the first row, the bar chart depicts the numbers of filtered genes per biotype (actual numbers shown above the bars). On the left panel of the second row, the bar chart depicts the fraction of filtered genes to the total genes per chromosome (actual percentages shown above the bars). On the right panel of the second row, the bar chart depicts the fraction of the filtered genes to the total genes per biotype (actual percentages shown above the bars). This plot should indicate possible quality problems when for example the filtered genes for a specific chromosome (or the fraction) is much higher than the rest. Generally, the fractions per chromosome should be uniform and the fractions per biotype should be proportional to the biotype fraction relative to the genome.
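The numbers and fractions shown in the per-chromosome panels can be sketched as below. The gene identifiers, chromosome assignments and the set of filtered genes are hypothetical, and the same computation would apply to biotypes instead of chromosomes.

```python
from collections import Counter

def filtered_fractions(all_genes, filtered_ids):
    """Per chromosome: (number of filtered genes, fraction of filtered to total genes).

    `all_genes` maps gene id -> chromosome; `filtered_ids` is the set of gene ids
    removed by the filters.
    """
    total = Counter(all_genes.values())
    removed = Counter(chrom for gene, chrom in all_genes.items()
                      if gene in filtered_ids)
    return {chrom: (removed[chrom], removed[chrom] / total[chrom])
            for chrom in sorted(total)}

# Hypothetical annotation with six genes on two chromosomes.
toy_annotation = {"g1": "chr1", "g2": "chr1", "g3": "chr1", "g4": "chr1",
                  "g5": "chr2", "g6": "chr2"}
fractions = filtered_fractions(toy_annotation, {"g1", "g5"})
```

Here both chromosomes lose one gene, but the fraction is twice as high for `chr2` because it holds fewer genes, which is why the report shows both the numbers and the fractions.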

Chromosome distribution of filtered genes

Biotype distribution of filtered genes

Normalization

Normalization assessment figures

The following figures allow for the assessment of the normalization procedures performed by the metaseqr2 pipeline. Each figure category is accompanied by an explanatory text. All figures are interactive with additional controls on the top right corner of the figure.

Boxplots

Boxplots

The boxplot comprises a means of summarizing the read counts distribution of a sample in the form of a bar with extending lines, as a commonly used way of graphically presenting groups of numerical data. A boxplot also indicates which observations, if any, might be considered outliers and is able to visually show different types of populations, without making any assumptions about the underlying statistical distribution. The spacings between the different parts of the box help indicate variance, skewness and identify outliers. The thick bar inside the colored box is the median of the observations while the box extends over the Interquartile Range of the observations. The whiskers extend up (down) to +/-1.5xIQR. Similar boxplots indicate good quality of normalization. If boxplots remain dissimilar after normalization, another normalization algorithm may have to be examined. The un-normalized boxplots show the need for data normalization in order for the data from different samples to follow the same underlying distribution and statistical testing to become possible.
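The quantities drawn in a boxplot can be sketched with the Python standard library. The snippet assumes the common convention in which whiskers end at the most extreme observation lying within 1.5 x IQR of the box, and it uses the "inclusive" quantile method; plotting libraries may compute quartiles slightly differently. The input values are made up.

```python
import statistics

def boxplot_stats(values):
    """Median, quartiles, IQR, whisker ends and outliers (1.5 x IQR rule)."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    # Whiskers stop at the most extreme observations inside the fences.
    inside = [v for v in values if low_fence <= v <= high_fence]
    return {
        "median": median, "q1": q1, "q3": q3, "iqr": iqr,
        "whisker_low": min(inside), "whisker_high": max(inside),
        "outliers": sorted(v for v in values if v < low_fence or v > high_fence),
    }

# Hypothetical values with one obvious outlier.
stats = boxplot_stats([1, 2, 3, 4, 5, 6, 7, 8, 9, 100])
```

With these values the observation 100 falls beyond the upper fence and is reported as an outlier, while the upper whisker stops at 9.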



GC bias

GC bias assessment plots

The GC-content bias plot is a quality control chart that shows the possible dependence of the read counts (in log2 scale) under a gene on the GC content percentage of that gene. In order for the statistical tests to be able to detect statistical significance which occurs due to real biological effects and not through other systematic biases present in the data (e.g. possible GC-content bias), the latter should be accounted for by the applied normalization algorithm. Since the tests are performed for each gene across biological conditions, one could assume that the GC content does not represent a bias, as it is the same for the tested gene across samples and conditions. However, Risso et al. (2011) showed that GC-content could have an impact on the statistical testing procedure. The GC-content bias plot depicts the dependence of the read counts on the GC content before and after normalization. The smoothing lines for each sample should be as ‘straight’ as possible after normalization. In addition, if the smoothing lines differ significantly between biological conditions, this would constitute a possible quality warning.



Length bias

Length bias assessment plots

The gene/transcript length bias plot is a quality control chart that shows the possible dependence of read counts (in log2 scale) under a gene on the length of that gene (whole gene or sum of exons depending on the analysis). In order for the statistical tests to be able to detect statistical significance which occurs due to real biological effects and not through other systematic biases present in the data (e.g. possible length bias), the latter should be accounted for by the applied normalization algorithm. Since the tests are performed for each gene across biological conditions, one could assume that the gene length does not represent a bias, as it is the same for the tested gene across samples and conditions. However, it has been shown in several studies that gene length could have an impact on the statistical testing procedure. The length bias plot depicts the dependence of the read counts on the gene/transcript length before and after normalization. The smoothing lines for each sample should be as ‘straight’ as possible after normalization. In addition, if the smoothing lines differ significantly between biological conditions, this would constitute a possible quality warning.



Mean-Difference

Mean-difference plots for normalization assessment

A mean-difference plot (or a Bland-Altman plot) is a method of data plotting used in analyzing the agreement between two different assays/variables. In this graphical method the differences (or alternatively the ratios) between the two variables are plotted against the averages of the two. Such a plot is useful, for example, for analyzing data with strong correlation between the x and y axes, when the (x,y) dots on the plot are close to the diagonal x=y. In this case, the value of the transformed variable X is approximately the same as x and y and variable Y shows the difference between x and y. When the data cloud in a mean difference plot is centered around the horizontal zero line, this is an indication of good data quality and good normalization results. On the other hand, when the data cloud deviates from the center line or has a ‘banana’ shape, this constitutes an indication of systematic biases present in the data and that either the chosen normalization algorithm has not worked well, or that data are not normalized. The smoothing curve that traverses the data (red curve) summarizes the above trends.
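The coordinates of a mean-difference plot for two samples can be sketched as follows, using log2 counts with a pseudocount of 1 (the pseudocount and the toy counts are assumptions for illustration). The horizontal coordinate is the average of the two log2 values and the vertical coordinate is their difference.

```python
import math

def mean_difference(x, y, pseudocount=1.0):
    """Return (A, M) pairs: A is the mean log2 signal, M is the log2 difference."""
    points = []
    for a, b in zip(x, y):
        la, lb = math.log2(a + pseudocount), math.log2(b + pseudocount)
        points.append(((la + lb) / 2.0, la - lb))
    return points

# Hypothetical counts: sample_2 was sequenced about twice as deeply as sample_1.
sample_1 = [7, 15, 31, 63, 127]
sample_2 = [15, 31, 63, 127, 255]
points = mean_difference(sample_1, sample_2)
```

In this toy case every difference equals -1, so the data cloud sits one unit below the zero line. Such a systematic shift is the kind of pattern that normalization is expected to remove.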

Select a pair to display plots for

Mean-Variance

Mean-variance plot for normalization assessment

The mean-variance plot comprises a graphical means of displaying a possible relationship between the means of gene expression (counts) values and their variances across replicates of a gene expression experiment. Thus data can be inspected for possible overdispersion (greater variability in a dataset than would be expected based on a given simple statistical model). In such plots for RNA-Seq data, overdispersion is usually manifested as increasing variance with increasing gene expression (counts) and it is summarized through a smoothing curve (red curve). The following is taken from the EDASeq package vignette: ‘…although the Poisson distribution is a natural and simple way to model count data, it has the limitation of assuming equality of the mean and variance. For this reason, the negative binomial distribution has been proposed as an alternative when the data show over-dispersion…’ If overdispersion is not present, the data cloud is expected to be evenly scattered around the smoothing curve.
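A rough sketch of the quantities behind a mean-variance plot follows. It computes the per-gene mean and sample variance across replicates and flags genes whose variance exceeds the mean, which is the Poisson expectation quoted above. The gene names and counts are hypothetical, and a plain variance-versus-mean comparison is only a crude indicator, not a statistical test.

```python
import statistics

def mean_variance(counts_by_gene):
    """Per-gene (mean, sample variance) across replicates."""
    return {gene: (statistics.mean(values), statistics.variance(values))
            for gene, values in counts_by_gene.items()}

def overdispersed(counts_by_gene):
    """Genes whose variance exceeds their mean (Poisson would give variance = mean)."""
    return sorted(gene for gene, (mean, var) in mean_variance(counts_by_gene).items()
                  if var > mean)

# Hypothetical replicate counts for a lowly and a highly expressed gene.
toy_counts = {"low": [4, 5, 6], "high": [800, 1000, 1200]}
mv = mean_variance(toy_counts)
flagged = overdispersed(toy_counts)
```

The highly expressed gene has a variance far above its mean, which illustrates variance growing with expression as described in the paragraph above.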

RNA composition

RNA composition plot

The RNA composition plots depict differences in the distributions of reads in the same biological features across samples. The following is taken from the NOISeq vignette: ‘…when two samples have different RNA composition, the distribution of sequencing reads across the features is different in such a way that although a feature had the same number of read counts in both samples, it would not mean that it was equally expressed in both… To check if this bias is present in the data, the RNA composition plot and the corresponding diagnostic test can be used. In this case, each sample s is compared to the reference sample r (which can be arbitrarily chosen). To do that, M values are computed as log2(counts_sample / counts_reference). If no bias is present, it should be expected that the median of M values for each comparison is 0. Otherwise, it would be indicating that expression levels in one of the samples tend to be higher than in the other, and this could lead to false discoveries when computing differential expression. Confidence intervals for the M median are also computed by bootstrapping. If value 0 does not fall inside the interval, it means that the deviation of the sample with regard to the reference sample is statistically significant. Therefore, a normalization procedure is required.’
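The diagnostic described in the quotation can be sketched as below, taking M as the log2 ratio of sample counts to reference counts. This is a simplified percentile bootstrap written for illustration and not the NOISeq implementation; the counts, the number of resamples and the handling of zero counts are assumptions.

```python
import math
import random
import statistics

def m_median_ci(sample, reference, n_boot=1000, alpha=0.05, seed=0):
    """Median of M = log2(sample / reference) with a percentile bootstrap interval.

    Features with a zero count in either sample are skipped.
    """
    m = [math.log2(s / r) for s, r in zip(sample, reference) if s > 0 and r > 0]
    rng = random.Random(seed)
    # Resample the M values with replacement and record each resample's median.
    medians = sorted(
        statistics.median(rng.choices(m, k=len(m))) for _ in range(n_boot)
    )
    lo = medians[int((alpha / 2) * n_boot)]
    hi = medians[int((1 - alpha / 2) * n_boot) - 1]
    return statistics.median(m), (lo, hi)

# Hypothetical counts: the sample has roughly twice the reads of the reference.
reference = [10, 20, 40, 80, 160, 320]
sample = [22, 38, 85, 150, 330, 640]
med, (lo, hi) = m_median_ci(sample, reference)
```

Because every M value in this toy example is close to 1, the bootstrap interval lies entirely above 0, which under the quoted criterion would indicate that normalization is required.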

Statistics

Differential expression assessment figures

The following figures allow for the assessment of the statistical testing procedures performed by the metaseqr2 pipeline. Each figure category is accompanied by an explanatory text. All figures are interactive with additional controls on the top right corner of the figure.

Volcano

Volcano plots

A volcano plot is a scatterplot that is often used when analyzing high-throughput -omics data (e.g. microarray data, RNA-Seq data) to give an overview of interesting genes. The log2 fold change is plotted on the x-axis and the negative log10 p-value is plotted on the y-axis. A volcano plot combines the results of a statistical test (aka, p-values) with the magnitude of the change enabling quick visual identification of those genes that display large-magnitude changes and that are also statistically significant. The horizontal dashed line sets the threshold for statistical significance, while the vertical dashed lines set the thresholds for biological significance. It should be noted that volcano plots become harder to interpret when using more than one statistical algorithm and performing meta-analysis. This happens because the genes that have stronger evidence of being differentially expressed obtain lower p-values while the rest either remain at similar levels or obtain higher p-values. The result is a ‘warped’ volcano plot, with two main data clouds: one in the upper part of the plot, and one in the lower part of the plot. You can always zoom in when using interactive mode (the default).
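The coordinates and categories of a volcano plot can be sketched as follows. The cutoffs (p-value 0.05 and an absolute log2 fold change of 1), the gene names and the values are hypothetical and are not the thresholds used in this report.

```python
import math

def volcano_points(genes, p_cutoff=0.05, log2fc_cutoff=1.0):
    """Map (name, log2 fold change, p-value) to volcano coordinates and a category."""
    y_threshold = -math.log10(p_cutoff)  # height of the horizontal dashed line
    points = []
    for name, log2fc, p in genes:
        # Guard against p-values of exactly zero, whose log10 is undefined.
        y = -math.log10(max(p, 1e-300))
        if y >= y_threshold and log2fc >= log2fc_cutoff:
            category = "up-regulated"
        elif y >= y_threshold and log2fc <= -log2fc_cutoff:
            category = "down-regulated"
        else:
            category = "unregulated"
        points.append((name, log2fc, y, category))
    return points

# Hypothetical results for four genes.
toy_genes = [
    ("geneA", 2.5, 1e-6),
    ("geneB", -1.8, 0.001),
    ("geneC", 0.2, 0.5),
    ("geneD", 3.0, 0.2),
]
points = volcano_points(toy_genes)
```

Note that `geneD` has a large fold change but is not statistically significant, so it falls below the horizontal line and is not counted as deregulated.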

Select a contrast to display plot for

MA

Mean-Difference (MA) plots

A mean-difference (or MA) plot with overlaid statistical information (p-value and fold change thresholds manifested as points with different colors) is a very useful graphic that enables the visualization of the results of differential expression analysis. It differs from the volcano plot regarding what is displayed in the axes system. While a volcano plot displays the fold change (x-axis) versus the statistical significance (y-axis), an MA plot with statistical scores depicts average expression over the biological conditions that are compared (x-axis) versus the fold change of the comparison. Statistical significance categorization is added as point coloring and statistical significance is indicated only by different colors and not by the position to the axes system as in the volcano plot. This plot is useful when it is of little interest how statistically significant a gene/transcript is (we are interested only in the fact that it is) but someone is interested in actual expression and fold change values instead.

Select a contrast to display plot for

Heatmap

Differential expression heatmaps

Differentially Expressed Genes (DEGs) heatmaps depict how well samples from different conditions cluster together according to their expression values after normalization and statistical testing, for each requested statistical contrast. If samples from the same biological condition do not cluster together, this would constitute a warning sign regarding the quality of the samples. In addition, DEG heatmaps provide an initial view of possible clusters of co-expressed genes.

Select a contrast to display heatmap for
Select a scale to display heatmap for

Biodist

Chromosome and biotype distributions of differentially expressed genes

The chromosome and biotype distributions bar diagram for Differentially Expressed Genes (DEGs) is split in two panels: i) in the upper panel, DEGs are distributed per chromosome and the percentage of each chromosome in the genome is presented in red bars, the percentage of DEGs in each chromosome is presented in green bars and the percentage of certain chromosomes in the distribution of DEGs is presented in blue bars; ii) in the lower panel, DEGs are distributed per biotype and the percentage of each biotype in the genome (i.e. in the whole set of features provided, for example, protein coding genes, non coding RNAs or pseudogenes) is presented in red bars, the percentage of DEGs in each biotype is presented in green bars and the percentage of each biotype in DEGs is presented in blue lines. The vertical line separates the most abundant biotypes (on the left-hand side, corresponding to the left axis scale) from the rest (on the right-hand side, corresponding to the right axis scale). Otherwise, the lower abundance biotypes would be indistinguishable.

Select a contrast to display plot for

Chromosome distribution of differentially expressed genes

Biotype distribution of differentially expressed genes

Deregulogram

Deregulograms

The de-regulogram is a scatterplot of fold changes between two different contrasts. It depicts whether the DEGs between the two selected contrasts follow a concordant or discordant regulation pattern. For each (common) DEG, the x-axis and y-axis represent the log2 fold change in the two contrasts. The location of each point among the four quadrants directly shows its regulation pattern in the two comparisons. Points in the upper-right or lower-left quadrant (fold changes with the same sign) represent DEGs with a common regulation pattern, while points in the upper-left or lower-right quadrant (fold changes with opposite signs) represent DEGs with opposite patterns of regulation.
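The classification of a point in a de-regulogram can be sketched by comparing the signs of its two log2 fold changes: the same sign in both contrasts means a common (concordant) pattern, opposite signs mean a discordant one. The gene names and fold changes below are hypothetical.

```python
def regulation_pattern(log2fc_x, log2fc_y):
    """Classify a gene by the signs of its log2 fold changes in two contrasts."""
    if log2fc_x == 0 or log2fc_y == 0:
        return "on axis"  # no change in at least one contrast
    same_direction = (log2fc_x > 0) == (log2fc_y > 0)
    return "concordant" if same_direction else "discordant"

# Hypothetical common DEGs: (log2 fold change in contrast 1, in contrast 2).
toy_degs = {
    "geneA": (1.5, 2.0),
    "geneB": (-1.2, -0.8),
    "geneC": (1.1, -1.7),
    "geneD": (-2.3, 0.9),
}
patterns = {gene: regulation_pattern(x, y) for gene, (x, y) in toy_degs.items()}
```

Here `geneA` and `geneB` change in the same direction in both contrasts, while `geneC` and `geneD` change in opposite directions.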

Select a pair of contrasts to display plot for

StatVenn

Venn diagram of differentially expressed genes

Venn diagrams are an intuitive way of presenting overlaps between lists, based on the overlap of basic geometrical shapes. The numbers of overlapping genes per statistical algorithm are shown in the different areas of the Venn diagrams, one for each contrast. Apart from a p-value cutoff, a fold change threshold of 0.5 in log2 scale is applied for each contrast. For multi-condition contrasts, the first condition is used to calculate the fold change against the reference.
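The numbers shown in the areas of a Venn diagram can be sketched with plain set operations: each area counts the genes found by exactly that combination of lists and by no other list. The algorithm names are taken from this analysis, but the gene lists are hypothetical.

```python
from itertools import combinations

def venn_regions(sets):
    """Count genes belonging to exactly each combination of the named sets."""
    names = sorted(sets)
    regions = {}
    for k in range(1, len(names) + 1):
        for combo in combinations(names, k):
            inside = set.intersection(*(sets[n] for n in combo))
            others = [sets[n] for n in names if n not in combo]
            outside = set().union(*others)
            exclusive = inside - outside  # in all of combo, in none of the others
            if exclusive:
                regions[combo] = len(exclusive)
    return regions

# Hypothetical DEG lists for three of the algorithms used in this analysis.
toy_lists = {
    "deseq2": {"g1", "g2", "g3", "g4"},
    "edger": {"g2", "g3", "g4", "g5"},
    "limma": {"g3", "g4", "g6"},
}
regions = venn_regions(toy_lists)
```

The counts over all areas add up to the size of the union of the lists, since every gene is assigned to exactly one area.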

Select a contrast to display Venn diagram for
Select a direction to display Venn diagram for
Click on a number on Venn diagrams to display the respective genes

FoldVenn

Venn diagram of differentially expressed genes

Venn diagrams are an intuitive way of presenting overlaps between lists, based on the overlap of basic geometrical shapes. The numbers of overlapping genes per statistical contrast are shown in the different areas of the Venn diagrams, one for each contrast. Apart from a p-value cutoff, a fold change threshold of 0.5 in log2 scale is applied for each contrast. For multi-condition contrasts, the first condition is used to calculate the fold change against the reference.

Select an algorithm to display Venn diagram for
Select a direction to display Venn diagram for
Click on a number on Venn diagrams to display the respective genes

Results

Tables of differentially expressed genes

The following tables allow for a quick exploration of the results of the statistical analysis performed by the metaseqr2 pipeline.

Each table presents the top 5% statistically significant genes. Use the download links below each table to retrieve the total list of differentially expressed genes or the whole gene list of the selected genome irrespective of differential expression. Furthermore, each table can be searched using the search field on the top right, and you can also find the following information:

  • The chromosome column is linked to the genomic location of the gene and opens a new tab/window to the UCSC Genome Browser
  • The gene_id column opens a link to the respective full annotation source (only for Ensembl and RefSeq)
  • The background of the p_value and FDR columns displays a bar with length proportional to the significance of each gene
  • The background color of the fold change (vs) column(s) shows the deregulation of each gene and is proportional to the deregulation strength (red for up-regulation, green for down-regulation)
  • The background of the remaining columns (condition average expression) displays a bar with length proportional to the expression strength of each condition

Select a contrast to display DEG table for

DEG table for the contrast DEN_Smyd3KO vs DEN_WT

The following table presents the top 5% statistically significant genes for the contrast DEN_Smyd3KO vs DEN_WT.




DEG table for the contrast DEN_WT vs WT

The following table presents the top 5% statistically significant genes for the contrast DEN_WT vs WT.




DEG table for the contrast DEN_Smyd3KO vs Smyd3KO

The following table presents the top 5% statistically significant genes for the contrast DEN_Smyd3KO vs Smyd3KO.




DEG table for the contrast Smyd3KO vs WT

The following table presents the top 5% statistically significant genes for the contrast Smyd3KO vs WT.





References

  1. Moulos, P., Hatzis, P. (2015). Systematic integration of RNA-Seq statistical algorithms for accurate detection of differential gene expression patterns. Nucleic Acids Research 43(4), e25.
  2. Statham, A.L., Strbenac, D., Coolen, M.W., Stirzaker, C., Clark, S.J., Robinson, M.D. (2010) Repitools: an R package for the analysis of enrichment-based epigenomic data. Bioinformatics 26(13), 1662-1663.
  3. Anders, S., and Huber, W. (2010). Differential expression analysis for sequence count data. Genome Biol 11, R106.
  4. Love, M.I., Huber, W., and Anders, S. (2014). Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology 15(12), 550.
  5. Robinson, M.D., McCarthy, D.J., and Smyth, G.K. (2010). edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139-140.
  6. Tarazona, S., Garcia-Alcalde, F., Dopazo, J., Ferrer, A., and Conesa, A. (2011). Differential expression in RNA-seq: a matter of depth. Genome Res 21, 2213-2223.
  7. Smyth, G. (2005). Limma: linear models for microarray data. In Bioinformatics and Computational Biology Solutions using R and Bioconductor, G. R., C. V., D. S., I. R., and H. W., eds. (New York, Springer), pp. 397-420.
  8. Di, Y, Schafer, D., Cumbie, J.S., and Chang, J.H. (2011). The NBP Negative Binomial Model for Assessing Differential Gene Expression from RNA-Seq. Statistical Applications in Genetics and Molecular Biology 10(1), 1-28.
  9. Yang, W., Rosenstiel, P., and Schulenburg, H. (2016). ABSSeq: a new RNA-Seq analysis method based on modelling absolute expression differences. BMC Genomics 17, 541.
  10. Wu, H., Wang, C., and Wu, Z. (2013). A new shrinkage estimator for dispersion improves differential expression detection in RNA-seq data. Biostatistics 14(2), 232-243. doi:10.1093/biostatistics/kxs033
  11. Planet, E., Attolini, C.S., Reina, O., Flores, O., and Rossell, D. (2012). htSeqTools: high-throughput sequencing quality control, processing and visualization in R. Bioinformatics 28, 589-590.
  12. Risso, D., Schwartz, K., Sherlock, G., and Dudoit, S. (2011). GC-content normalization for RNA-Seq data. BMC Bioinformatics 12, 480.
  13. Chen, H., and Boutros, P.C. (2011). VennDiagram: a package for the generation of highly-customizable Venn and Euler diagrams in R. BMC Bioinformatics 12, 35.
  14. Benjamini, Y., and Hochberg, Y. (1995). Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society Series B (Methodological) 57, 289-300.