library(scater)
library(scran)
library(BiocStyle)
library(pheatmap)
sce.paul <- readRDS("output/paul_quality_control_sce.RDS")

Normalization

library(scran)
set.seed(101000110)
clusters <- quickCluster(sce.paul)
sce.paul <- computeSumFactors(sce.paul, clusters=clusters)
sce.paul <- logNormCounts(sce.paul)

We examine some key metrics for the distribution of size factors, and compare it to the library sizes as a sanity check.

summary(sizeFactors(sce.paul))
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
0.05681 0.42248 0.77545 1.00000 1.33514 9.65387 
plot(librarySizeFactors(sce.paul), sizeFactors(sce.paul), pch=16,
    xlab="Library size factors", ylab="Deconvolution factors", log="xy")

Relationship between the library size factors and the deconvolution size factors in the Paul HSC dataset.

Variance modelling

We fit a mean-variance trend to the endogenous genes to detect highly variable genes. Unfortunately, the plates are confounded with an experimental treatment (Batch_desc) so we cannot block on the plate of origin.

set.seed(00010101)
dec.paul <- modelGeneVarByPoisson(sce.paul)
top.paul <- getTopHVGs(dec.paul, prop=0.1)
plot(dec.paul$mean, dec.paul$total, pch=16, cex=0.5, 
    xlab="Mean of log-expression", ylab="Variance of log-expression")
curve(metadata(dec.paul)$trend(x), col="blue", add=TRUE)

Per-gene variance as a function of the mean for the log-expression values in the Paul HSC dataset. Each point represents a gene (black) with the mean-variance trend (blue) fitted to simulated Poisson noise.

saveRDS(sce.paul,"output/paul_normalization_sce.RDS")
saveRDS(dec.paul,"output/paul_normalization_dec.RDS")
saveRDS(top.paul,"output/paul_normalization_top.RDS")

Click to see page metadata

Computation Started: 2023-07-21 16:24:08

Finished in 1.003 mins


Git Log

No git history available for this page


Packages

package version date
MatrixGenerics 1.2.0 2020-10-28
Biobase 2.50.0 2020-10-28
viridis 0.5.1 2020-07-17
edgeR 3.32.1 2021-01-15
BiocSingular 1.6.0 2020-10-28
viridisLite 0.3.0 2020-06-15
DelayedMatrixStats 1.12.2 2021-01-13
scuttle 1.0.4 2020-12-18
assertthat 0.2.1 2020-07-15
statmod 1.4.35 2020-10-20
highr 0.8 2020-07-15
BiocManager 1.30.10 2020-07-15
stats4 4.0.1 2020-06-07
dqrng 0.2.1 2020-07-15
grDevices 4.0.1 2020-06-07
GenomeInfoDbData 1.2.4 2020-11-03
vipor 0.4.5 2020-07-15
yaml 2.2.1 2020-07-15
pillar 1.6.0 2021-04-14
lattice 0.20-41 2020-06-07
glue 1.4.2 2020-08-28
base 4.0.1 2020-06-07
limma 3.46.0 2020-10-28
beachmat 2.6.4 2020-12-21
digest 0.6.27 2020-10-25
RColorBrewer 1.1-2 2020-07-15
GenomicRanges 1.42.0 2020-10-28
XVector 0.30.0 2020-10-29
colorspace 2.0-0 2020-11-12
htmltools 0.5.5 2023-07-17
Matrix 1.2-18 2020-06-07
pkgconfig 2.0.3 2020-07-15
pheatmap 1.0.12 2020-07-16
zlibbioc 1.36.0 2020-10-29
purrr 0.3.4 2020-07-15
scales 1.1.1 2020-07-16
BiocParallel 1.24.1 2020-11-07
git2r 0.28.0 2021-01-11
tibble 3.1.1 2021-04-19
generics 0.1.0 2020-11-01
datasets 4.0.1 2020-06-07
IRanges 2.24.1 2020-12-13
ggplot2 3.3.3 2020-12-31
ellipsis 0.3.1 2020-07-15
withr 2.4.2 2021-04-19
SummarizedExperiment 1.20.0 2020-10-28
BiocGenerics 0.36.0 2020-10-28
cli 2.5.0 2021-04-27
magrittr 2.0.1 2020-11-18
crayon 1.4.1 2021-02-09
evaluate 0.14 2020-06-15
methods 4.0.1 2020-06-07
fansi 0.4.2 2021-01-16
bluster 1.0.0 2020-10-28
utils 4.0.1 2020-06-07
beeswarm 0.2.3 2020-07-15
tools 4.0.1 2020-06-07
scater 1.18.3 2020-11-09
BiocStyle 2.18.1 2020-11-25
lifecycle 1.0.0 2021-02-16
matrixStats 0.57.0 2020-09-26
stringr 1.4.0 2020-07-15
S4Vectors 0.28.1 2020-12-10
munsell 0.5.0 2020-07-15
locfit 1.5-9.4 2020-07-15
DelayedArray 0.16.0 2020-10-28
irlba 2.3.3 2020-07-15
stats 4.0.1 2020-06-07
compiler 4.0.1 2020-06-07
GenomeInfoDb 1.26.2 2020-12-09
rsvd 1.0.3 2020-07-15
rlang 1.1.1 2023-07-17
grid 4.0.1 2020-06-07
RCurl 1.98-1.2 2020-07-15
BiocNeighbors 1.8.2 2020-12-08
graphics 4.0.1 2020-06-07
SingleCellExperiment 1.12.0 2020-10-28
igraph 1.2.6 2020-10-07
bitops 1.0-6 2020-07-15
rmarkdown 2.23 2023-07-17
gtable 0.3.0 2020-07-15
DBI 1.1.1 2021-01-16
R6 2.5.0 2020-10-29
gridExtra 2.3 2020-07-15
knitr 1.30 2020-09-23
dplyr 1.0.5 2021-03-06
fastmap 1.1.1 2023-07-17
utf8 1.1.4 2020-07-15
stringi 1.5.3 2020-09-10
ggbeeswarm 0.6.0 2020-07-16
parallel 4.0.1 2020-06-07
Rcpp 1.0.6 2021-01-16
scran 1.18.3 2020-12-22
vctrs 0.3.6 2020-12-18
tidyselect 1.1.0 2020-07-15
xfun 0.39 2023-07-17
sparseMatrixStats 1.2.0 2020-10-28

System Information

systemInfo
version R version 4.0.1 (2020-06-06)
platform x86_64-apple-darwin17.0 (64-bit)
locale en_CA.UTF-8
OS macOS 10.16
UI X11

Scikick Configuration

cat scikick.yml
### Scikick Project Workflow Configuration File

# Directory where Scikick will store all standard notebook outputs
reportdir: report

# --- Content below here is best modified by using the Scikick CLI ---

# Notebook Execution Configuration (format summarized below)
# analysis:
#  first_notebook.Rmd:
#  second_notebook.Rmd: 
#  - first_notebook.Rmd     # must execute before second_notebook.Rmd
#  - functions.R            # file is used by second_notebook.Rmd
#
# Each analysis item is executed to generate md and html files, E.g.:
# 1. <reportdir>/out_md/first_notebook.md
# 2. <reportdir>/out_html/first_notebook.html
analysis: !!omap
- index.Rmd:
- notebooks/nestorowa/import.Rmd:
- notebooks/nestorowa/quality_control.Rmd:
  - notebooks/nestorowa/import.Rmd
- notebooks/nestorowa/normalization.Rmd:
  - notebooks/nestorowa/quality_control.Rmd
- notebooks/nestorowa/further_exploration.Rmd:
  - notebooks/nestorowa/normalization.Rmd
- notebooks/grun/import.Rmd:
- notebooks/grun/quality_control.Rmd:
  - notebooks/grun/import.Rmd
- notebooks/grun/normalization.Rmd:
  - notebooks/grun/quality_control.Rmd
- notebooks/grun/further_exploration.Rmd:
  - notebooks/grun/normalization.Rmd
- notebooks/paul/import.Rmd:
- notebooks/paul/quality_control.Rmd:
  - notebooks/paul/import.Rmd
- notebooks/paul/normalization.Rmd:
  - notebooks/paul/quality_control.Rmd
- notebooks/paul/further_exploration.Rmd:
  - notebooks/paul/normalization.Rmd
- notebooks/merged/merge.Rmd:
  - notebooks/grun/quality_control.Rmd
  - notebooks/paul/quality_control.Rmd
  - notebooks/nestorowa/normalization.Rmd
- notebooks/merged/combined_analysis.Rmd:
  - notebooks/merged/merge.Rmd
version_info:
  snakemake: 6.0.2
  ruamel.yaml: 0.16.12
  scikick: 0.2.1
# Optional site theme customization
output:
  BiocStyle::html_document:
    code_folding: hide
    theme: readable
    toc_float: true
    toc: true
    number_sections: false
    toc_depth: 5
    self_contained: true

Functions



Next (Project Map) skmap cluster_/ / cluster_notebooks/nestorowa/ notebooks/nestorowa/ cluster_notebooks/grun/ notebooks/grun/ cluster_notebooks/paul/ notebooks/paul/ cluster_notebooks/merged/ notebooks/merged/ notebooks/grun/quality_control.Rmd Quality Control notebooks/merged/merge.Rmd Merge notebooks/grun/quality_control.Rmd->notebooks/merged/merge.Rmd notebooks/grun/normalization.Rmd Normalization notebooks/grun/quality_control.Rmd->notebooks/grun/normalization.Rmd notebooks/merged/combined_analysis.Rmd Combined Analysis notebooks/merged/merge.Rmd->notebooks/merged/combined_analysis.Rmd notebooks/paul/quality_control.Rmd Quality Control notebooks/paul/quality_control.Rmd->notebooks/merged/merge.Rmd notebooks/paul/normalization.Rmd Normalization notebooks/paul/quality_control.Rmd->notebooks/paul/normalization.Rmd notebooks/nestorowa/normalization.Rmd Normalization notebooks/nestorowa/normalization.Rmd->notebooks/merged/merge.Rmd notebooks/nestorowa/further_exploration.Rmd Further Exploration notebooks/nestorowa/normalization.Rmd->notebooks/nestorowa/further_exploration.Rmd index.Rmd Index notebooks/nestorowa/import.Rmd Import notebooks/nestorowa/quality_control.Rmd Quality Control notebooks/nestorowa/import.Rmd->notebooks/nestorowa/quality_control.Rmd notebooks/nestorowa/quality_control.Rmd->notebooks/nestorowa/normalization.Rmd notebooks/grun/import.Rmd Import notebooks/grun/import.Rmd->notebooks/grun/quality_control.Rmd notebooks/grun/further_exploration.Rmd Further Exploration notebooks/grun/normalization.Rmd->notebooks/grun/further_exploration.Rmd notebooks/paul/import.Rmd Import notebooks/paul/import.Rmd->notebooks/paul/quality_control.Rmd notebooks/paul/further_exploration.Rmd Further Exploration notebooks/paul/normalization.Rmd->notebooks/paul/further_exploration.Rmd