Here is a very quick example of a DER Finder analysis. This analysis is explained in more detail later on in this document.
## Load libraries
library('derfinder')
library('derfinderData')
library('GenomicRanges')
## Determine the files to use and fix the names
files <- rawFiles(system.file('extdata', 'AMY', package = 'derfinderData'), samplepatt = 'bw', fileterm = NULL)
names(files) <- gsub('.bw', '', names(files))
## Load the data from disk -- On Windows you have to load data from Bam files
fullCov <- fullCoverage(files = files, chrs = 'chr21', verbose = FALSE)
## Get the region matrix of Expressed Regions (ERs)
regionMat <- regionMatrix(fullCov, cutoff = 30, L = 76, verbose = FALSE)
## Get pheno table
pheno <- subset(brainspanPheno, structure_acronym == 'AMY')
## Identify which ERs are differentially expressed, that is, find the DERs
library('DESeq2')
## Round matrix
counts <- round(regionMat$chr21$coverageMatrix)
## Round matrix and specify design
dse <- DESeqDataSetFromMatrix(counts, pheno, ~ group + gender)
## Perform DE analysis
dse <- DESeq(dse, test = 'LRT', reduced = ~ gender, fitType = 'local')
## Extract results
mcols(regionMat$chr21$regions) <- c(mcols(regionMat$chr21$regions), results(dse))
## Save info in an object with a shorter name
ers <- regionMat$chr21$regions
ers
derfinder is an R package that implements the DER Finder approach (Frazee, Sabunciyan, Hansen, Irizarry, et al., 2014) for RNA-seq data. Briefly, this approach is an alternative to feature-counting and transcript assembly. The basic idea is to identify contiguous base-pairs in the genome that present differential expression signal. These base-pairs are grouped into _d_ifferentially _e_xpressed _r_regions (DERs). This approach is annotation-agnostic which is a feature you might be interested in. In particular, derfinder contains functions that allow you to identify DERs via two alternative methods. You can find more details on the full derfinder users guide.
This is a brief overview of what a DER Finder analysis looks like. In particular, here we will be identifying expressed regions (ERs) without relying on annotation. Next, we’ll identify candidate differentially expressed regions (DERs). Finally, we’ll compare the DERs with known annotation features.
We first load the required packages.
## Load libraries
library('derfinder')
library('derfinderData')
library('GenomicRanges')
Next, we need to locate the chromosome 21 coverage files for a set of 12 samples. These samples are a small subset from the BrainSpan Atlas of the Human Brain (BrainSpan, 2011) publicly available data. The function rawFiles()
helps us in locating these files.
## Determine the files to use and fix the names
files <- rawFiles(system.file('extdata', 'AMY', package = 'derfinderData'), samplepatt = 'bw', fileterm = NULL)
names(files) <- gsub('.bw', '', names(files))
Next, we can load the full coverage data into memory using the fullCoverage()
function.
## Load the data from disk
fullCov <- fullCoverage(files = files, chrs = 'chr21', verbose = FALSE)
Now that we have the data, we can identify expressed regions (ERs) by using a cutoff of 30 on the base-level mean coverage from these 12 samples. Once the regions have been identified, we can calculate a coverage matrix with one row per ER and one column per sample (12 in this case). For doing this calculation we need to know the length of the sequence reads, which in this study were 76 bp long.
## Get the region matrix of Expressed Regions (ERs)
regionMat <- regionMatrix(fullCov, cutoff = 30, L = 76, verbose = FALSE)
regionMatrix()
returns a list of elements, each with three pieces of output. The actual ERs are arranged in a GRanges
object named regions
.
## regions output
regionMat$chr21$regions
## GRanges object with 45 ranges and 6 metadata columns:
## seqnames ranges strand | value
## <Rle> <IRanges> <Rle> | <numeric>
## 1 chr21 [ 9827018, 9827582] * | 313.671741938907
## 2 chr21 [15457301, 15457438] * | 215.084607424943
## 3 chr21 [20230140, 20230192] * | 38.8325471608144
## 4 chr21 [20230445, 20230505] * | 41.3245082031834
## 5 chr21 [27253318, 27253543] * | 34.9131305372469
## .. ... ... ... . ...
## 41 chr21 [33039644, 33039688] * | 34.4705371008979
## 42 chr21 [33040784, 33040798] * | 32.134222178989
## 43 chr21 [33040890, 33040890] * | 30.0925002098083
## 44 chr21 [33040900, 33040901] * | 30.1208333174388
## 45 chr21 [48019401, 48019414] * | 31.1489284609755
## area indexStart indexEnd cluster clusterL
## <numeric> <integer> <integer> <Rle> <Rle>
## 1 177224.534195483 1 565 1 565
## 2 29681.6758246422 566 703 2 138
## 3 2058.12499952316 704 756 3 366
## 4 2520.79500039419 757 817 3 366
## 5 7890.3675014178 818 1043 4 765
## .. ... ... ... ... ...
## 41 1551.17416954041 2180 2224 17 45
## 42 482.013332684835 2225 2239 18 118
## 43 30.0925002098083 2240 2240 18 118
## 44 60.2416666348775 2241 2242 18 118
## 45 436.084998453657 2243 2256 19 14
## -------
## seqinfo: 1 sequence from an unspecified genome
bpCoverage
is the base-level coverage list which can then be used for plotting.
## Base-level coverage matrices for each of the regions
## Useful for plotting
lapply(regionMat$chr21$bpCoverage[1:2], head, n = 2)
## $`1`
## HSB113 HSB123 HSB126 HSB130 HSB135 HSB136 HSB145 HSB153 HSB159 HSB178
## 1 93.20 3.32 28.22 5.62 185.17 98.34 5.88 16.71 3.52 15.71
## 2 124.76 7.25 63.68 11.32 374.85 199.28 10.39 30.53 5.83 29.35
## HSB92 HSB97
## 1 47.40 36.54
## 2 65.04 51.42
##
## $`2`
## HSB113 HSB123 HSB126 HSB130 HSB135 HSB136 HSB145 HSB153 HSB159 HSB178
## 566 45.59 7.94 15.92 34.75 141.61 104.21 19.87 38.61 4.97 23.2
## 567 45.59 7.94 15.92 35.15 141.64 104.30 19.87 38.65 4.97 23.2
## HSB92 HSB97
## 566 13.95 22.21
## 567 13.95 22.21
## Check dimensions. First region is 565 long, second one is 138 bp long.
## The columns match the number of samples (12 in this case).
lapply(regionMat$chr21$bpCoverage[1:2], dim)
## $`1`
## [1] 565 12
##
## $`2`
## [1] 138 12
The end result of the coverage matrix is shown below. Note that the coverage has been adjusted for read length. Because reads might not fully align inside a given region, the numbers are generally not integers but can be rounded if needed.
## Dimensions of the coverage matrix
dim(regionMat$chr21$coverageMatrix)
## [1] 45 12
## Coverage for each region. This matrix can then be used with limma or other pkgs
head(regionMat$chr21$coverageMatrix)
## HSB113 HSB123 HSB126 HSB130 HSB135 HSB136
## 1 3653.1093346 277.072106 1397.068687 1106.722895 8987.460401 5570.221054
## 2 333.3740816 99.987237 463.909476 267.354342 1198.713552 1162.313418
## 3 35.3828948 20.153553 30.725394 23.483947 16.786842 17.168947
## 4 42.3398681 29.931579 41.094474 24.724736 32.634080 19.309606
## 5 77.7402631 168.939342 115.059342 171.861974 180.638684 93.503158
## 6 0.7988158 1.770263 1.473421 2.231053 1.697368 1.007895
## HSB145 HSB153 HSB159 HSB178 HSB92 HSB97
## 1 1330.158818 1461.2986829 297.939342 1407.288552 1168.519079 1325.9622371
## 2 257.114210 313.8513139 67.940131 193.695657 127.543553 200.7834228
## 3 22.895921 52.8756585 28.145395 33.127368 23.758816 20.4623685
## 4 33.802632 51.6146040 31.244343 33.576974 29.546183 28.2011836
## 5 90.950526 36.3046051 78.069605 97.151316 100.085790 35.5428946
## 6 1.171316 0.4221053 1.000132 1.139079 1.136447 0.3956579
We can then use the coverage matrix and packages such as limma, DESeq2 or edgeR to identify which ERs are differentially expressed. Here we’ll use DESeq2 for which we need some phenotype data.
## Get pheno table
pheno <- subset(brainspanPheno, structure_acronym == 'AMY')
Now we can identify the DERs using a rounded version of the coverage matrix.
## Identify which ERs are differentially expressed, that is, find the DERs
library('DESeq2')
## Loading required package: SummarizedExperiment
## Loading required package: Biobase
## Welcome to Bioconductor
##
## Vignettes contain introductory material; view with
## 'browseVignettes()'. To cite Bioconductor, see
## 'citation("Biobase")', and for packages 'citation("pkgname")'.
## Round matrix
counts <- round(regionMat$chr21$coverageMatrix)
## Round matrix and specify design
dse <- DESeqDataSetFromMatrix(counts, pheno, ~ group + gender)
## converting counts to integer mode
## Perform DE analysis
dse <- DESeq(dse, test = 'LRT', reduced = ~ gender, fitType = 'local')
## estimating size factors
## estimating dispersions
## gene-wise dispersion estimates
## mean-dispersion relationship
## final dispersion estimates
## fitting model and testing
## Extract results
mcols(regionMat$chr21$regions) <- c(mcols(regionMat$chr21$regions), results(dse))
## Save info in an object with a shorter name
ers <- regionMat$chr21$regions
ers
## GRanges object with 45 ranges and 12 metadata columns:
## seqnames ranges strand | value
## <Rle> <IRanges> <Rle> | <numeric>
## 1 chr21 [ 9827018, 9827582] * | 313.671741938907
## 2 chr21 [15457301, 15457438] * | 215.084607424943
## 3 chr21 [20230140, 20230192] * | 38.8325471608144
## 4 chr21 [20230445, 20230505] * | 41.3245082031834
## 5 chr21 [27253318, 27253543] * | 34.9131305372469
## .. ... ... ... . ...
## 41 chr21 [33039644, 33039688] * | 34.4705371008979
## 42 chr21 [33040784, 33040798] * | 32.134222178989
## 43 chr21 [33040890, 33040890] * | 30.0925002098083
## 44 chr21 [33040900, 33040901] * | 30.1208333174388
## 45 chr21 [48019401, 48019414] * | 31.1489284609755
## area indexStart indexEnd cluster clusterL
## <numeric> <integer> <integer> <Rle> <Rle>
## 1 177224.534195483 1 565 1 565
## 2 29681.6758246422 566 703 2 138
## 3 2058.12499952316 704 756 3 366
## 4 2520.79500039419 757 817 3 366
## 5 7890.3675014178 818 1043 4 765
## .. ... ... ... ... ...
## 41 1551.17416954041 2180 2224 17 45
## 42 482.013332684835 2225 2239 18 118
## 43 30.0925002098083 2240 2240 18 118
## 44 60.2416666348775 2241 2242 18 118
## 45 436.084998453657 2243 2256 19 14
## baseMean log2FoldChange lfcSE
## <numeric> <numeric> <numeric>
## 1 2846.28716155756 -1.69031816472548 0.83195870366948
## 2 451.519643002767 -1.16404264400518 0.757489962635838
## 3 29.5780895864663 0.0461487746156367 0.458096633387309
## 4 36.0602688380417 -0.186620018986545 0.390920460570197
## 5 101.646763062388 -0.138737745290958 0.320166382651629
## .. ... ... ...
## 41 20.7820346002786 -0.642055952701049 0.4276611764306
## 42 6.41054227728292 -0.634320864108628 0.51226189777078
## 43 0.129716561731455 -0.859548956942508 3.11653965011299
## 44 0.702291488002413 -0.628284603274914 2.24737800886765
## 45 5.29329263119445 -1.69456340588639 1.25228952015329
## stat pvalue padj
## <numeric> <numeric> <numeric>
## 1 0.215261662919431 0.642674256598179 0.997155030203294
## 2 0.87112644422632 0.350643649949685 0.997155030203294
## 3 3.13208182520646 0.0767656530295047 0.863613596581927
## 4 2.22570758632982 0.135730461463207 0.997155030203294
## 5 3.95798720528117 0.0466494518014198 0.862039631046612
## .. ... ... ...
## 41 0.604781356173675 0.43675951534617 0.997155030203294
## 42 0.545403917207146 0.460201756145951 0.997155030203294
## 43 0.0206273206873746 0.885798852352319 0.997155030203294
## 44 0.582510548110136 0.445329945344265 0.997155030203294
## 45 5.78959103593262 0.0161213391338242 0.725460261022089
## -------
## seqinfo: 1 sequence from an unspecified genome
We can then compare the DERs against known annotation to see which DERs overlap known exons, introns, or intergenic regions. A way to visualize this information is via a Venn diagram which we can create using vennRegions()
from the derfinderPlot package.
## Find overlaps between regions and summarized genomic annotation
annoRegs <- annotateRegions(ers, genomicState$fullGenome, verbose = FALSE)
library('derfinderPlot')
## Warning: replacing previous import 'ggplot2::Position' by
## 'BiocGenerics::Position' when loading 'ggbio'
venn <- vennRegions(annoRegs, counts.col = 'blue', main = 'Venn diagram using TxDb.Hsapiens.UCSC.hg19.knownGene annotation')
We can also identify the nearest annotated feature. In this case, we’ll look for the nearest known gene from the UCSC hg19 annotation.
## Load database of interest
library('TxDb.Hsapiens.UCSC.hg19.knownGene')
## Loading required package: GenomicFeatures
## Loading required package: AnnotationDbi
txdb <- keepSeqlevels(TxDb.Hsapiens.UCSC.hg19.knownGene, 'chr21')
## Find nearest feature
library('bumphunter')
## Loading required package: foreach
## Loading required package: iterators
## Loading required package: locfit
## locfit 1.5-9.1 2013-03-22
genes <- annotateTranscripts(txdb)
## Warning: Calling species() on a TxDb object is *deprecated*.
## Please use organism() instead.
## No annotationPackage supplied. Trying org.Hs.eg.db.
## Loading required package: org.Hs.eg.db
##
## Getting TSS and TSE.
## Getting CSS and CSE.
## Getting exons.
## Annotating genes.
annotation <- matchGenes(ers, subject = genes)
## View annotation results
head(annotation)
## name
## 1 MIR3687-1
## 2 LIPI
## 3 TMPRSS15
## 4 TMPRSS15
## 5 APP
## 6 APP
## annotation
## 1 NR_037458
## 2 NM_001302998 NM_001302999 NM_001303000 NM_001303001 NM_145317 NM_198996 NP_001289927 NP_001289928 NP_001289929 NP_001289930 NP_945347 XM_006723965 XM_006723966 XP_006724028 XP_006724029
## 3 NM_002772 NP_002763 XM_011529654 XM_011529655 XM_011529656 XM_011529657 XM_011529658 XM_011529659 XP_011527956 XP_011527957 XP_011527958 XP_011527959 XP_011527960 XP_011527961
## 4 NM_002772 NP_002763 XM_011529654 XM_011529655 XM_011529656 XM_011529657 XM_011529658 XM_011529659 XP_011527956 XP_011527957 XP_011527958 XP_011527959 XP_011527960 XP_011527961
## 5 NM_000484 NM_001136016 NM_001136129 NM_001136130 NM_001136131 NM_001204301 NM_001204302 NM_001204303 NM_201413 NM_201414 NP_000475 NP_001129488 NP_001129601 NP_001129602 NP_001129603 NP_001191230 NP_001191231 NP_001191232 NP_958816 NP_958817
## 6 NM_000484 NM_001136016 NM_001136129 NM_001136130 NM_001136131 NM_001204301 NM_001204302 NM_001204303 NM_201413 NM_201414 NP_000475 NP_001129488 NP_001129601 NP_001129602 NP_001129603 NP_001191230 NP_001191231 NP_001191232 NP_958816 NP_958817
## description region distance subregion insideDistance exonnumber
## 1 close to 3' close to 3' 815 <NA> NA NA
## 2 downstream downstream 121816 <NA> NA NA
## 3 upstream upstream 454170 <NA> NA NA
## 4 upstream upstream 454475 <NA> NA NA
## 5 inside exon inside 289903 inside exon 0 16
## 6 inside exon inside 289899 inside exon 0 16
## nexons UTR strand geneL codingL Entrez subjectHits
## 1 1 <NA> + 60 NA 100500815 15
## 2 10 <NA> - 98119 97930 149998 142
## 3 25 <NA> - 134537 133653 5651 579
## 4 25 <NA> - 134537 133653 5651 579
## 5 16 3'UTR - 290585 230434 351 354
## 6 16 3'UTR - 290585 230434 351 354
## You can use derfinderPlot::plotOverview() to visualize this information
We can check the base-level coverage information for some of our DERs. In this example we do so for the first 5 ERs.
## Extract the region coverage
regionCov <- regionMat$chr21$bpCoverage
plotRegionCoverage(regions = ers, regionCoverage = regionCov,
groupInfo = pheno$group, nearestAnnotation = annotation,
annotatedRegions = annoRegs, whichRegions = seq_len(5), txdb = txdb, scalefac = 1, ask = FALSE, verbose = FALSE)
You can then use the regionReport package to generate interactive HTML reports exploring the results.
If you are interested in using derfinder we recommend checking the derfinder users guide and good luck with your analyses!
R
is an open-source statistical environment which can be easily modified to enhance its functionality via packages. derfinder is a R
package available via the Bioconductor repository for packages. R
can be installed on any operating system from CRAN after which you can install derfinder by using the following commands in your R
session:
## try http:// if https:// URLs are not supported
source("https://bioconductor.org/biocLite.R")
biocLite("derfinder")
derfinder is based on many other packages and in particular in those that have implemented the infrastructure needed for dealing with RNA-seq data. That is, packages like Rsamtools, GenomicAlignments and rtracklayer that allow you to import the data. A derfinder user is not expected to deal with those packages directly but will need to be familiar with GenomicRanges to understand the results derfinder generates. It might also prove to be highly beneficial to check the BiocParallel package for performing parallel computations.
If you are asking yourself the question “Where do I start using Bioconductor?” you might be interested in this blog post.
As package developers, we try to explain clearly how to use our packages and in which order to use the functions. But R
and Bioconductor
have a steep learning curve so it is critical to learn where to ask for help. The blog post quoted above mentions some but we would like to highlight the Bioconductor support site as the main resource for getting help. Other alternatives are available such as creating GitHub issues and tweeting. However, please note that if you want to receive help you should adhere to the posting guidlines. It is particularly critical that you provide a small reproducible example and your session information so package developers can track down the source of the error.
We would like to highlight the derfinder user Jessica Hekman. She has used derfinder with non-human data, and in the process of doing so discovered some small bugs or sections of the documentation that were not clear.
We hope that derfinder will be useful for your research. Please use the following information to cite the package and the overall approach. Thank you!
## Citation info
citation('derfinder')
##
## Collado-Torres L, Nellore A, Frazee AC, Wilks C, Love MI, Irizarry
## RA, Leek JT and Jaffe AE (2016). "Flexible expressed region
## analysis for RNA-seq with derfinder." _bioRxiv_. doi:
## 10.1101/015370 (URL: http://doi.org/10.1101/015370), <URL:
## http://biorxiv.org/content/early/2016/05/07/015370>.
##
## Frazee AC, Sabunciyan S, Hansen KD, Irizarry RA and Leek JT
## (2014). "Differential expression analysis of RNA-seq data at
## single-base resolution." _Biostatistics_, *15 (3)*, pp. 413-426.
## doi: 10.1093/biostatistics/kxt053 (URL:
## http://doi.org/10.1093/biostatistics/kxt053), <URL:
## http://biostatistics.oxfordjournals.org/content/15/3/413.long>.
This package was made possible thanks to:
Code for creating the vignette
## Create the vignette
library('rmarkdown')
system.time(render('derfinder-quickstart.Rmd', 'BiocStyle::html_document'))
## Extract the R code
library('knitr')
knit('derfinder-quickstart.Rmd', tangle = TRUE)
## Clean up
file.remove('quickstartRef.bib')
## [1] TRUE
Date the vignette was generated.
## [1] "2016-05-15 21:54:23 PDT"
Wallclock time spent generating the vignette.
## Time difference of 51.536 secs
R
session information.
## Session info -----------------------------------------------------------------------------------------------------------
## setting value
## version R version 3.3.0 (2016-05-03)
## system x86_64, linux-gnu
## ui X11
## language en_US:
## collate C
## tz <NA>
## date 2016-05-15
## Packages ---------------------------------------------------------------------------------------------------------------
## package * version date source
## AnnotationDbi * 1.34.2 2016-05-16 Bioconductor
## AnnotationHub 2.4.2 2016-05-16 Bioconductor
## BSgenome 1.40.0 2016-05-16 Bioconductor
## Biobase * 2.32.0 2016-05-16 Bioconductor
## BiocGenerics * 0.18.0 2016-05-16 Bioconductor
## BiocInstaller 1.22.2 2016-05-16 Bioconductor
## BiocParallel 1.6.2 2016-05-16 Bioconductor
## BiocStyle * 2.0.2 2016-05-16 Bioconductor
## Biostrings 2.40.0 2016-05-16 Bioconductor
## DBI 0.4-1 2016-05-08 CRAN (R 3.3.0)
## DESeq2 * 1.12.2 2016-05-16 Bioconductor
## Formula 1.2-1 2015-04-07 CRAN (R 3.3.0)
## GGally 1.0.1 2016-01-14 CRAN (R 3.3.0)
## GenomeInfoDb * 1.8.2 2016-05-16 Bioconductor
## GenomicAlignments 1.8.0 2016-05-16 Bioconductor
## GenomicFeatures * 1.24.2 2016-05-16 Bioconductor
## GenomicFiles 1.8.0 2016-05-16 Bioconductor
## GenomicRanges * 1.24.0 2016-05-16 Bioconductor
## Hmisc 3.17-4 2016-05-02 CRAN (R 3.3.0)
## IRanges * 2.6.0 2016-05-16 Bioconductor
## Matrix 1.2-6 2016-05-02 CRAN (R 3.3.0)
## OrganismDbi 1.14.0 2016-05-16 Bioconductor
## R6 2.1.2 2016-01-26 CRAN (R 3.3.0)
## RBGL 1.48.0 2016-05-16 Bioconductor
## RColorBrewer 1.1-2 2014-12-07 CRAN (R 3.3.0)
## RCurl 1.95-4.8 2016-03-01 CRAN (R 3.3.0)
## RJSONIO 1.3-0 2014-07-28 CRAN (R 3.3.0)
## RSQLite 1.0.0 2014-10-25 CRAN (R 3.3.0)
## Rcpp 0.12.5 2016-05-14 CRAN (R 3.3.0)
## RefManageR 0.10.13 2016-04-04 CRAN (R 3.3.0)
## Rsamtools 1.24.0 2016-05-16 Bioconductor
## S4Vectors * 0.10.0 2016-05-16 Bioconductor
## SummarizedExperiment * 1.2.2 2016-05-16 Bioconductor
## TxDb.Hsapiens.UCSC.hg19.knownGene * 3.2.2 2016-05-04 Bioconductor
## VariantAnnotation 1.18.1 2016-05-16 Bioconductor
## XML 3.98-1.4 2016-03-01 CRAN (R 3.3.0)
## XVector 0.12.0 2016-05-16 Bioconductor
## acepack 1.3-3.3 2014-11-24 CRAN (R 3.3.0)
## annotate 1.50.0 2016-05-16 Bioconductor
## bibtex 0.4.0 2014-12-31 CRAN (R 3.3.0)
## biomaRt 2.28.0 2016-05-16 Bioconductor
## biovizBase 1.20.0 2016-05-16 Bioconductor
## bitops 1.0-6 2013-08-17 CRAN (R 3.3.0)
## bumphunter * 1.12.0 2016-05-16 Bioconductor
## chron 2.3-47 2015-06-24 CRAN (R 3.3.0)
## cluster 2.0.4 2016-04-18 CRAN (R 3.3.0)
## codetools 0.2-14 2015-07-15 CRAN (R 3.3.0)
## colorspace 1.2-6 2015-03-11 CRAN (R 3.3.0)
## data.table 1.9.6 2015-09-19 CRAN (R 3.3.0)
## derfinder * 1.6.4 2016-05-16 Bioconductor
## derfinderData * 0.106.0 2016-05-08 Bioconductor
## derfinderHelper 1.6.3 2016-05-16 Bioconductor
## derfinderPlot * 1.6.3 2016-05-16 Bioconductor
## devtools * 1.11.1 2016-04-21 CRAN (R 3.3.0)
## dichromat 2.0-0 2013-01-24 CRAN (R 3.3.0)
## digest 0.6.9 2016-01-08 CRAN (R 3.3.0)
## doRNG 1.6 2014-03-07 CRAN (R 3.3.0)
## ensembldb 1.4.2 2016-05-16 Bioconductor
## evaluate 0.9 2016-04-29 CRAN (R 3.3.0)
## foreach * 1.4.3 2015-10-13 CRAN (R 3.3.0)
## foreign 0.8-66 2015-08-19 CRAN (R 3.3.0)
## formatR 1.4 2016-05-09 CRAN (R 3.3.0)
## genefilter 1.54.2 2016-05-16 Bioconductor
## geneplotter 1.50.0 2016-05-16 Bioconductor
## ggbio 1.20.1 2016-05-16 Bioconductor
## ggplot2 2.1.0 2016-03-01 CRAN (R 3.3.0)
## graph 1.50.0 2016-05-16 Bioconductor
## gridExtra 2.2.1 2016-02-29 CRAN (R 3.3.0)
## gtable 0.2.0 2016-02-26 CRAN (R 3.3.0)
## htmltools 0.3.5 2016-03-21 CRAN (R 3.3.0)
## httpuv 1.3.3 2015-08-04 CRAN (R 3.3.0)
## httr 1.1.0 2016-01-28 CRAN (R 3.3.0)
## interactiveDisplayBase 1.10.3 2016-05-16 Bioconductor
## iterators * 1.0.8 2015-10-13 CRAN (R 3.3.0)
## knitcitations * 1.0.7 2015-10-28 CRAN (R 3.3.0)
## knitr 1.13 2016-05-09 CRAN (R 3.3.0)
## lattice 0.20-33 2015-07-14 CRAN (R 3.3.0)
## latticeExtra 0.6-28 2016-02-09 CRAN (R 3.3.0)
## limma 3.28.4 2016-05-16 Bioconductor
## locfit * 1.5-9.1 2013-04-20 CRAN (R 3.3.0)
## lubridate 1.5.6 2016-04-06 CRAN (R 3.3.0)
## magrittr 1.5 2014-11-22 CRAN (R 3.3.0)
## matrixStats 0.50.2 2016-04-24 CRAN (R 3.3.0)
## memoise 1.0.0 2016-01-29 CRAN (R 3.3.0)
## mime 0.4 2015-09-03 CRAN (R 3.3.0)
## munsell 0.4.3 2016-02-13 CRAN (R 3.3.0)
## nnet 7.3-12 2016-02-02 CRAN (R 3.3.0)
## org.Hs.eg.db * 3.3.0 2016-05-04 Bioconductor
## pkgmaker 0.22 2014-05-14 CRAN (R 3.3.0)
## plyr 1.8.3 2015-06-12 CRAN (R 3.3.0)
## qvalue 2.4.2 2016-05-16 Bioconductor
## registry 0.3 2015-07-08 CRAN (R 3.3.0)
## reshape 0.8.5 2014-04-23 CRAN (R 3.3.0)
## reshape2 1.4.1 2014-12-06 CRAN (R 3.3.0)
## rmarkdown 0.9.6 2016-05-01 CRAN (R 3.3.0)
## rngtools 1.2.4 2014-03-06 CRAN (R 3.3.0)
## rpart 4.1-10 2015-06-29 CRAN (R 3.3.0)
## rtracklayer 1.32.0 2016-05-16 Bioconductor
## scales 0.4.0 2016-02-26 CRAN (R 3.3.0)
## shiny 0.13.2 2016-03-28 CRAN (R 3.3.0)
## stringi 1.0-1 2015-10-22 CRAN (R 3.3.0)
## stringr 1.0.0 2015-04-30 CRAN (R 3.3.0)
## survival 2.39-4 2016-05-11 CRAN (R 3.3.0)
## withr 1.0.1.9000 2016-05-05 Github (jimhester/withr@bd42181)
## xtable 1.8-2 2016-02-05 CRAN (R 3.3.0)
## yaml 2.1.13 2014-06-12 CRAN (R 3.3.0)
## zlibbioc 1.18.0 2016-05-16 Bioconductor
This vignette was generated using BiocStyle (Oleś, Morgan, and Huber, 2016) with knitr (Xie, 2014) and rmarkdown (Allaire, Cheng, Xie, McPherson, et al., 2016) running behind the scenes.
Citations made with knitcitations (Boettiger, 2015).
[1] J. Allaire, J. Cheng, Y. Xie, J. McPherson, et al. rmarkdown: Dynamic Documents for R. R package version 0.9.6. 2016. URL: https://CRAN.R-project.org/package=rmarkdown.
[2] J. D. S. with contributions from Andrew J. Bass, A. Dabney and D. Robinson. qvalue: Q-value estimation for false discovery rate control. R package version 2.4.2. 2015. URL: http://github.com/jdstorey/qvalue.
[3] S. Arora, M. Morgan, M. Carlson and H. Pages. GenomeInfoDb: Utilities for manipulating chromosome and other ‘seqname’ identifiers. R package version 1.8.2. 2016.
[4] C. Boettiger. knitcitations: Citations for ‘Knitr’ Markdown Files. R package version 1.0.7. 2015. URL: https://CRAN.R-project.org/package=knitcitations.
[5] BrainSpan. “Atlas of the Developing Human Brain [Internet]. Funded by ARRA Awards 1RC2MH089921-01, 1RC2MH090047-01, and 1RC2MH089929-01.” 2011. URL: http://developinghumanbrain.org.
[6] M. Carlson and B. P. Maintainer. TxDb.Hsapiens.UCSC.hg19.knownGene: Annotation package for TxDb object(s). R package version 3.2.2. 2015.
[7] L. Collado-Torres, A. Jaffe and J. Leek. derfinderData: Processed BigWigs from BrainSpan for examples. R package version 0.106.0. 2015. URL: https://github.com/lcolladotor/derfinderData.
[8] A. C. Frazee, S. Sabunciyan, K. D. Hansen, R. A. Irizarry, et al. “Differential expression analysis of RNA-seq data at single-base resolution”. In: Biostatistics 15 (3) (2014), pp. 413-426. DOI: 10.1093/biostatistics/kxt053. URL: http://biostatistics.oxfordjournals.org/content/15/3/413.long.
[9] F. E. Harrell Jr, with contributions from Charles Dupont and many others. Hmisc: Harrell Miscellaneous. R package version 3.17-4. 2016. URL: https://CRAN.R-project.org/package=Hmisc.
[10] A. E. Jaffe, P. Murakami, H. Lee, J. T. Leek, et al. “Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies”. In: International journal of epidemiology 41.1 (2012), pp. 200–209. DOI: 10.1093/ije/dyr238.
[11] A. E. Jaffe, P. Murakami, H. Lee, J. T. Leek, et al. “Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies”. In: International Journal of Epidemiology (2012).
[12] M. Lawrence, R. Gentleman and V. Carey. “rtracklayer: an R package for interfacing with genome browsers”. In: Bioinformatics 25 (2009), pp. 1841-1842. DOI: 10.1093/bioinformatics/btp328. URL: http://bioinformatics.oxfordjournals.org/content/25/14/1841.abstract.
[13] M. Lawrence, W. Huber, H. Pagès, P. Aboyoun, et al. “Software for Computing and Annotating Genomic Ranges”. In: PLoS Computational Biology 9 (8 2013). DOI: 10.1371/journal.pcbi.1003118. URL: http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1003118}.
[14] M. Morgan, V. Obenchain, M. Lang and R. Thompson. BiocParallel: Bioconductor facilities for parallel evaluation. R package version 1.6.2. 2016. URL: https://github.com/Bioconductor/BiocParallel.
[15] M. Morgan, H. Pagès, V. Obenchain and N. Hayden. Rsamtools: Binary alignment (BAM), FASTA, variant call (BCF), and tabix file import. R package version 1.24.0. 2016. URL: http://bioconductor.org/packages/release/bioc/html/Rsamtools.html.
[16] V. Obenchain, M. Love and M. Morgan. GenomicFiles: Distributed computing by file or by range. R package version 1.8.0. 2016.
[17] A. Oleś, M. Morgan and W. Huber. BiocStyle: Standard styles for vignettes and other Bioconductor documents. R package version 2.0.2. 2016. URL: https://github.com/Bioconductor/BiocStyle.
[18] H. Pages, M. Carlson, S. Falcon and N. Li. AnnotationDbi: Annotation Database Interface. R package version 1.34.2. 2014.
[19] H. Pagès, M. Lawrence and P. Aboyoun. S4Vectors: S4 implementation of vectors and lists. R package version 0.10.0. 2016.
[20] R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. Vienna, Austria, 2016. URL: https://www.R-project.org/.
[21] H. Wickham. ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York, 2009. ISBN: 978-0-387-98140-6. URL: http://ggplot2.org.
[22] H. Wickham. “testthat: Get Started with Testing”. In: The R Journal 3 (2011), pp. 5–10. URL: http://journal.r-project.org/archive/2011-1/RJournal_2011-1_Wickham.pdf.
[23] H. Wickham and W. Chang. devtools: Tools to Make Developing R Packages Easier. R package version 1.11.1. 2016. URL: https://CRAN.R-project.org/package=devtools.
[24] Y. Xie. “knitr: A Comprehensive Tool for Reproducible Research in R”. In: Implementing Reproducible Computational Research. Ed. by V. Stodden, F. Leisch and R. D. Peng. ISBN 978-1466561595. Chapman and Hall/CRC, 2014. URL: http://www.crcpress.com/product/isbn/9781466561595.
[25] T. Yin, M. Lawrence and D. Cook. biovizBase: Basic graphic utilities for visualization of genomic data. R package version 1.20.0. 2016.