Page last updated: 04 May, 2019

1 Intro

These pages walk through making different types of plots. We use simulated data of the kind epidemiology researchers are likely to have access to, a larger data set, genome-wide association (GWAS) data, and a correlation matrix of the kind researchers may come across in their work. You can download the .Rmd file from which this page is built by clicking the ‘Code’ tab at the top of the page.

First, we need some simulated data to use for the plots. We can also spice up the plots by choosing our own colours and deciding on a plot style.

2 Data

2.1 Basic data

Simulate some data of the kind that is likely to be available to an epidemiology researcher. The data have a simple structure: a top level (group), subgroups (subgroup) and individual entries (name).

# Data - create a data frame of simulated data to use in our plots
name <- rep(LETTERS[1:10], length.out = 30)
group <- as.factor(rep(c("Red", "Green", "Black"), length.out = 30))
subgroup <- as.factor(rep(1:2, length.out = 30))
estimate <- rnorm(n = 30, mean = 0, sd = 0.1)
se <- abs(estimate) / 3 # abs() keeps the standard error positive
p <- runif(n = 30, min = 0, max = 0.1)
data <- data.frame(name, group, subgroup, estimate, se, p)
data$group <- factor(data$group, levels = c("Black", "Red", "Green")) # set the order of the groups
dim(data)
## [1] 30  6
head(data)
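
A common next step with an estimate and its standard error is to build 95% confidence intervals for plotting. The sketch below is one way to do that. It regenerates a small data frame so that it runs on its own; the seed, the `abs()` around the standard error, and the `ci_data`, `lower` and `upper` names are assumptions made for illustration, not part of the data above.

```r
# Sketch: 95% confidence intervals from an estimate and its standard error
set.seed(1)                       # assumed seed, only for reproducibility
estimate <- rnorm(n = 30, mean = 0, sd = 0.1)
se <- abs(estimate) / 3           # abs() keeps the standard error positive
ci_data <- data.frame(estimate, se)

z <- qnorm(0.975)                 # about 1.96 for a two-sided 95% interval
ci_data$lower <- ci_data$estimate - z * ci_data$se
ci_data$upper <- ci_data$estimate + z * ci_data$se
head(ci_data)
```

The `lower` and `upper` columns can then be passed to something like `geom_errorbar()` or `geom_pointrange()` in a forest-style plot.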

2.2 Larger data

For a larger and more comprehensive data set, the ggplot2movies package (a data package that accompanies ggplot2) contains a large set of film data from IMDB that we can use. It covers thousands of films (title) with their year, length, budget, rating and number of votes, plus indicator columns for genre, which we collapse below into a single top-level grouping (Genre).

##### Packages
library(ggplot2)
library(ggplot2movies)

##### Data
movies <- movies # copy the packaged data set into the workspace

##### Create a genre variable for the data set
# Columns 18 to 24 are 0/1 genre indicators; a film flagged in several
# genres keeps the last matching one
genre <- rep("Unknown", nrow(movies))
for (i in 18:24) {
  genre[movies[[i]] == 1] <- names(movies)[i]
}
movies$Genre <- as.factor(genre)

dim(movies)
## [1] 58788    25
head(movies)
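
The genre loop is easier to follow on a small example. The sketch below applies the same logic to a hypothetical four-film data frame (the `toy` object, the film titles and the three indicator columns are made up for illustration) and shows what happens to a film that is flagged in more than one genre.

```r
# Toy stand-in for the movies data: three indicator columns, four films
toy <- data.frame(
  title  = c("Film A", "Film B", "Film C", "Film D"),  # hypothetical titles
  Action = c(1, 0, 1, 0),
  Comedy = c(0, 1, 1, 0),
  Drama  = c(0, 0, 1, 0)
)

genre <- rep("Unknown", nrow(toy))   # default for films with no flag set
for (i in 2:4) {                     # columns 2 to 4 hold the indicators
  genre[toy[[i]] == 1] <- names(toy)[i]
}
toy$Genre <- factor(genre)
toy[, c("title", "Genre")]
```

Film C is flagged in all three columns and ends up as Drama, the last column the loop visits. Film D has no flag set and stays Unknown.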

2.3 Genome wide data

For GWAS data, the qqman package includes a dummy GWAS data set (gwasResults) that we can use. It is not as large as a full GWAS data set, but it provides enough information for testing plots.

library(qqman)
gwas_data <- gwasResults
dim(gwas_data)
## [1] 16470     4
head(gwas_data)
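
If qqman is not installed, a data frame with the same four columns (SNP, CHR, BP and P) can be simulated instead. The sketch below is one way to do it; the seed, the number of chromosomes and SNPs, and the `toy_gwas` and `logp` names are all assumptions for illustration, and the uniform p-values carry no real association signal.

```r
# Sketch: simulate a small GWAS-like data frame with the columns
# SNP, CHR, BP and P (the same column names as qqman's gwasResults)
set.seed(1)                       # assumed seed, only for reproducibility
n_chr <- 5                        # assumed number of chromosomes
n_snp <- 200                      # assumed number of SNPs per chromosome

toy_gwas <- data.frame(
  SNP = paste0("rs", seq_len(n_chr * n_snp)),
  CHR = rep(seq_len(n_chr), each = n_snp),
  BP  = rep(seq_len(n_snp), times = n_chr),
  P   = runif(n_chr * n_snp)      # uniform p-values, i.e. no true signal
)

# Manhattan plots usually show -log10(P) on the y axis
toy_gwas$logp <- -log10(toy_gwas$P)
dim(toy_gwas)
```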

2.4 Correlation matrix

Simulate a symmetric matrix with ones on the diagonal, standing in for a correlation matrix of the kind an epidemiology researcher is likely to come across.

matrix_data <- matrix(runif(100, min = 0, max = 1), ncol = 10)
# Mirror the lower triangle so the upper and lower triangles are the same
lower <- matrix_data * lower.tri(matrix_data)
matrix_data <- lower + t(lower)
diag(matrix_data) <- 1

dim(matrix_data)
## [1] 10 10
head(as.data.frame(matrix_data))
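
The matrix above is symmetric with a unit diagonal, which is enough for testing plots. Filling it with uniform draws does not guarantee a mathematically valid correlation matrix, though, and it contains no negative values. If that matters, one alternative is to simulate some variables and call `cor()` on them. The sketch below does this; the seed, the dimensions and the `sim_vars` and `cor_data` names are assumptions for illustration.

```r
# Sketch: build a correlation matrix from simulated variables
set.seed(1)                                     # assumed seed
sim_vars <- matrix(rnorm(50 * 10), ncol = 10)   # 50 observations, 10 variables
colnames(sim_vars) <- paste0("V", 1:10)

cor_data <- cor(sim_vars)    # Pearson correlations between the columns
dim(cor_data)
isSymmetric(cor_data)
range(cor_data)              # includes negative correlations
```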

3 Theme

I use a custom ggplot theme for my plots because I don’t like the themes ggplot uses by default. The code for my theme is at the end of the document in the ‘My theme’ section. You can adapt this, start from scratch, edit specific aspects of a theme with the theme() function, or use a preset theme from ggplot such as theme_bw().
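
As a rough sketch of two of those options, the example below starts from the preset `theme_bw()` and then overrides individual elements with `theme()`. The `toy` data frame and the particular elements changed are assumptions chosen for illustration.

```r
library(ggplot2)

# Hypothetical toy data, only to have something to draw
toy <- data.frame(x = 1:10, y = (1:10)^2, g = rep(c("a", "b"), 5))

p <- ggplot(toy, aes(x = x, y = y, colour = g)) +
  geom_point() +
  theme_bw() +                                  # preset theme as the base
  theme(legend.position = "bottom",             # then override single elements
        panel.grid.minor = element_blank())
p
```

Order matters here: `theme()` comes after `theme_bw()` so that the overrides are not reset by the preset theme.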

In addition, I use custom discrete and continuous colour palettes built with the wesanderson and yarrr packages.

##### Packages
library("wesanderson")
library("yarrr")
##### Palettes
d1 <- wes_palette("Royal1", type = "discrete")
d2 <- wes_palette("GrandBudapest2", type = "discrete")
d3 <- wes_palette("Cavalcanti1", type = "discrete")
discrete_wes_pal <- c(d1, d2, d3)
continuous_wes_pal <- wes_palette("Zissou1", 100, type = "continuous")

##### Display the palettes
d1

d2

d3

continuous_wes_pal
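
A continuous palette is, in effect, a handful of anchor colours interpolated into many steps. The sketch below shows that idea with base R's `colorRampPalette()`, so it runs without any extra packages. The three hex codes are placeholder colours chosen for illustration, and the `anchors`, `make_pal` and `continuous_pal` names are hypothetical.

```r
# Placeholder anchor colours (assumed for illustration)
anchors <- c("#2C7BB6", "#FFFFBF", "#D7191C")

# colorRampPalette() returns a function that interpolates n colours
make_pal <- grDevices::colorRampPalette(anchors)
continuous_pal <- make_pal(100)

length(continuous_pal)
head(continuous_pal, 3)

# Show the gradient as a strip of 100 coloured cells
image(matrix(1:100, ncol = 1), col = continuous_pal, axes = FALSE)
```

A vector like this can be handed to `scale_colour_gradientn(colours = ...)` in ggplot2, and a discrete vector of colours to `scale_colour_manual(values = ...)`.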

4 Hidden code

4.1 My theme

my_theme <- function() {
  ggplot2::theme(
    
    # high level arguments
    
    ## line - all line elements (element_line())
    line = element_line(colour = "black",
                        size = 0.5,
                        linetype = 1,
                        lineend = "butt",
                        arrow = FALSE),

    ## rectangular - all rectangular elements (element_rect())
    rect = element_rect(fill = NULL,
                        colour = NULL,
                        size = 0.5,
                        linetype = 1),
    
    ## text - all text elements (element_text())
    text = element_text(family = "Helvetica",
                        size = 11,
                        hjust = 0.5,
                        vjust = 0,
                        face = "bold",
                        color = "#222222"),
    
    ## title - all title elements: plot, axes, legends (element_text(); inherits from text)
    title = element_text(family = "Helvetica",
                         size = 11,
                         hjust = 0.5,
                         vjust = 0,
                         face = "bold",
                         color = "#222222"),
    ## margin
    #margin = margin(t = 5.5, r = 5.5, b = 5.5, l = 5.5, unit = "pt"),    
    
    
    # axis
    
    ## title - labels of axes (element_text()). Specify all axes' labels (axis.title), labels by plane (using axis.title.x or axis.title.y), or individually for each axis (using axis.title.x.bottom, axis.title.x.top, axis.title.y.left, axis.title.y.right). axis.title.*.* inherits from axis.title.* which inherits from axis.title, which in turn inherits from text
    axis.title = element_text(family = "Helvetica", size = 11, hjust = 0.5, vjust = 0.5, face = "bold", color = "#222222"), 
    axis.title.x = element_text(family = "Helvetica", size = 11, hjust = 0.5, vjust = 0.5, face = "bold", color = "#222222"),
    axis.title.y = element_text(family = "Helvetica", size = 11, hjust = 0.5, vjust = 0.5, face = "bold", color = "#222222"),
    axis.title.x.top = element_text(family = "Helvetica", size = 11, hjust = 0.5, vjust = 0.5, face = "bold", color = "#222222"),
    axis.title.x.bottom = element_text(family = "Helvetica", size = 11, hjust = 0.5, vjust = 0.5, face = "bold", color = "#222222"),
    axis.title.y.left = element_text(family = "Helvetica", size = 11, hjust = 0.5, vjust = 0.5, face = "bold", color = "#222222"),
    axis.title.y.right = element_text(family = "Helvetica", size = 11, hjust = 0.5, vjust = 0.5, face = "bold", color = "#222222"),
    
    ## text of tick labels - tick labels along axes (element_text()). Specify all axis tick labels (axis.text), tick labels by plane (using axis.text.x or axis.text.y), or individually for each axis (using axis.text.x.bottom, axis.text.x.top, axis.text.y.left, axis.text.y.right). axis.text.*.* inherits from axis.text.* which inherits from axis.text, which in turn inherits from text
    axis.text = element_text(family = "Helvetica", size = 11, hjust = 0.5, vjust = 0.5, face = "bold", color = "#222222"), 
    
    ## ticks - tick marks along axes (element_line()). Specify all tick marks (axis.ticks), ticks by plane (using axis.ticks.x or axis.ticks.y), or individually for each axis (using axis.ticks.x.bottom, axis.ticks.x.top, axis.ticks.y.left, axis.ticks.y.right). axis.ticks.*.* inherits from axis.ticks.* which inherits from axis.ticks, which in turn inherits from line
    axis.ticks = element_line(colour = "black", size = 0.5, linetype = 1, lineend = "butt", arrow = FALSE),
    
    ## tick length - length of tick marks (unit)
    axis.ticks.length = unit(2.75, "pt"),
    
    ## axis lines - lines along axes (element_line()). Specify lines along all axes (axis.line), lines for each plane (using axis.line.x or axis.line.y), or individually for each axis (using axis.line.x.bottom, axis.line.x.top, axis.line.y.left, axis.line.y.right). axis.line.*.* inherits from axis.line.* which inherits from axis.line, which in turn inherits from line
    axis.line = element_blank(),
   
    
    # legend

    ## background - background of legend (element_rect(); inherits from rect)
    legend.background = element_blank(),

    ## margin - the margin around each legend (margin())
    legend.margin = margin(t = 5.5, r = 5.5, b = 5.5, l = 5.5, unit = "pt"),

    ## spacing - the spacing between legends (unit). legend.spacing.x & legend.spacing.y inherit from legend.spacing or can be specified separately
    legend.spacing = unit(11, "pt"),
    legend.spacing.x = unit(11, "pt"),
    legend.spacing.y = unit(11, "pt"),
    
    ## key - background underneath legend keys (element_rect(); inherits from rect)
    legend.key = element_rect(fill = NULL, colour = NULL, size = 0.5, linetype = 1),

    ## key size - size of legend keys (unit); key background height & width inherit from legend.key.size or can be specified separately
    legend.key.size = unit(1.2, "pt"),
    legend.key.height = unit(1.2, "pt"),
    legend.key.width = unit(1.2, "pt"),
    
    ## text - legend item labels (element_text(); inherits from text)
    legend.text = element_text(family = "Helvetica", size = 11, hjust = 0.5, vjust = 0.5, face = "bold", color = "#222222"), 

    ## text alignment - alignment of legend labels (number from 0 (left) to 1 (right))
    legend.text.align = 0, 
    
    ## title - title of legend (element_text(); inherits from title)
    legend.title = element_text(family = "Helvetica", size = 11, hjust = 0.5, vjust = 0.5, face = "bold", color = "#222222"), 

    ## title alignment - alignment of legend title (number from 0 (left) to 1 (right))
    legend.title.align = 0.5,
    
    ## position - the position of legends ("none", "left", "right", "bottom", "top", or two-element numeric vector)
    legend.position = "bottom", #c(0.5,0)

    ## direction - layout of items in legends ("horizontal" or "vertical")
    legend.direction = "horizontal",
    
    ## justification - anchor point for positioning legend inside plot ("center" or two-element numeric vector) or the justification according to the plot area when positioned outside the plot
    legend.justification = "center", # c(0.5,0.5)

    ## multiple legends 
    ### arrangement - arrangement of multiple legends ("horizontal" or "vertical")
    legend.box = "horizontal",
    
    ### justification - justification of each legend within the overall bounding box, when there are multiple legends ("top", "bottom", "left", or "right")
    legend.box.just = "top",

    ### margin - margins around the full legend area, as specified using margin()
    legend.box.margin = margin(t = 5.5, r = 5.5, b = 5.5, l = 5.5, unit = "pt"),
    
    ### background - background of legend area (element_rect(); inherits from rect)
    legend.box.background = element_blank(),
    
    ### spacing - The spacing between the plotting area and the legend box (unit)
    legend.box.spacing = unit(11, "pt"),
    
    
    # panel
    
    ## background - background of plotting area, drawn underneath plot (element_rect(); inherits from rect)
    panel.background = element_blank(), 
    
    ## border - border around plotting area, drawn on top of plot so that it covers tick marks and grid lines. This should be used with fill = NA (element_rect(); inherits from rect)
    panel.border = element_blank(), 
    
    ## spacing - spacing between facet panels (unit). panel.spacing.x & panel.spacing.y inherit from panel.spacing or can be specified separately.
    panel.spacing = unit(5.5, "pt"),
    panel.spacing.x = unit(5.5, "pt"),
    panel.spacing.y = unit(5.5, "pt"),
    
    ## grid - grid lines (element_line()). Specify major grid lines, or minor grid lines separately (using panel.grid.major or panel.grid.minor) or individually for each axis (using panel.grid.major.x, panel.grid.minor.x, panel.grid.major.y, panel.grid.minor.y). Y axis grid lines are horizontal and x axis grid lines are vertical. panel.grid.*.* inherits from panel.grid.* which inherits from panel.grid, which in turn inherits from line
    panel.grid  = element_blank(),
    panel.grid.major = element_blank(),
    panel.grid.minor = element_blank(),
    panel.grid.major.x = element_blank(),
    panel.grid.minor.x = element_blank(),
    panel.grid.major.y = element_blank(),
    panel.grid.minor.y = element_blank(),
    
    ## ontop - option to place the panel (background, gridlines) over the data layers (logical). Usually used with a transparent or blank panel.background.
    panel.ontop = FALSE,
    
    
    # plot
    
    ## background - background of the entire plot (element_rect(); inherits from rect)
    plot.background = element_blank(),

    ## title - plot title (text appearance) (element_text(); inherits from title) left-aligned by default
    plot.title = element_text(family = "Helvetica", size = 16, hjust = 0.5, vjust = 0, face = "bold", color = "#222222"), 
    
    ## subtitle - plot subtitle (text appearance) (element_text(); inherits from title) left-aligned by default
    plot.subtitle = element_text(family = "Helvetica", size = 14, hjust = 0.5, vjust = 0, face = "bold", color = "#222222"), 
    
    ## caption - caption below the plot (text appearance) (element_text(); inherits from title) right-aligned by default
    plot.caption = element_text(family = "Helvetica", size = 8, hjust = 0.5, vjust = 0, face = "bold", color = "#222222"),
    
    ## tag - upper-left label to identify a plot (text appearance) (element_text(); inherits from title) left-aligned by default
    plot.tag = element_text(family = "Helvetica", size = 8, hjust = 0.5, vjust = 0, face = "bold", color = "#222222"),

    ## tag position - the position of the tag as a string ("topleft", "top", "topright", "left", "right", "bottomleft", "bottom", "bottomright") or a coordinate. If a string, extra space will be added to accommodate the tag.
    plot.tag.position = "topleft",
    
    ## margin - margin around entire plot (unit with the sizes of the top, right, bottom, and left margins)
    plot.margin = margin(t = 5.5, r = 5.5, b = 5.5, l = 5.5, unit = "pt"),
   
    
    # strip (used when facetting a plot)

    ## background - background of facet labels (element_rect(); inherits from rect). Horizontal facet background (strip.background.x) & vertical facet background (strip.background.y) inherit from strip.background or can be specified separately
    strip.background = element_blank(),
    strip.background.x = element_blank(), 
    strip.background.y = element_blank(),

    ## placement - placement of strip with respect to axes, either "inside" or "outside". Only important when axes and strips are on the same side of the plot.
    strip.placement = "inside",
    
    ## text - facet labels (element_text(); inherits from text). Horizontal facet labels (strip.text.x) & vertical facet labels (strip.text.y) inherit from strip.text or can be specified separately
    strip.text = element_text(family = "Helvetica", size = 11, hjust = 0.5, vjust = 0.5, angle = 0, face = "bold", color = "#222222"),
    strip.text.x = element_text(family = "Helvetica", size = 11, hjust = 0.5, vjust = 0.5, angle = 0, face = "bold", color = "#222222"),
    strip.text.y = element_text(family = "Helvetica", size = 11, hjust = 0.5, vjust = 0.5, angle = 0, face = "bold", color = "#222222"),
    
    ## padding (facet_grid) - space between strips and axes when strips are switched (unit)
    strip.switch.pad.grid = unit(2.75, "pt"),
    
    ## padding (facet_wrap) - space between strips and axes when strips are switched (unit)
    strip.switch.pad.wrap = unit(2.75, "pt")
  )
}
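
A theme function like this is used by adding its result to a plot. The sketch below uses a much shorter, hypothetical `small_theme()` as a stand-in so that the example runs on its own; the pattern should be the same for `my_theme()`. The `toy` data frame and plot title are also made up for illustration.

```r
library(ggplot2)

# Hypothetical, cut-down stand-in for a full theme function
small_theme <- function() {
  ggplot2::theme(
    panel.background = element_blank(),
    panel.grid       = element_blank(),
    legend.position  = "bottom",
    plot.title       = element_text(size = 16, hjust = 0.5, face = "bold")
  )
}

toy <- data.frame(x = 1:10, y = (1:10)^2, g = rep(c("a", "b"), 5))  # toy data

p <- ggplot(toy, aes(x = x, y = y, colour = g)) +
  geom_point() +
  labs(title = "Toy plot") +
  small_theme()      # swap in my_theme() once it has been defined
p
```

Because the function returns a `theme()` object rather than a complete theme, any element it leaves unset (or sets to NULL) is inherited from whichever theme is already on the plot.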

4.2 Session info

sessionInfo()
## R version 3.5.3 (2019-03-11)
## Platform: x86_64-apple-darwin15.6.0 (64-bit)
## Running under: macOS Mojave 10.14.4
## 
## Matrix products: default
## BLAS: /Library/Frameworks/R.framework/Versions/3.5/Resources/lib/libRblas.0.dylib
## LAPACK: /Library/Frameworks/R.framework/Versions/3.5/Resources/lib/libRlapack.dylib
## 
## locale:
## [1] en_GB.UTF-8/en_GB.UTF-8/en_GB.UTF-8/C/en_GB.UTF-8/en_GB.UTF-8
## 
## attached base packages:
## [1] stats     graphics  grDevices utils     datasets  methods   base     
## 
## loaded via a namespace (and not attached):
##  [1] compiler_3.5.3  magrittr_1.5    tools_3.5.3     htmltools_0.3.6
##  [5] base64enc_0.1-3 yaml_2.2.0      Rcpp_1.0.1      stringi_1.4.3  
##  [9] rmarkdown_1.12  knitr_1.22      stringr_1.4.0   xfun_0.6       
## [13] digest_0.6.18   evaluate_0.13
