01 February, 2023

Contributing authors:

Data analysis, code, and maintenance of this notebook: Carmen Lia Murall, Raphaël Poujol, Susanne Kraemer, Arnaud N’Guessan, Sarah Otto, Art Poon, Jesse Shapiro, Fiona Brinkman, Justin Jia, Zohaib Anwar and Erin Gill. Input and direction by other members of Pillar 6 (https://covarrnet.ca/our-team/#pillar-6 ), which include: Caroline Colijn, Jorg Fritz, Morgan Langille, Paul Gordon, Julie Hussin, Jeff Joy, and William Hsiao.

Sequence collection, generation, release, and feedback on analyses: Canadian laboratories as part of the CPHLN and CanCOGeN are making these data publicly available and contribute feedback on analyses presented here. A complete list of lab authors is in this repository, and more details are below in the Acknowledgement section.

Introduction

This notebook is built to explore Canadian SARS-CoV-2 genomic and epidemiological data with the aim of investigating viral evolution and spread. It is for discussion with pillar 6’s team and for sharing with collaborators, e.g. public health labs. These analyses can spur further research within or across pillars, be used for reports (or data dashboards), support discussions with the science communication pillar for public dissemination, and enable code reuse by public health authorities/laboratories for their internal use.

Canadian genomic and epidemiological data will be regularly pulled from various public sources (see list below) to keep these analyses up-to-date. Only representations of aggregate data will be posted here.

Important caveats and disclaimers:
These analyses represent only a snapshot of SARS-CoV-2 evolution in Canada. Only some infections are detected by PCR testing, only some of those are sent for whole-genome sequencing, and not all sequences are posted to public facing reposittories. Sequencing volumes and priorities have changed during the pandemic, and the sequencing strategy is typically a combination of prioritizing outbreaks, travellers, public health investigations, and random sampling for genomic surveillance.

For example, specific variants or populations might be preferentially sequenced at certain times in certain jurisdictions. When possible, these differences in sampling strategies are mentioned but they are not always known. With the arrival of the Omicron wave, many jurisdictions across Canada reached testing and sequencing capacity mid-late December 2021 and thus switched to targeted testing of priority groups (e.g., hospitalized patients, health care workers, and people in high-risk settings). Therefore, from this time onward, case counts are likely underestimated and the sequenced virus diversity is not necessarily representative of the virus circulating in the overall population.

Thus, interpretation of these plots and comparisons between health regions should be made with caution, considering that the data may not be fully representative. These analyses are subject to frequent change given new data and updated lineage designations.

## 1. LOAD processed metadata of Canadian sequences (with latest pangolin, division, and full seq IDs)
#Download metadata from VirusSeq, put the date here:

# this can be made more compact for faster loading
meta <- read.csv(gzfile("data_needed/virusseq.metadata.csv.gz"), sep="\t")
meta$province <- meta$geo_loc_name_state_province_territory

# Select only the column we want to use later
columnlist=c("fasta_header_name", "province", "host_gender", "host_age_bin",
             "sample_collection_date", "sample_collected_by", 
             "purpose_of_sampling", "purpose_of_sequencing","lineage", 
             "raw_lineage")
meta <- meta[ , columnlist]


### metadata cleaning 
unknown.str <- c("Undeclared", "Not Provided", "Restricted Access", "Missing", 
                 "Not Applicable","","NA","unknow")
meta <- as.data.frame(apply(meta, 2, function(x) {
  x[is.element(x, unknown.str)] <- "Unknown"
  x
}))

meta$sample_collection_date <- as.Date(meta$sample_collection_date)
meta$week <- cut(meta$sample_collection_date, 'week')
meta$month <- gsub("-..$","",as.character(cut(meta$sample_collection_date, 'month')))

source("scripts/scanlineages.R")
meta$pango_group <- create.pango.group(VOCVOI, meta)
meta$pango_group <- as.factor(meta$pango_group)
## 2. LOAD epidemiological data (PHAC)

#from: https://health-infobase.canada.ca/covid-19/epidemiological-summary-covid-19-cases.html?stat=num&measure=total&map=pt#a2
epidataCANall <- read.csv(url("https://health-infobase.canada.ca/src/data/covidLive/covid19-download.csv"))
epidataCANall$date <- as.Date(epidataCANall$date)
epidataCANall$prname <- gsub('_', ' ', epidataCANall$prname)
epidate <- tail(epidataCANall,1)$date #download date

epidataCANall$previousvalue <- 0
#small loop to get the numtoday column from previous versions of this file from the cumulative cases
for(row in 1:nrow(epidataCANall)) {
  p <- epidataCANall[row, "prname"]
  subdf <- epidataCANall[which( 
    (epidataCANall$date > epidataCANall[row, "date"] & epidataCANall$prname==p) 
    ), ]
  if(nrow(subdf) != 0) {
    nextrow <- which( (epidataCANall$date == min(subdf$date) & epidataCANall$prname==p))
    epidataCANall[nextrow, "previousvalue"] <- epidataCANall[row, "totalcases"]
  }
}
epidataCANall$numtoday <- epidataCANall$totalcases - epidataCANall$previousvalue

SARS-CoV-2 in Canada

Current Situation

[Note some recent updates to the analyses, including the ability to view variants showing most growth by province.]

BQ.1.1 is peaking at this time in Canada, with XBB.1.5 still the primary variant of interest due to its mutations, plus growth significantly in multiple provinces with the most recent data. Some other BQ.1*, BF.7, CH.1.1, and select recombinants are growing, but not as significantly as XBB.1.5 right now. Differences between some provinces have become more significant, so province-specific analysis is becoming increasingly important. Mutational analysis shows continued steady growth in prevalence of the S:F486P mutation of note (associated with immune evasion and improved binding to the human ACE2 receptor). Cases with this mutation are now almost exclusively XBB.1.5.

Variants of current interest, due to their current/potential growth advantage, mutations of potential functional significance, or spread in other countries. Note that XBB.1.5 is still of primary note and some of these have only been observed internationally. See also the evolving plots below of “Fastest growing lineages in Canada”:

  • BA.2 and BA.2.75 new subvariants, including possible misclassifications of XBB.1.5 as BA.2.10.1.
  • BE.1.1 and other BA.5-type variants with S:F486P
  • BF.7 new subvariants not yet further classified
  • BM.1.1.1
  • BN.1* subvariants due to high predicted immune evasion
  • BQ.1* new descendants
  • BW.1, BW.1.1 (descendants of BA.5.6, with S:F486A)
  • CH.1.1 (descendant of BM which is BA.2.75.3) and some sublineages recently detected in Canada such as CH.1.1.2
  • CJ.1 with S:F486P (but not showing growth in Canada)
  • DS.1 (highest predicted immune evasion)
  • XAY.2 (BA.2/Delta recombinant with Delta’s S:L452R mutation plus the S:F486P mutation)
  • XAY.3 (BA.2/Delta that also has S:F486P - a first case detected in Canada but not now showing growth)
  • XBB.1, XBB.1.5, XBB.1.9.1, XBB.6.1 and subvariants with S:F486P
  • XBF (BA.5.2/CJ.1 recombinant; has S:F486P and immune evasion like XBB.1.5) - but growth not notable
  • XBK and XBL (have S:F486P and immune evasion like XBB.1.5)
  • XBM (first cases detected in Canada and the US; BA.2.76 / BF.3 recombinant, but growth not notable)
  • …and other variants with mutations associated with immune evasion or higher binding affinity to the human ACE2 receptor (examples: S:R346T and S:F486P), particularly those variants with sets of mutations of interest.

Omicron sublineages in Canada

Here we take a look, sub-dividing the major sub-lineages currently circulating in Canada.

source("scripts/subtype_plotter.R")
sublineagestoplot = data.frame(name=c("BA.1*","BA.2*","BA.2*",
                                      "BA.4*","BA.5.1*","BA.5.2*",
                                      "BQ*","BA.5*","B*"),
                               tabname=c("BA.1","early BA.2","late BA.2",
                                         "BA.4","BA.5.1","BA.5.2",
                                         "BQ","Other BA.5","Divergent lineages"),
                               mindate=c("","","2022-08-01",
                                         "","","",
                                         "","","2022-03-01"),
                               maxdate=c("2022-06-01","2022-08-01","",
                                         "","","",
                                         "","",""),
                               exclude_previously_plotted=c(FALSE,FALSE,FALSE,
                                                            FALSE,FALSE,FALSE,
                                                            FALSE,TRUE,TRUE))

    
plotted=c()
tmp=apply(sublineagestoplot,1,function(sub){
  cat("###", sub[["tabname"]], "\n")
  cat("####", sub[["tabname"]],"sublineages")
  mindate=NA
  maxdate=NA
  if(sub[["mindate"]]!=""){
    cat(" from",sub[["mindate"]])
    mindate=as.Date(sub[["mindate"]])
  }
  if(sub[["maxdate"]]!=""){
    cat(" until",sub[["maxdate"]])
    maxdate=as.Date(sub[["maxdate"]])
  }
  cat("\n\n")
  set=getStrictoSubLineages(sub[["name"]],meta)
  if(sub[["exclude_previously_plotted"]]){
    set=set[!set %in% plotted]
  }
  rarelineages_names=plot.subvariants(sublineage=set,mindate=mindate,maxdate=maxdate)
  rarelineages_names=plot.subvariants(sublineage=set,mindate=mindate,maxdate=maxdate,scale=T)
  plotted <<- unique(c(plotted,set))
  if(rarelineages_names!=""){
    cat("The grey zone represent rare lineages (not in top 15) : ",rarelineages_names)
  }
  cat("\n\n")})

BA.1

BA.1 sublineages until 2022-06-01

The grey zone represent rare lineages (not in top 15) : BA.1.1.3(1), BA.1.13.1(1), BA.1.15.3(1), BA.1.17.1(1), BA.1.5(1), BA.1.8(2), BD.1(2), BA.1.1.11(3), BA.1.1.8(3), BA.1.12(3), BA.1.7(3), BA.1.1.4(4), BA.1.1.9(4), BA.1.10(4), BA.1.14.2(4), BA.1.1.17(5), BA.1.14.1(5), BA.1.9(7), BA.1.16(10), BA.1.1.13(12), BA.1.1.7(15), BC.2(18), BA.1.15.2(27), BA.1.1.15(46), BA.1.21(55), BA.1.19(100), BA.1.1.1(164), BA.1.13(165), BA.1.15.1(176), BA.1.1.2(284)

early BA.2

early BA.2 sublineages until 2022-08-01

The grey zone represent rare lineages (not in top 15) : BA.2.10.3(1), BA.2.24(1), BA.2.25(1), BA.2.3.8(1), BA.2.34(1), BA.2.38.3(1), BA.2.58(1), BA.2.59(1), BA.2.63(1), BA.2.73(1), BA.2.75.3(1), BM.1(1), BM.4.1(1), BA.2.14(2), BA.2.15(2), BA.2.16(2), BA.2.2.1(2), BA.2.27(2), BA.2.3.5(2), BA.2.3.9(2), BA.2.30(2), BA.2.4(2), BA.2.42(2), BA.2.45(2), BA.2.71(2), BA.2.9.6(2), BH.1(2), BA.2.11(3), BA.2.12.2(3), BA.2.3.14(3), BA.2.38.2(3), BA.2.53(3), BA.2.54(3), BA.2.55(3), BA.2.68(3), BA.2.70(3), BA.2.9.4(3), BA.2.29(4), BA.2.3.12(4), BA.2.50(4), BA.2.62(4), BA.2.3.13(5), BA.2.3.15(5), BA.2.75.1(5), BA.2.78(5), BA.2.79(5), BA.2.41(6), BA.2.44(6), BA.2.47(6), BA.2.23.1(7), BA.2.3.16(7), BA.2.35(7), BA.2.64(7), BA.2.81(7), BG.6(7), BA.2.43(8), BA.2.60(8), BA.2.3.7(9), BA.2.61(9), BA.2.31.1(10), BA.2.38.1(10), BA.2.17(11), BA.2.22(11), BA.2.52(11), BA.2.72(11), BA.2.9.1(11), BA.2.26(12), BA.2.5(12), BA.2.6(12), BA.2.9.2(12), BA.2.49(13), BA.2.51(13), BA.2.8(14), BA.2.9.7(14), BG.4(14), BA.2.9.5(16), BA.2.40.1(17), BA.2.13.1(18), BA.2.32(21), BA.2.2(24), BA.2.7(25), BA.2.48(27), BA.2.74(29), BA.2.9.3(29), BG.5(29), BA.2.3.1(31), BA.2.31(36), BA.2.56(39), BA.2.13(47), BA.2.3.17(57), BA.2.75(59), BA.2.3.2(62), BA.2.36(66), BA.2.23(82), BA.2.76(92), BA.2.1(93), BA.2.10.1(100), BA.2.37(106), BG.2(112), BA.2.3.11(123)

late BA.2

late BA.2 sublineages from 2022-08-01

The grey zone represent rare lineages (not in top 15) : BA.2.13(1), BA.2.13.1(1), BA.2.21(1), BA.2.40.1(1), BA.2.47(1), BA.2.48(1), BA.2.75.10(1), BA.2.75.7(1), BA.2.75.9(1), BA.2.78(1), BA.2.82(1), BA.2.9(1), BA.2.9.7(1), BG.5(1), BL.1.3(1), BM.2(1), BM.4.1.1(1), CB.1(1), CH.1(1), CH.2(1), XBB.1.3(1), XBB.4(1), XBH(1), BA.2.1(2), BA.2.38(2), BA.2.38.2(2), BA.2.65(2), BN.2(2), CA.3(2), CJ.1(2), XBB.1.4(2), BA.2.75.6(3), BJ.1(3), BL.5(3), BR.1(3), CA.1(3), CM.3(3), BA.2.18(4), BA.2.3(4), BL.4(4), XBB.1.1(4), BA.2.2(5), BM.1(5), BN.1.7(5), CM.4(5), BH.1(6), BL.3(6), BM.1.1.1(6), BN.2.1(6), BP.1(6), BA.2.74(7), BA.2.75.4(7), BG.2(7), CH.1.1.1(7), BM.2.1(8), BN.1.9(8), BR.3(8), BY.1.2(8), BN.1.2.1(9), BN.5(9), BN.6(9), BN.3.1(10), BR.4(10), CM.9(10), CV.1(10), CH.1.1.2(11), BL.2(14), BN.1.4(15), XBB.3(16), BA.2.76(17), CA.5(18), CM.8.1(18), CA.7(19), BR.1.2(20), BM.1.1(21), BA.2.75.3(22), BR.2.1(23), BA.2.10.1(26), XBB(27), CM.1(28), BL.1(35), XBB.2(38), BN.1.2(40), BN.1.3.1(41), BN.1.5(42)

BA.4

BA.4 sublineages

The grey zone represent rare lineages (not in top 15) : BA.4.1.3(1), BA.4.8(1), BA.4.3(2), BA.4.1.10(3), BA.4.1.4(3), BA.4.1.9(3), BA.4.5(5), DC.1(5)

BA.5.1

BA.5.1 sublineages

The grey zone represent rare lineages (not in top 15) : BA.5.1.26(1), BT.2(1), BA.5.1.16(2), BA.5.1.9(3), DJ.1.1(4), BA.5.1.19(5), DL.1(5), BA.5.1.31(6), CL.1(6), BA.5.1.28(7), BA.5.1.21(8), BA.5.1.8(8), BT.1(8), DE.2(11), BA.5.1.4(14), BA.5.1.17(23), BA.5.1.27(51), BA.5.1.20(53), BA.5.1.18(71), DE.1(75), BA.5.1.30(85)

BA.5.2

BA.5.2 sublineages

The grey zone represent rare lineages (not in top 15) : BA.5.2.10(1), BA.5.2.38(1), BA.5.2.39(1), BF.17(1), BF.23(1), BF.24(1), BF.3.1(1), CD.1(1), CP.5(1), CT.1(1), DA.1(1), DG.1(1), BF.19(2), BF.29(2), BF.32(2), BF.7.10(2), BF.7.3(2), CR.2(2), BF.30(3), BF.7.1(3), BF.7.4.2(3), BU.3(3), CR.1.1(3), BF.7.12(4), BV.2(4), CR.1.2(4), BA.5.2.36(5), BF.18(5), BF.34(6), BF.11.4(7), BU.2(8), BA.5.2.12(9), BA.5.2.7(9), BF.31(9), BF.1.1(10), BA.5.2.44(11), CK.2.1(11), BF.7.8(12), CR.1(12), CK.2(13), CK.3(13), BU.1(14), CP.1(15), BA.5.2.37(16), CE.1(17), CN.1(19), BA.5.2.4(20), CK.2.1.1(20), BF.15(21), CG.1(21), BF.6(23), BF.20(24), BA.5.2.19(33), BA.5.2.32(36), BF.12(36), DB.1(39), BF.25(40), BA.5.2.18(42), BA.5.2.31(43), BA.5.2.16(44), BF.16(44), BA.5.2.23(59), BF.2(59), BA.5.2.33(62), BA.5.2.35(66), BA.5.2.24(71), CK.1(72), BA.5.2.26(77), BF.31.1(85), BA.5.2.14(86), BF.11.1(90), BF.3(94), BF.7.5(101), BF.7.6(102), BF.14(103), BF.7.7(116), BA.5.2.28(122), BF.7.4.1(122), BA.5.2.25(136), BA.5.2.27(144), BF.11.3(165), BF.13(170), BF.7.4(173), BA.5.2.6(178), BF.4(204), BF.8(220), BA.5.2.2(223), BA.5.2.34(227), BF.1(256), BF.21(262), BA.5.2.8(273), BF.11(306)

BQ

BQ sublineages

The grey zone represent rare lineages (not in top 15) : BQ.1.1.17(1), BQ.1.1.22(1), BQ.1.1.26(1), BQ.1.26(1), BQ.1.9(1), BQ.2(1), BQ.1.1.14(2), BQ.1.20(2), BQ.1.4(2), BQ.1.1.24(3), BQ.1.1.28(3), BQ.1.7(3), BQ.1.18(4), BQ.1.8.2(4), BQ.1.1.23(5), BQ.1.1.25(9), BQ.1.19(10), BQ.1.6(12), BQ.1.1.2(14), BQ.1.1.19(17), BQ.1.1.6(24), BQ.1.15(24), BQ.1.1.13(31), BQ.1.1.15(37), BQ.1.1.8(44), BQ.1.22(60), BQ.1.23(66), BQ.1.16(68), BQ.1.8(73), BE.1.1.1(103), BQ.1.11(108), BQ.1.10(119), BQ.1.1.18(136), BQ.1.1.7(141)

Other BA.5

Other BA.5 sublineages

The grey zone represent rare lineages (not in top 15) : BE.1.4.4(1), BE.4.2(1), CQ.1(1), BA.5.3.4(2), BA.5.7(2), BE.4.1.1(2), BE.6(2), CQ.2(3), CQ.1.1(4), XBE(4), BE.1.4.2(5), BA.5.6.4(6), BE.2(9), BA.5.5.3(11), BE.9(12), BA.5.3.3(16), BA.5.6.1(16), DF.1(17), BE.4.1(18), BE.1.4.1(20), BA.5.3(24), BA.5.6.2(24), CC.1(34), BA.5.5.1(35), BW.1(36), BA.5.3.2(38), BE.1.2(47), BA.5.10.1(59), BA.5.5.2(62), BW.1.1(67)

Divergent lineages

Divergent lineages sublineages from 2022-03-01

The grey zone represent rare lineages (not in top 15) : AY.103(1), AY.27(1), XBC.1(1), XW(1), AY.25.1(2), B.1(2), B.1.1(2), B.1.1.529(2), XAF(2), XAP(2), XQ(2), XAG(3), XAY(3), XN(3), XZ(3)

Fastest growing lineages in Canada

Here we show the selection estimates and their 95% confidence intervals for Pango lineages with more than 10 sequences in Canada since 2022-09-13 and with enough data to estimate the confidence interval. Each selection estimate measures the growth rate relative to BQ.1 stricto (i.e., sequences designated as BQ.1 and not its descendants, like BQ.1.1). Plots showing the change in variant frequency over time in Canada are given below for lineages with more than 50 sequences.

Plot (stricto)

Background shading illustrates sub-variants that we need to keep an eye on in blue (s ~ 0-5%, corresponding to doubling times of more than two weeks), those whose rapid growth is likely to displace currently common strains in pink (s ~ 5-10% with one to two week doubling times), and those with a growth advantage characteristic of a variant of concern in orange (s > 10%+ with less than one week doubling times).

Estimating selection of sub-variants with low sequence counts (10-100 dots) is prone to error, such as mistaking one-time super spreader events or pulses of sequence data from one regions as selection. These estimates should be considered as very preliminary.

source("scripts/plot_growing_lineages.R")
selectparam <- function(p,reg){
  if(!any(is.na(p$fit))){
    x=p$mut[[1]];
    p$region==reg&
    (p$fit)$fit[["s1"]]>0 & 
    sum(p$toplot$n2)>10&
    substr(x, nchar(x), nchar(x))!="*"
  }else{FALSE}
  }

tmp=apply(all.regions,1,function(reg){
  cat("####", reg[["shortname"]], "\n")
  cat("##### Plot single lineages in",reg[["name"]],"\n","\n")
  paramselected=allparams[sapply(allparams,function(x){selectparam(x,reg[["name"]])})]
  n=min(25,length(paramselected))
  if(n!=0){
    a=plot_growing_lineage(paramselected[1:n])
  }else{
    cat("![<img height=\"100\"/>](img/NotEnoughData.jpg)")
  }
  cat("\n\n")})

Canada

Plot single lineages in Canada

BC

Plot single lineages in British Columbia

AB

Plot single lineages in Alberta

SK

Plot single lineages in Saskatchawan

MA

Plot single lineages in Manitoba

ON

Plot single lineages in Ontario

QC

Plot single lineages in Quebec

NS

Plot single lineages in Nova Scotia

NB

Plot single lineages in New Brunswick

NL

Plot single lineages in Newfoundland and Labrador

Plot (non stricto)

This plot highlights the groups of related lineages that are growing fastest (e.g., BE.1.1.1* is the monophyletic clade that includes BE.1.1.1 and BQ.1*).

Again, background shading illustrates sub-variants that we need to keep an eye on in blue (s ~ 0-5%, corresponding to doubling times of more than two weeks), those whose rapid growth is likely to displace currently common strains in pink (s ~ 5-10% with one to two week doubling times), and those with a growth advantage characteristic of a variant of concern in orange (s > 10%+ with less than one week doubling times).

Estimating selection of sub-variants with low sequence counts (10-100 dots) is prone to error, such as mistaking one-time super spreader events or pulses of sequence data from one regions as selection. These estimates should be considered as very preliminary.

source("scripts/plot_growing_lineages.R")
selectparam <- function(p,reg){
  if(!any(is.na(p$fit))){
    x=p$mut[[1]];
    p$region==reg&
    (p$fit)$fit[["s1"]]>0 & 
    sum(p$toplot$n2)>10&
    substr(x, nchar(x), nchar(x))=="*"
  }else{FALSE}
  }

tmp=apply(all.regions,1,function(reg){
  cat("####", reg[["shortname"]], "\n")
  cat("##### Plot monophyletic groups of lineages in",reg[["name"]],"\n","\n")
  paramselected=allparams[sapply(allparams,function(x){selectparam(x,reg[["name"]])})]
  n=min(25,length(paramselected))
  if(n!=0){
    a=plot_growing_lineage(paramselected[1:n])
  }else{
    cat("![](img/NotEnoughData.jpg)")
  }
  cat("\n\n")})

Canada

Plot monophyletic groups of lineages in Canada

BC

Plot monophyletic groups of lineages in British Columbia

AB

Plot monophyletic groups of lineages in Alberta

SK

Plot monophyletic groups of lineages in Saskatchawan

MA

Plot monophyletic groups of lineages in Manitoba

ON

Plot monophyletic groups of lineages in Ontario

QC

Plot monophyletic groups of lineages in Quebec

NS

Plot monophyletic groups of lineages in Nova Scotia

NB

Plot monophyletic groups of lineages in New Brunswick

NL

Plot monophyletic groups of lineages in Newfoundland and Labrador

Table of all the selection estimates

paramselected=allparams[sapply(allparams,function(p){if(!any(is.na(p$fit))){
  x=p$mut[[1]];
  (p$fit)$fit[["s1"]]>0 & 
  sum(p$toplot$n2)>10
  }else{FALSE}})]
DT::datatable(plot_growing_lineage(paramselected,makeplot=FALSE))

Mutational composition of Omicron

Tabulation of the most predominant mutational changes in Omicron, with adjacent rows comparing the composition of Canadian sublineages to that sublineage globally.

Mutational profile of Omicron and its sublineages in Canada and globally for the most prevalent (>75%) point mutations in each category (based on the 444202 genomes available on VirusSeq on February 01, 2023).

Selection on Omicron

Here we examine the relative rate of spread of the different sublineages of Omicron currently in Canada. Specifically, we determine if a new or emerging lineage has a selective advantage, s (and by how much), against a reference lineage previously common in Canada (see the methods for more details about selection and how it is estimated).

Currently, the major group of Omicron lineages rising in frequency are the BQ.* group, which descend from BA.5*. We first show this growth of BQ.* and other BA.5* (here excluding BQ.*), relative to the remaining strains, which consist predominantly of BA.2* and BA.4*. Left plot: y-axis is the proportion of sub-lineages BA.4 and BA.5 among Omicron; right plot: y-axis describes the logit function, log(freq(BQ.* or BA.5*)/freq(other)), which gives a straight line whose slope is the selection coefficient if selection is constant over time (see methods).

For comparison, Alpha had a selective advantage of s ~ 6%-11% per day over preexisting SARS-CoV-2 lineages, and Delta had a selective advantage of about 10% per day over Alpha.

Caveat: These selection analyses must be interpreted with caution due to the potential for non-representative sampling, lags in reporting, and spatial heterogeneity in prevalence of different sublineages across Canada. Provinces that do not have at least 20 sequences of BQ.*, BA.5*, and other lineages during this time frame are not displayed.

source("scripts/plot_selection_estimator.R")

startpar3 <- list(p=c(0.8, 0.1), s=c(0.1, 0.1))

setAll=getAllStrictoLineages(meta)
sublineages_BA5 <- getStrictoSubLineages("BA.5*",meta)
sublineages_BQ <- getStrictoSubLineages("BQ*",meta)

setAll=setAll[!setAll %in% sublineages_BA5]
sublineages_BA5=sublineages_BA5[!sublineages_BA5 %in% sublineages_BQ]

#color for each lineage
col <- c(pal["Omicron BA.5"], pal["Omicron BQ"])

#Set a starting date
#Note that the startdate shouldn't be too much before both alleles become common
#or rare migration events that die off could throw off the estimation procedure 
#(so that the parameter estimates account for the presence of those alleles long in the past).
startdate<-max(meta$sample_collection_date)-days(120) #Using a later date with less sampling noise

sub.plot.selection.estimate <- function(region,maxdate=NA){
  maxdate=plot.selection.estimate(region=region,
                                        startdate=startdate, startpar=startpar3,
                                        reference=c(setAll),
                                        mutants=list(sublineages_BA5,sublineages_BQ),
                                        names=list("BA.5*","BQ*","the rest"),
                                        maxdate=maxdate, col=col)
  return(maxdate)
}

#####################################################
# tabs for displaying in notebook
#each PT tab should have curve plot and breakpoint plot side by side

Canada

Canada

BC

British Columbia

## [1] "Not enough data available"

AB

Alberta

SK

Saskatchewan

MB

Manitoba

ON

Ontario

QC

Quebec

East

East provinces

BA.2 sublineages selection

source("scripts/scanlineages.R")
plotsubvariant <- function(region,min,parentalnode){
  nplot=0
  for(p in allparams) {
    if(!any(is.na(p$fit)) && 
       p$region==region && 
       substr(p$mut[[1]], nchar(p$mut[[1]]), nchar(p$mut[[1]]))!="*" &&
       isSubLineage(parentalnode,p$mut[[1]])){
      if((p$fit)$fit[["s1"]]>0 && sum(p$toplot$n2)>min){
          plot.selection(plotparam=p)
          nplot=nplot+1
      }
    }
  }
  if(nplot==0){
    print("Not enough data available")
  }
}


plotba2 <- function(region,min){
  plotsubvariant(region,min,"BA.2*")
}



plotba5 <- function(region,min){
  plotsubvariant(region,min,"BA.5*")
}


plotothers <- function(region,min){
  nplot=0
  for(p in allparams) {
    if(!any(is.na(p$fit)) && 
       p$region==region && 
       substr(p$mut[[1]], nchar(p$mut[[1]]), nchar(p$mut[[1]]))!="*" &&
       !isSubLineage("BA.2*",p$mut[[1]])){
      if((p$fit)$fit[["s1"]]>0 && sum(p$toplot$n2)>min){
          plot.selection(plotparam=p)
          nplot=nplot+1
      }
    }
  }
  if(nplot==0){
    print("Not enough data available")
  }
}

Here we show the trends of the various BA.2.* sublineages over time, relative to the frequency of BQ.1 by itself (shown for sublineages with at least 50 (Canada) or 20 (provinces) cases). Proportions shown here are only among BQ.1 (stricto) and the lineage illustrated. Note that these plots are not necessarily representative of trends in each province and that mixing of data from different provinces may lead to shifts in frequency that are not due to selection.

tmp=apply(all.regions,1,function(reg){
  cat("####", reg[["shortname"]], "\n")
  n_min=20
  if(reg[["name"]]=="Canada"){n_min=50}
  cat("#####",reg[["name"]],"\n","\n")
  plotba2(reg[["name"]],n_min)
  cat("\n\n")})

Canada

Canada

BC

British Columbia

[1] “Not enough data available”

AB

Alberta

SK

Saskatchawan

[1] “Not enough data available”

MA

Manitoba

[1] “Not enough data available”

ON

Ontario

QC

Quebec

NS

Nova Scotia

[1] “Not enough data available”

NB

New Brunswick

[1] “Not enough data available”

NL

Newfoundland and Labrador

[1] “Not enough data available”

BA.5 sublineages selection

Here we show the trends of the various BA.5# sublineages over time, relative to the frequency of BQ.1 by itself (shown for sublineages with at least 50 (Canada) or 20 (provinces) cases). Proportions shown here are only among BQ.1 (stricto) and the lineage illustrated. Note that these plots are not necessarily representative of trends in each province and that mixing of data from different provinces may lead to shifts in frequency that are not due to selection.

tmp=apply(all.regions,1,function(reg){
  cat("####", reg[["shortname"]], "\n")
  n_min=20
  if(reg[["name"]]=="Canada"){n_min=50}
  cat("#####",reg[["name"]],"(n>",n_min,")\n","\n")
  plotba5(reg[["name"]],n_min)
  cat("\n\n")})

Canada

Canada (n> 50 )

BC

British Columbia (n> 20 )

[1] “Not enough data available”

AB

Alberta (n> 20 )

SK

Saskatchawan (n> 20 )

[1] “Not enough data available”

MA

Manitoba (n> 20 )

ON

Ontario (n> 20 )

QC

Quebec (n> 20 )

NS

Nova Scotia (n> 20 )

NB

New Brunswick (n> 20 )

NL

Newfoundland and Labrador (n> 20 )

[1] “Not enough data available”

Variants in Canada over time

These plots show the changing composition of sequences for all Canadian data posted to the VirusSeq Portal according to Pango lineage designation (Pango version 4.2 (Viral AI)), up to 2023-01-21. Because sampling and sequencing procedures vary by region and time, this does not necessarily reflect the true composition of SARS-CoV-2 viruses in Canada over time.

# focus on emergence of VoCs in Canada

meta1 <- meta[as.Date(meta$week) > Variants_Canada_over_time_Start_Date, ]
meta1$week <- as.factor(as.character(meta1$week))

dat <- lapply(unique(meta1$province), function(x) {
  as.data.frame.matrix(table(meta1$week[meta1$province==x],
                             meta1$pango_group[meta1$province==x]))
})
names(dat) <- unique(meta1$province)

dat[['Canada']] <- as.data.frame.matrix(table(meta1$week, meta1$pango_group))

# pass colour legend to JavaScript layer
dat[['legend']] <- as.list(VOCVOI$color)
names(dat[['legend']]) <- VOCVOI$name
dat[['legend']]$other <- 'grey'

r2d3(data=toJSON(dat), script="js/barplot.js", container="div",
     elementId="barplot-element")
rm(dat)
rm(meta1)

From the beginning of the pandemic to the fall of 2021, Canadian sequences were mostly of the wildtype lineages (pre-VOCs). By the beginning of summer 2021, the VOCs Alpha and Gamma were the most sequenced lineages overall in Canada. The Delta wave grew during the summer of 2021 with sublineages AY.25 and AY.27 constituting sizeable proportions of this wave. Omicron arrived in November of 2021 and spread in three main waves, first BA.1* (early 2022), then BA.2* (spring 2022), then BA.5* (summer 2022). Current, multiple sublineages of Omicron persist, with emerging sublineages spreading, such as BQ.1.1 (a BA.5 sub-lineage). See below for jurisdictional differences of these plots.

There are two Pango lineages that have a Canadian origin and that predominately spread within Canada (with some exportations internationally): B.1.438.1 and B.1.1.176. Other lineages of historical interest in Canada:

Canadian trees

Here we present a subsampled phylogenetic snapshot of SARS-CoV-2 genomes from Canada.

### metadata and trees
source("scripts/tree.r")

# load trees from files
mltree <- read.tree("./data_needed/aligned_allSeqs_sample1.rtt.nwk")
ttree <- read.tree("./data_needed/aligned_allSeqs_sample1.timetree.nwk")
#stopifnot(all(sort(mltree$tip.label) == sort(ttree$tip.label)))
dateseq <- seq(ymd('2019-12-01'), ymd('2022-12-01'), by='3 month')

# tips are labeled with [fasta name]_[lineage]_[coldate]
# extracting just the first part makes it easier to link to metadata
mltree$tip.label <- reduce.tipnames(mltree$tip.label)
ttree$tip.label <- reduce.tipnames(ttree$tip.label)


fieldnames<- c("fasta_header_name", "province", "host_gender", "host_age_bin",
               "sample_collected_by", "purpose_of_sampling",
               "lineage", "pango_group","month","week")

# extract rows from metadata table that correspond to ttree 
metasub1 <- meta[meta$fasta_header_name%in% ttree$tip.label, fieldnames]
# sort rows to match tip labels in tree
metasub1 <- metasub1[match(ttree$tip.label, metasub1$fasta_header_name), ]

#omi tree metadata
metasub_omi <- metasub1[grepl("Omicron",metasub1$pango_group ), ]

#scale to number of mutations
mltree$edge.length <- mltree$edge.length*29903
mltree <- ladderize(mltree, FALSE)

###Time Tree
ttree$edge.length[ttree$edge.length == 0] <- 1e-4
#ttree <- ladderize(ttree, FALSE)

hab=unique(meta$host_age_bin)
hab=hab[order(hab)]
months=unique(meta$month)
months=as.character(months[order(months)])
weeks=unique(meta$week)
weeks=as.character(weeks[order(weeks)])
presetColors=data.frame(name=c("other",
                               VOCVOI$name,
                               hab,
                               months,
                               weeks), 
                        color=c("#777777",
                                VOCVOI$color,
                                rev(hcl.colors(length(hab)-1, "Berlin")),"#777777",
                                hcl.colors(length(months), "Berlin"),
                                hcl.colors(length(weeks), "Berlin")
                                ))

#suppressWarnings({
#  res <- ace(metasub1$pango.group, ttree2, type="discrete", model="ER")
#})
#idx <- apply(res$lik.anc, 1, which.max)[2:nrow(res$lik.anc)]  # exclude root edge
#anc <- levels(as.factor(metasub1$pango.group))[idx]
timeTreeJsonObj <- DrawTree(ttree, metasub1, "timetree", presetColors, fieldnames)

#diversity ML tree
diversityTreeJsonObj <- DrawTree(mltree, metasub1, "mltree", presetColors, fieldnames)

### omicron diversity tree
MLtree_omi<-keep.tip(mltree, metasub_omi$fasta_header_name)

OmicrondiversityTreeJsonObj <- DrawTree(MLtree_omi, metasub_omi, "omimltree", presetColors, fieldnames)

Time Tree

Diversity Tree

Omicron diversity tree

Root-to-tip analyses

The slope of root-to-tip plots over time provide an estimate of the substitution rate. A lineage with a steeper positive slope than average for SARS-CoV-2 is accumulating mutations at a faster pace, while a lineage that exhibits a jump up (a shift in intercept but not slope) has accumulated more than expected numbers of mutations in a transient period of time (similar to what we saw with Alpha when it first appeared in the UK).

get.tipnames <- function(tip.label) {
sapply(tip.label, function(x) {
  tokens <- strsplit(x, "_")[[1]]
  ntok <- length(tokens)
  paste(tokens[1:(ntok-2)], collapse='_')
  })
}

source("scripts/fit-rtt.R")
fit1 <- fit.rtt("./data_needed/aligned_allSeqs_sample1.rtt.nwk", plot=TRUE)

fit2 <- fit.rtt("data_needed/aligned_allSeqs_sample2.rtt.nwk", plot=FALSE)
fit3 <- fit.rtt("data_needed/aligned_allSeqs_sample3.rtt.nwk", plot=FALSE)

Molecular clock estimates (based on three independent subsamples)

Here we show the estimate of the substitution rate for 3 independent subsamples of different variants of interest, with their 95% confidence interval.

if(!is.null(fit1)){
est1 <- get.ci(fit1); est1$rep <- 'Rep1'
est2 <- get.ci(fit2); est2$rep <- 'Rep2'
est3 <- get.ci(fit3); est3$rep <- 'Rep3'
sec.frame <- rbind(est1, est2, est3)
sec.frame$est[sec.frame$est < 0] <- 0
sec.frame$lower.95[sec.frame$lower.95 < 0] <- 0
sec.frame=sec.frame[sec.frame$Lineage != "Recombinants",]

pal <- VOCVOI$color
names(pal) <- VOCVOI$name
pal["other"] <- "white"

ggplot(sec.frame, aes(x=Lineage, y=est, group=rep)) + 
  geom_bar(stat="identity", color="black", aes(fill=Lineage), position='dodge') + 
  scale_fill_manual(values=pal) + 
  theme(axis.text.x = element_text(size=9, angle=45, hjust=1, vjust=0.95),
        legend.position='none', panel.grid.major=element_line(colour="grey90")) + 
  geom_errorbar(aes(ymin=lower.95, ymax=upper.95), width=.7,
                position=position_dodge(1)) +
  labs(y="Substitutions / Genome / Day",
       x="Lineage", fill="Subsample")
}

Future development

We are in the process of adding or would like to develop code for some of the following analyses:

  • dN/dS (by variant and by gene/domains)
  • Tajima’s D over time
  • clustering analyses
  • genomically inferred epidemiological parameters: R0, serial interval, etc.

With anonymized data on vaccination status, severity/outcome, reason for sequencing (e.g., outbreak, hospitalization, or general sampling), and setting (workplace, school, daycare, LTC, health institution, other), we could analyze genomic characteristics of the virus relative to the epidemiological and immunological conditions in which it is spreading and evolving. Studies on mutational correlations to superspreading events, vaccination status, or comparisons between variants would allow us to better understand transmission and evolution in these environments.

Methodology

Genome data and metadata are sourced from the Canadian VirusSeq Data Portal. Pango lineage assignments are generated using the pangoLEARN algorithm. Source code for generating this RMarkdown notebook can be found in [https://github.com/CoVaRR-NET/duotang].

Trees

Phylogenetic trees

Canadian genomes were obtained from the VirusSeq data on the February 01, 2023 and down-sampled to two genomes per lineage, province and month before October 2021, and five genomes per lineage, province and month after October 2021 (about 10,000 genomes in total). We used a Python wrapper of minimap2 (version 2.17) to generate multiple sequence alignments for these genome samples. A maximum likelihood (ML) tree was reconstructed from each alignment using the COVID-19 release of IQ-TREE (version 2.2.0). Outliers were identified in by root-to-tip regression using the R package ape and removed from the dataset. TreeTime was used to reconstruct a time-scaled tree under a strict molecular clock model. The resulting trees were converted into interactive plots with ggfree and r2d3.

Mutational composition

Mutation composition graph

We extracted mutation frequencies from unaligned genomes using a custom Python wrapper of minimap2 (version 2.17). These data were supplemented with genomic data and metadata from the NCBI GenNank database, curated by the Nextstrain development team. We used these outputs to generate mutational graphs reporting mutations seen in at least 75% of sequences in the respective variants of concern in Canada. Bars are colored by substitution type, and the corresponding amino acid changes are shown. Genomic position annotations were generated in Python using SnpEFF.

Selection

Selection Coefficents

To estimate selection, we used standard likelihood techniques. In brief, sublineages of interest were prespecified (e.g., BA.1, BA.1.1, BA.2) and counts by day tracked over time. If selection were constant over time, the frequency of sub-type \(i\) at time \(t\) would be expected to rise according to \[p_i(t) = \frac{p_i(0) \exp(s_i t)}{\sum_j p_j(0) \exp(s_j t)},\] where \(s_i\) is the selection coefficient favouring sub-type \(i\). A selection coefficient of \(s_i=0.1\) implies that sub-type \(i\) is expected to rise from 10% to 90% frequency in 44 days (in \(4.4./s_i\) days for other values of \(s_i\)).

At any given time \(t\), the probability of observing \(n_i\) sequences of sublineage \(i\) is multinomially distributed, given the total number of sequences from that day and the frequency of each \(p_i(t)\). Consequently, the likelihood of seeing the observed sequence data over all times \(t\) and over all sublineages \(j\) is proportional to \[L = \prod_t \prod_j p_i(t)^{n_i(t)}.\]

The BBMLE package in R was used to maximize the likelihood of the observed data (using the default optimization method, optim). For each selection coefficient, 95% confidence intervals were obtained by profile likelihood (using uniroot).

Graphs illustrating the rise in frequency of a variant over time are shown (left panels), with the area of each dot proportional to the number of sequences. 95% confidence bands were obtained by randomly drawing 10,000 sets of parameters (\(p_i\) and \(s_i\) for each sub-type) using RandomFromHessianOrMCMC, assuming a multi-normal distribution around the maximum likelihood point (estimated from the Hessian matrix, Pawitan 2001). At each point in time, the 2.5%-97.5% range of values for \(p_i(t)\) are then shown in the confidence bands.

Logit plots (right panels) show \[ln(\frac{p_i(t)}{p_{ref}(t)})\] relative to a given reference genotype (here BA.1), which gives a line whose slope is the strength of selection \(s_i\). Changes in slope indicate changes in selection on a variant (e.g., see Otto et al.).

These estimates of selection ignore heterogeneity within provinces and may be biased by the arrival of travel-related cases while frequencies are very low. Sampling strategies that oversample clustered cases (e.g., sequencing outbreaks) will introduce additional variation beyond the multinomial expectation, but these should lead to one-time shifts in frequency rather than trends over time. Provinces with sampling strategies that are variant specific are removed, unless explicit information about the variant frequencies is available.

Rates

Root-to-tip estimates of substitution rate

Maximum likelihood tree (IQ-TREE) processed with root-to-tip regression and plotting in R.

Data notes by province

All analyses draw on the most recent publicly available viral sequence data on ViralSeq and should be interpreted with caution due to lags in reporting and sequencing priorities that can differ across provinces or territories. Note that the NCCID provides a timeline of Canadian events related to each variant: https://nccid.ca/covid-19-variants/.

BC

British Columbia

Provincial sequencing strategy includes a subset of representative positive samples and prioritized cases (outbreaks, long-term care, travel-related, vaccine escape, hospitalized). Additional up-to-date covid data for this province can be found here:
http://www.bccdc.ca/health-info/diseases-conditions/covid-19/data-trends

AB

Alberta

Additional up-to-date COVID data for this province can be found here:
https://www.alberta.ca/stats/covid-19-alberta-statistics.htm#variants-of-concern

MB

Manitoba

Additional up-to-date COVID data for this province can be found here:
https://geoportal.gov.mb.ca/apps/manitoba-covid-19/explore

QC

Quebec

Provincial random sequencing has been temporarily suspended as of Feb 8th, 2021. Quebec provides a list of updates on changes to screening and sequencing strategies, found here (in French): https://www.inspq.qc.ca/covid-19/donnees/variants#methodologie. Additiona up-to-date COVID data for this province can be found here:
https://www.inspq.qc.ca/covid-19/donnees/variants

NS

Nova Scotia

Additional up-to-date COVID data for this province can be found here:
https://experience.arcgis.com/experience/204d6ed723244dfbb763ca3f913c5cad

NB

New Brunswick

Additional up-to-date COVID data for this province can be found here:
https://experience.arcgis.com/experience/8eeb9a2052d641c996dba5de8f25a8aa (NB dashboard)

NL

Newfoundland and Labrador

Additional up-to-date COVID data for this province can be found here:
https://covid-19-newfoundland-and-labrador-gnl.hub.arcgis.com/

List of useful tools

Collect a list of bioinformatics, phylogenetic, and modelling tools that are useful for SARS-CoV-2 analyses:

Acknowledgements and sources

We thank all the authors, developers, and contributors to the VirusSeq database for making their SARS-CoV-2 sequences publicly available. We especially thank the Canadian Public Health Laboratory Network, academic sequencing partners, diagnostic hospital labs, and other sequencing partners for the provision of the Canadian sequence data used in this work. Genome sequencing in Canada was supported by a Genome Canada grant to the Canadian COVID-19 Genomic Network (CanCOGeN).

We gratefully acknowledge all the Authors, the Originating laboratories responsible for obtaining the specimens, and the Submitting laboratories for generating the genetic sequence and metadata and sharing via the VirusSeq database, on which this research is based.

  • The Canadian VirusSeq Data Portal (https://virusseq-dataportal.ca) We wish to acknowledge the following organisations/laboratories for contributing data to the Portal: Canadian Public Health Laboratory Network (CPHLN), CanCOGGeN VirusSeq, Saskatchewan - Roy Romanow Provincial Laboratory (RRPL), Nova Scotia Health Authority, Alberta ProvLab North (APLN), Queen’s University / Kingston Health Sciences Centre, National Microbiology Laboratory (NML), Institut National de Sante Publique du Quebec (INSPQ), BCCDC Public Health Laboratory, Public Health Ontario (PHO), Newfoundland and Labrador - Eastern Health, Unity Health Toronto, Ontario Institute for Cancer Research (OICR), Provincial Public Health Laboratory Network of Nova Scotia, Centre Hospitalier Universitaire Georges L. Dumont - New Brunswick, and Manitoba Cadham Provincial Laboratory. Please see the complete list of laboratories included in this repository.

  • Public Health Agency of Canada (PHAC) / National Microbiology Laboratory (NML) - (https://health-infobase.canada.ca/covid-19/epidemiological-summary-covid-19-cases.html)

  • Various provincial public health websites (e.g. INSPQ https://www.inspq.qc.ca/covid-19/donnees/)

Session info

The version numbers of all packages in the current environment as well as information about the R install is reported below.

Hide

Show

sessionInfo()
## R version 4.1.3 (2022-03-10)
## Platform: x86_64-conda-linux-gnu (64-bit)
## Running under: Ubuntu 20.04 LTS
## 
## Matrix products: default
## BLAS/LAPACK: /home/jjjjia/miniconda3/envs/duotang/lib/libopenblasp-r0.3.21.so
## 
## locale:
##  [1] LC_CTYPE=C.UTF-8       LC_NUMERIC=C           LC_TIME=C.UTF-8       
##  [4] LC_COLLATE=C.UTF-8     LC_MONETARY=C.UTF-8    LC_MESSAGES=C.UTF-8   
##  [7] LC_PAPER=C.UTF-8       LC_NAME=C              LC_ADDRESS=C          
## [10] LC_TELEPHONE=C         LC_MEASUREMENT=C.UTF-8 LC_IDENTIFICATION=C   
## 
## attached base packages:
## [1] stats4    parallel  stats     graphics  grDevices utils     datasets 
## [8] methods   base     
## 
## other attached packages:
##  [1] HelpersMG_5.7   Matrix_1.5-1    coda_0.19-4     rlang_1.0.6    
##  [5] MASS_7.3-58.1   bbmle_1.0.25    reshape2_1.4.4  forcats_0.5.2  
##  [9] stringr_1.4.1   dplyr_1.0.10    purrr_0.3.5     readr_2.1.1    
## [13] tibble_3.1.8    tidyverse_1.3.2 jsonlite_1.8.3  r2d3_0.2.6     
## [17] ggfree_0.1.0    ape_5.6-2       ggplot2_3.3.6   lubridate_1.8.0
## [21] knitr_1.40      tidyr_1.2.1    
## 
## loaded via a namespace (and not attached):
##  [1] httr_1.4.4          sass_0.4.2          modelr_0.1.9       
##  [4] bslib_0.4.0         assertthat_0.2.1    highr_0.9          
##  [7] googlesheets4_1.0.1 cellranger_1.1.0    yaml_2.3.6         
## [10] numDeriv_2016.8-1.1 pillar_1.8.1        backports_1.4.1    
## [13] lattice_0.20-45     glue_1.6.2          digest_0.6.30      
## [16] rvest_1.0.3         colorspace_2.0-3    htmltools_0.5.3    
## [19] plyr_1.8.8          pkgconfig_2.0.3     broom_1.0.1        
## [22] haven_2.5.1         mvtnorm_1.1-3       scales_1.2.1       
## [25] tzdb_0.3.0          googledrive_2.0.0   farver_2.1.1       
## [28] generics_0.1.3      DT_0.18             ellipsis_0.3.2     
## [31] cachem_1.0.6        withr_2.5.0         cli_3.4.1          
## [34] magrittr_2.0.3      crayon_1.5.2        readxl_1.4.1       
## [37] evaluate_0.17       fs_1.5.2            fansi_1.0.3        
## [40] nlme_3.1-160        xml2_1.3.3          tools_4.1.3        
## [43] hms_1.1.2           gargle_1.2.1        lifecycle_1.0.3    
## [46] munsell_0.5.0       reprex_2.0.2        compiler_4.1.3     
## [49] jquerylib_0.1.4     grid_4.1.3          rstudioapi_0.14    
## [52] htmlwidgets_1.5.4   crosstalk_1.2.0     labeling_0.4.2     
## [55] rmarkdown_2.10      gtable_0.3.1        DBI_1.1.3          
## [58] R6_2.5.1            bdsmatrix_1.3-6     fastmap_1.1.0      
## [61] utf8_1.2.2          stringi_1.7.8       Rcpp_1.0.9         
## [64] vctrs_0.5.0         dbplyr_2.2.1        tidyselect_1.2.0   
## [67] xfun_0.34