plyinteractions provides a consistent interface for importing genomic
interactions from pairs and bedpe files into GInteractions objects
in R and for manipulating them using a tidy grammar.
plyranges
operates on genomic ranges (e.g. implemented as GRanges
objects in Bioconductor) and introduces a tidy grammar for manipulating
them. Genomic interactions (implemented as GInteractions
objects in Bioconductor) are more complex than genomic ranges in that each
observation (row) corresponds to a pair of two genomic ranges, each one
with its own metadata. plyinteractions extends plyranges syntax to
manipulate genomic interactions in R using dplyr verbs and tidy operations.
The grammar of tidy genomic data transformation defined in
plyranges
and available for GInteractions currently supports:
- 
dplyrverbs (forGInteractionsandGroupedGInteractions):- Group genomic interactions with 
group_by; - Summarize grouped genomic interactions with 
summarize; - Tally/count grouped genomic interactions with 
tallyandcount; - Modify genomic interactions with 
mutate; - Subset genomic interactions with 
filterusing<data-masking>and logical expressions; - Pick out any columns from the associated metadata with 
selectusing<tidy-select>arguments; - Subset using indices with 
slice; - Order genomic interactions with 
arrangeusing categorical/numerical variables. 
 - Group genomic interactions with 
 - 
plyrangesverbs (forPinnedGInteractionsandAnchoredPinnedGInteractions):- Stretch specific anchors of genomic interactions to a given width with 
stretch; anchor_*functions to control how stretching is performed;- Shift specific anchors of genomic interactions with 
shift; - Obtain flanking 
GRangesfrom specific anchors of genomic interactions withflank. 
 - Stretch specific anchors of genomic interactions to a given width with 
 
Note: In the genomic interaction field, the "anchor" term typically
refers to the two genomic loci brought together into an interaction. In
plyranges, the term anchor is used to specify which "part" of a genomic
locus is fixed (e.g. "5p", 3p", "center") and, incidently, which one can be modified
by plyranges verbs.
For more details on GInteractions "anchors" vs. plyranges anchoring
system, read
this section from our vignette.
- 
Overlapping operations (for
GInteractionsandPinnedGInteractions):find_overlapscount_overlapsfilter_by_overlapsandfilter_by_non_overlapsjoin_overlap_left
 
plyinteractions can be currently be installed from GitHub:
BiocManager::install("tidyomics/plyinteractions")Read vignette("plyinteractions") for more details.
Please note that this project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.
plyinteractions package is heavily based on plyranges. It adapts a
number of functions and methods defined in that package, and would not have
been developed without the seminal work from Stuart Lee, Dianne Cook and Michael
Lawrence:
- Lee, Stuart, Dianne Cook, and Michael Lawrence. 2019. “Plyranges: A Grammar of Genomic Data Transformation.” Genome Biology 20 (1): 4. https://doi.org/10.1186/s13059-018-1597-8.
 
This package is largely inspired by the tidyverse:
- Wickham H, François R, Henry L, Müller K, Vaughan D (2023). dplyr: A Grammar of Data Manipulation. R package version 1.1.2, https://CRAN.R-project.org/package=dplyr.
 - Henry L, Wickham H (2023). rlang: Functions for Base Types and Core R and 'Tidyverse' Features. R package version 1.1.1, https://CRAN.R-project.org/package=rlang.
 
