mirror of https://github.com/msberends/AMR
You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
750 lines
32 KiB
750 lines
32 KiB
# ==================================================================== # |
|
# TITLE # |
|
# Antimicrobial Resistance (AMR) Data Analysis for R # |
|
# # |
|
# SOURCE # |
|
# https://github.com/msberends/AMR # |
|
# # |
|
# LICENCE # |
|
# (c) 2018-2021 Berends MS, Luz CF et al. # |
|
# Developed at the University of Groningen, the Netherlands, in # |
|
# collaboration with non-profit organisations Certe Medical # |
|
# Diagnostics & Advice, and University Medical Center Groningen. # |
|
# # |
|
# This R package is free software; you can freely use and distribute # |
|
# it for both personal and commercial purposes under the terms of the # |
|
# GNU General Public License version 2.0 (GNU GPL-2), as published by # |
|
# the Free Software Foundation. # |
|
# We created this package for both routine data analysis and academic # |
|
# research and it was publicly released in the hope that it will be # |
|
# useful, but it comes WITHOUT ANY WARRANTY OR LIABILITY. # |
|
# # |
|
# Visit our website for the full manual and a complete tutorial about # |
|
# how to conduct AMR data analysis: https://msberends.github.io/AMR/ # |
|
# ==================================================================== # |
|
|
|
#' Get Properties of a Microorganism |
|
#' |
|
#' Use these functions to return a specific property of a microorganism based on the latest accepted taxonomy. All input values will be evaluated internally with [as.mo()], which makes it possible to use microbial abbreviations, codes and names as input. See *Examples*. |
|
#' @inheritSection lifecycle Stable Lifecycle |
|
#' @param x any [character] (vector) that can be coerced to a valid microorganism code with [as.mo()]. Can be left blank for auto-guessing the column containing microorganism codes if used in a data set, see *Examples*. |
|
#' @param property one of the column names of the [microorganisms] data set: `r vector_or(colnames(microorganisms), sort = FALSE, quotes = TRUE)`, or must be `"shortname"` |
|
#' @param language language of the returned text, defaults to system language (see [get_locale()]) and can be overwritten by setting the option `AMR_locale`, e.g. `options(AMR_locale = "de")`, see [translate]. Also used to translate text like "no growth". Use `language = NULL` or `language = ""` to prevent translation. |
|
#' @param ... other arguments passed on to [as.mo()], such as 'allow_uncertain' and 'ignore_pattern' |
|
#' @param ab any (vector of) text that can be coerced to a valid antibiotic code with [as.ab()] |
|
#' @param open browse the URL using [`browseURL()`][utils::browseURL()] |
|
#' @details All functions will return the most recently known taxonomic property according to the Catalogue of Life, except for [mo_ref()], [mo_authors()] and [mo_year()]. Please refer to this example, knowing that *Escherichia blattae* was renamed to *Shimwellia blattae* in 2010: |
|
#' - `mo_name("Escherichia blattae")` will return `"Shimwellia blattae"` (with a message about the renaming) |
|
#' - `mo_ref("Escherichia blattae")` will return `"Burgess et al., 1973"` (with a message about the renaming) |
|
#' - `mo_ref("Shimwellia blattae")` will return `"Priest et al., 2010"` (without a message) |
|
#' |
|
#' The short name - [mo_shortname()] - almost always returns the first character of the genus and the full species, like `"E. coli"`. Exceptions are abbreviations of staphylococci (such as *"CoNS"*, Coagulase-Negative Staphylococci) and beta-haemolytic streptococci (such as *"GBS"*, Group B Streptococci). Please bear in mind that e.g. *E. coli* could mean *Escherichia coli* (kingdom of Bacteria) as well as *Entamoeba coli* (kingdom of Protozoa). Returning to the full name will be done using [as.mo()] internally, giving priority to bacteria and human pathogens, i.e. `"E. coli"` will be considered *Escherichia coli*. In other words, `mo_fullname(mo_shortname("Entamoeba coli"))` returns `"Escherichia coli"`. |
|
#' |
|
#' Since the top-level of the taxonomy is sometimes referred to as 'kingdom' and sometimes as 'domain', the functions [mo_kingdom()] and [mo_domain()] return the exact same results. |
|
#' |
|
#' The Gram stain - [mo_gramstain()] - will be determined based on the taxonomic kingdom and phylum. According to Cavalier-Smith (2002, [PMID 11837318](https://pubmed.ncbi.nlm.nih.gov/11837318)), who defined subkingdoms Negibacteria and Posibacteria, only these phyla are Posibacteria: Actinobacteria, Chloroflexi, Firmicutes and Tenericutes. These bacteria are considered Gram-positive, except for members of the class Negativicutes which are Gram-negative. Members of other bacterial phyla are all considered Gram-negative. Species outside the kingdom of Bacteria will return a value `NA`. Functions [mo_is_gram_negative()] and [mo_is_gram_positive()] always return `TRUE` or `FALSE` (except when the input is `NA` or the MO code is `UNKNOWN`), thus always return `FALSE` for species outside the taxonomic kingdom of Bacteria. |
|
#' |
|
#' Determination of yeasts - [mo_is_yeast()] - will be based on the taxonomic kingdom and class. *Budding yeasts* are fungi of the phylum Ascomycetes, class Saccharomycetes (also called Hemiascomycetes). *True yeasts* are aggregated into the underlying order Saccharomycetales. Thus, for all microorganisms that are fungi and member of the taxonomic class Saccharomycetes, the function will return `TRUE`. It returns `FALSE` otherwise (except when the input is `NA` or the MO code is `UNKNOWN`). |
|
#' |
|
#' Intrinsic resistance - [mo_is_intrinsic_resistant()] - will be determined based on the [intrinsic_resistant] data set, which is based on `r format_eucast_version_nr(3.2)`. The [mo_is_intrinsic_resistant()] functions can be vectorised over arguments `x` (input for microorganisms) and over `ab` (input for antibiotics). |
|
#' |
|
#' All output [will be translated][translate] where possible. |
|
#' |
|
#' The function [mo_url()] will return the direct URL to the online database entry, which also shows the scientific reference of the concerned species. |
|
#' |
|
#' SNOMED codes - [mo_snomed()] - are from the `r SNOMED_VERSION$current_source`. See *Source* and the [microorganisms] data set for more info. |
|
#' @inheritSection mo_matching_score Matching Score for Microorganisms |
|
#' @inheritSection catalogue_of_life Catalogue of Life |
|
#' @inheritSection as.mo Source |
|
#' @rdname mo_property |
|
#' @name mo_property |
|
#' @return |
|
#' - An [integer] in case of [mo_year()] |
|
#' - A [list] in case of [mo_taxonomy()] and [mo_info()] |
|
#' - A named [character] in case of [mo_url()] |
|
#' - A [numeric] in case of [mo_snomed()] |
|
#' - A [character] in all other cases |
|
#' @export |
|
#' @seealso Data set [microorganisms] |
|
#' @inheritSection AMR Reference Data Publicly Available |
|
#' @inheritSection AMR Read more on Our Website! |
|
#' @examples |
|
#' # taxonomic tree ----------------------------------------------------------- |
|
#' mo_kingdom("E. coli") # "Bacteria" |
|
#' mo_phylum("E. coli") # "Proteobacteria" |
|
#' mo_class("E. coli") # "Gammaproteobacteria" |
|
#' mo_order("E. coli") # "Enterobacterales" |
|
#' mo_family("E. coli") # "Enterobacteriaceae" |
|
#' mo_genus("E. coli") # "Escherichia" |
|
#' mo_species("E. coli") # "coli" |
|
#' mo_subspecies("E. coli") # "" |
|
#' |
|
#' # colloquial properties ---------------------------------------------------- |
|
#' mo_name("E. coli") # "Escherichia coli" |
|
#' mo_fullname("E. coli") # "Escherichia coli" - same as mo_name() |
|
#' mo_shortname("E. coli") # "E. coli" |
|
#' |
|
#' # other properties --------------------------------------------------------- |
|
#' mo_gramstain("E. coli") # "Gram-negative" |
|
#' mo_snomed("E. coli") # 112283007, 116395006, ... (SNOMED codes) |
|
#' mo_type("E. coli") # "Bacteria" (equal to kingdom, but may be translated) |
|
#' mo_rank("E. coli") # "species" |
|
#' mo_url("E. coli") # get the direct url to the online database entry |
|
#' mo_synonyms("E. coli") # get previously accepted taxonomic names |
|
#' |
|
#' # scientific reference ----------------------------------------------------- |
|
#' mo_ref("E. coli") # "Castellani et al., 1919" |
|
#' mo_authors("E. coli") # "Castellani et al." |
|
#' mo_year("E. coli") # 1919 |
|
#' |
|
#' # abbreviations known in the field ----------------------------------------- |
|
#' mo_genus("MRSA") # "Staphylococcus" |
|
#' mo_species("MRSA") # "aureus" |
|
#' mo_shortname("VISA") # "S. aureus" |
|
#' mo_gramstain("VISA") # "Gram-positive" |
|
#' |
|
#' mo_genus("EHEC") # "Escherichia" |
|
#' mo_species("EHEC") # "coli" |
|
#' |
|
#' # known subspecies --------------------------------------------------------- |
|
#' mo_name("doylei") # "Campylobacter jejuni doylei" |
|
#' mo_genus("doylei") # "Campylobacter" |
|
#' mo_species("doylei") # "jejuni" |
|
#' mo_subspecies("doylei") # "doylei" |
|
#' |
|
#' mo_fullname("K. pneu rh") # "Klebsiella pneumoniae rhinoscleromatis" |
|
#' mo_shortname("K. pneu rh") # "K. pneumoniae" |
|
#' |
|
#' \donttest{ |
|
#' # Becker classification, see ?as.mo ---------------------------------------- |
|
#' mo_fullname("S. epi") # "Staphylococcus epidermidis" |
|
#' mo_fullname("S. epi", Becker = TRUE) # "Coagulase-negative Staphylococcus (CoNS)" |
|
#' mo_shortname("S. epi") # "S. epidermidis" |
|
#' mo_shortname("S. epi", Becker = TRUE) # "CoNS" |
|
#' |
|
#' # Lancefield classification, see ?as.mo ------------------------------------ |
|
#' mo_fullname("S. pyo") # "Streptococcus pyogenes" |
|
#' mo_fullname("S. pyo", Lancefield = TRUE) # "Streptococcus group A" |
|
#' mo_shortname("S. pyo") # "S. pyogenes" |
|
#' mo_shortname("S. pyo", Lancefield = TRUE) # "GAS" (='Group A Streptococci') |
|
#' |
|
#' |
|
#' # language support -------------------------------------------------------- |
|
#' mo_gramstain("E. coli", language = "de") # "Gramnegativ" |
|
#' mo_gramstain("E. coli", language = "nl") # "Gram-negatief" |
|
#' mo_gramstain("E. coli", language = "es") # "Gram negativo" |
|
#' |
|
#' # mo_type is equal to mo_kingdom, but mo_kingdom will remain official |
|
#' mo_kingdom("E. coli") # "Bacteria" on a German system |
|
#' mo_type("E. coli") # "Bakterien" on a German system |
|
#' mo_type("E. coli") # "Bacteria" on an English system |
|
#' |
|
#' mo_fullname("S. pyogenes", |
|
#' Lancefield = TRUE, |
|
#' language = "de") # "Streptococcus Gruppe A" |
|
#' mo_fullname("S. pyogenes", |
|
#' Lancefield = TRUE, |
|
#' language = "nl") # "Streptococcus groep A" |
|
#' |
|
#' |
|
#' # other -------------------------------------------------------------------- |
|
#' |
|
#' mo_is_yeast(c("Candida", "E. coli")) # TRUE, FALSE |
|
#' |
|
#' # gram stains and intrinsic resistance can also be used as a filter in dplyr verbs |
|
#' \donttest{ |
|
#' if (require("dplyr")) { |
|
#' example_isolates %>% |
|
#' filter(mo_is_gram_positive()) |
|
#' |
|
#' example_isolates %>% |
|
#' filter(mo_is_intrinsic_resistant(ab = "vanco")) |
|
#' } |
|
#' |
|
#' |
|
#' # get a list with the complete taxonomy (from kingdom to subspecies) |
|
#' mo_taxonomy("E. coli") |
|
#' # get a list with the taxonomy, the authors, Gram-stain, |
|
#' # SNOMED codes, and URL to the online database |
|
#' mo_info("E. coli") |
|
#' } |
|
#' } |
|
mo_name <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_name") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
translate_AMR(mo_validate(x = x, property = "fullname", language = language, ...), |
|
language = language, |
|
only_unknown = FALSE, |
|
only_affect_mo_names = TRUE) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_fullname <- mo_name |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_shortname <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_shortname") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
x.mo <- as.mo(x, language = language, ...) |
|
|
|
metadata <- get_mo_failures_uncertainties_renamed() |
|
|
|
replace_empty <- function(x) { |
|
x[x == ""] <- "spp." |
|
x |
|
} |
|
|
|
# get first char of genus and complete species in English |
|
genera <- mo_genus(x.mo, language = NULL) |
|
shortnames <- paste0(substr(genera, 1, 1), ". ", replace_empty(mo_species(x.mo, language = NULL))) |
|
|
|
# exceptions for where no species is known |
|
shortnames[shortnames %like% ".[.] spp[.]"] <- genera[shortnames %like% ".[.] spp[.]"] |
|
# exceptions for staphylococci |
|
shortnames[shortnames == "S. coagulase-negative"] <- "CoNS" |
|
shortnames[shortnames == "S. coagulase-positive"] <- "CoPS" |
|
# exceptions for streptococci: Group A Streptococcus -> GAS |
|
shortnames[shortnames %like% "S. group [ABCDFGHK]"] <- paste0("G", gsub("S. group ([ABCDFGHK])", "\\1", shortnames[shortnames %like% "S. group [ABCDFGHK]"], perl = TRUE), "S") |
|
# unknown species etc. |
|
shortnames[shortnames %like% "unknown"] <- paste0("(", trimws(gsub("[^a-zA-Z -]", "", shortnames[shortnames %like% "unknown"], perl = TRUE)), ")") |
|
|
|
shortnames[is.na(x.mo)] <- NA_character_ |
|
load_mo_failures_uncertainties_renamed(metadata) |
|
translate_AMR(shortnames, language = language, only_unknown = FALSE, only_affect_mo_names = TRUE) |
|
} |
|
|
|
|
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_subspecies <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_subspecies") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
translate_AMR(mo_validate(x = x, property = "subspecies", language = language, ...), language = language, only_unknown = TRUE) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_species <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_species") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
translate_AMR(mo_validate(x = x, property = "species", language = language, ...), language = language, only_unknown = TRUE) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_genus <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_genus") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
translate_AMR(mo_validate(x = x, property = "genus", language = language, ...), language = language, only_unknown = TRUE) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_family <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_family") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
translate_AMR(mo_validate(x = x, property = "family", language = language, ...), language = language, only_unknown = TRUE) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_order <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_order") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
translate_AMR(mo_validate(x = x, property = "order", language = language, ...), language = language, only_unknown = TRUE) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_class <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_class") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
translate_AMR(mo_validate(x = x, property = "class", language = language, ...), language = language, only_unknown = TRUE) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_phylum <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_phylum") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
translate_AMR(mo_validate(x = x, property = "phylum", language = language, ...), language = language, only_unknown = TRUE) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_kingdom <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_kingdom") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
translate_AMR(mo_validate(x = x, property = "kingdom", language = language, ...), language = language, only_unknown = TRUE) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_domain <- mo_kingdom |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_type <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_type") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
x.mo <- as.mo(x, language = language, ...) |
|
out <- mo_kingdom(x.mo, language = NULL) |
|
out[which(mo_is_yeast(x.mo))] <- "Yeasts" |
|
translate_AMR(out, language = language, only_unknown = FALSE) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_gramstain <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_gramstain") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
x.mo <- as.mo(x, language = language, ...) |
|
metadata <- get_mo_failures_uncertainties_renamed() |
|
|
|
x <- rep(NA_character_, length(x)) |
|
# make all bacteria Gram negative |
|
x[mo_kingdom(x.mo) == "Bacteria"] <- "Gram-negative" |
|
# overwrite these 4 phyla with Gram-positives |
|
# Source: https://itis.gov/servlet/SingleRpt/SingleRpt?search_topic=TSN&search_value=956097 (Cavalier-Smith, 2002) |
|
x[(mo_phylum(x.mo) %in% c("Actinobacteria", |
|
"Chloroflexi", |
|
"Firmicutes", |
|
"Tenericutes") & |
|
# but class Negativicutes (of phylum Firmicutes) are Gram-negative! |
|
mo_class(x.mo) != "Negativicutes") |
|
# and of course our own ID for Gram-positives |
|
| x.mo == "B_GRAMP"] <- "Gram-positive" |
|
|
|
load_mo_failures_uncertainties_renamed(metadata) |
|
translate_AMR(x, language = language, only_unknown = FALSE) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_is_gram_negative <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_is_gram_negative") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
x.mo <- as.mo(x, language = language, ...) |
|
metadata <- get_mo_failures_uncertainties_renamed() |
|
grams <- mo_gramstain(x.mo, language = NULL) |
|
load_mo_failures_uncertainties_renamed(metadata) |
|
out <- grams == "Gram-negative" & !is.na(grams) |
|
out[x.mo %in% c(NA_character_, "UNKNOWN")] <- NA |
|
out |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_is_gram_positive <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_is_gram_positive") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
x.mo <- as.mo(x, language = language, ...) |
|
metadata <- get_mo_failures_uncertainties_renamed() |
|
grams <- mo_gramstain(x.mo, language = NULL) |
|
load_mo_failures_uncertainties_renamed(metadata) |
|
out <- grams == "Gram-positive" & !is.na(grams) |
|
out[x.mo %in% c(NA_character_, "UNKNOWN")] <- NA |
|
out |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_is_yeast <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_is_yeast") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
x.mo <- as.mo(x, language = language, ...) |
|
metadata <- get_mo_failures_uncertainties_renamed() |
|
|
|
x.kingdom <- mo_kingdom(x.mo, language = NULL) |
|
x.phylum <- mo_phylum(x.mo, language = NULL) |
|
x.class <- mo_class(x.mo, language = NULL) |
|
x.order <- mo_order(x.mo, language = NULL) |
|
|
|
load_mo_failures_uncertainties_renamed(metadata) |
|
|
|
out <- rep(FALSE, length(x)) |
|
out[x.kingdom == "Fungi" & x.class == "Saccharomycetes"] <- TRUE |
|
out[x.mo %in% c(NA_character_, "UNKNOWN")] <- NA |
|
out |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_is_intrinsic_resistant <- function(x, ab, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_is_intrinsic_resistant") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(ab, allow_NA = FALSE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
x <- as.mo(x, language = language, ...) |
|
ab <- as.ab(ab, language = NULL, flag_multiple_results = FALSE, info = FALSE) |
|
|
|
if (length(x) == 1 & length(ab) > 1) { |
|
x <- rep(x, length(ab)) |
|
} else if (length(ab) == 1 & length(x) > 1) { |
|
ab <- rep(ab, length(x)) |
|
} |
|
if (length(x) != length(ab)) { |
|
stop_("length of `x` and `ab` must be equal, or one of them must be of length 1.") |
|
} |
|
|
|
# show used version number once per session (pkg_env will reload every session) |
|
if (message_not_thrown_before("intrinsic_resistant_version.mo", entire_session = TRUE)) { |
|
message_("Determining intrinsic resistance based on ", |
|
format_eucast_version_nr(3.2, markdown = FALSE), ". ", |
|
font_red("This note will be shown once per session.")) |
|
} |
|
|
|
# runs against internal vector: INTRINSIC_R (see zzz.R) |
|
paste(x, ab) %in% INTRINSIC_R |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_snomed <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_snomed") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
mo_validate(x = x, property = "snomed", language = language, ...) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_ref <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_ref") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
mo_validate(x = x, property = "ref", language = language, ...) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_authors <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_authors") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
x <- mo_validate(x = x, property = "ref", language = language, ...) |
|
# remove last 4 digits and presumably the comma and space that preceed them |
|
x[!is.na(x)] <- gsub(",? ?[0-9]{4}", "", x[!is.na(x)], perl = TRUE) |
|
suppressWarnings(x) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_year <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_year") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
x <- mo_validate(x = x, property = "ref", language = language, ...) |
|
# get last 4 digits |
|
x[!is.na(x)] <- gsub(".*([0-9]{4})$", "\\1", x[!is.na(x)], perl = TRUE) |
|
suppressWarnings(as.integer(x)) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_rank <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_rank") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
mo_validate(x = x, property = "rank", language = language, ...) |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_taxonomy <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_taxonomy") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
x <- as.mo(x, language = language, ...) |
|
metadata <- get_mo_failures_uncertainties_renamed() |
|
|
|
out <- list(kingdom = mo_kingdom(x, language = language), |
|
phylum = mo_phylum(x, language = language), |
|
class = mo_class(x, language = language), |
|
order = mo_order(x, language = language), |
|
family = mo_family(x, language = language), |
|
genus = mo_genus(x, language = language), |
|
species = mo_species(x, language = language), |
|
subspecies = mo_subspecies(x, language = language)) |
|
|
|
load_mo_failures_uncertainties_renamed(metadata) |
|
out |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_synonyms <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_synonyms") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
x <- as.mo(x, language = language, ...) |
|
metadata <- get_mo_failures_uncertainties_renamed() |
|
|
|
IDs <- mo_name(x = x, language = NULL) |
|
syns <- lapply(IDs, function(newname) { |
|
res <- sort(microorganisms.old[which(microorganisms.old$fullname_new == newname), "fullname"]) |
|
if (length(res) == 0) { |
|
NULL |
|
} else { |
|
res |
|
} |
|
}) |
|
if (length(syns) > 1) { |
|
names(syns) <- mo_name(x) |
|
result <- syns |
|
} else { |
|
result <- unlist(syns) |
|
} |
|
|
|
load_mo_failures_uncertainties_renamed(metadata) |
|
result |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_info <- function(x, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_info") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
x <- as.mo(x, language = language, ...) |
|
metadata <- get_mo_failures_uncertainties_renamed() |
|
|
|
info <- lapply(x, function(y) |
|
c(mo_taxonomy(y, language = language), |
|
list(synonyms = mo_synonyms(y), |
|
gramstain = mo_gramstain(y, language = language), |
|
url = unname(mo_url(y, open = FALSE)), |
|
ref = mo_ref(y), |
|
snomed = unlist(mo_snomed(y))))) |
|
if (length(info) > 1) { |
|
names(info) <- mo_name(x) |
|
result <- info |
|
} else { |
|
result <- info[[1L]] |
|
} |
|
|
|
load_mo_failures_uncertainties_renamed(metadata) |
|
result |
|
} |
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_url <- function(x, open = FALSE, language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_url") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(open, allow_class = "logical", has_length = 1) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
x.mo <- as.mo(x = x, language = language, ... = ...) |
|
metadata <- get_mo_failures_uncertainties_renamed() |
|
|
|
df <- microorganisms[match(x.mo, microorganisms$mo), c("mo", "fullname", "source", "kingdom", "rank")] |
|
df$url <- ifelse(df$source == "LPSN", |
|
paste0(CATALOGUE_OF_LIFE$url_LPSN, "/species/", gsub(" ", "-", tolower(df$fullname), fixed = TRUE)), |
|
paste0(CATALOGUE_OF_LIFE$url_CoL, "/data/search?type=EXACT&q=", gsub(" ", "%20", df$fullname, fixed = TRUE))) |
|
|
|
genera <- which(df$kingdom == "Bacteria" & df$rank == "genus") |
|
df$url[genera] <- gsub("/species/", "/genus/", df$url[genera], fixed = TRUE) |
|
subsp <- which(df$kingdom == "Bacteria" & df$rank %in% c("subsp.", "infraspecies")) |
|
df$url[subsp] <- gsub("/species/", "/subspecies/", df$url[subsp], fixed = TRUE) |
|
|
|
u <- df$url |
|
names(u) <- df$fullname |
|
|
|
if (isTRUE(open)) { |
|
if (length(u) > 1) { |
|
warning_("Only the first URL will be opened, as `browseURL()` only suports one string.") |
|
} |
|
utils::browseURL(u[1L]) |
|
} |
|
|
|
load_mo_failures_uncertainties_renamed(metadata) |
|
u |
|
} |
|
|
|
|
|
#' @rdname mo_property |
|
#' @export |
|
mo_property <- function(x, property = "fullname", language = get_locale(), ...) { |
|
if (missing(x)) { |
|
# this tries to find the data and an <mo> column |
|
x <- find_mo_col(fn = "mo_property") |
|
} |
|
meet_criteria(x, allow_NA = TRUE) |
|
meet_criteria(property, allow_class = "character", has_length = 1, is_in = colnames(microorganisms)) |
|
meet_criteria(language, has_length = 1, is_in = c(LANGUAGES_SUPPORTED, ""), allow_NULL = TRUE, allow_NA = TRUE) |
|
|
|
translate_AMR(mo_validate(x = x, property = property, language = language, ...), language = language, only_unknown = TRUE) |
|
} |
|
|
|
mo_validate <- function(x, property, language, ...) { |
|
check_dataset_integrity() |
|
dots <- list(...) |
|
Becker <- dots$Becker |
|
if (is.null(Becker) | property %in% c("kingdom", "phylum", "class", "order", "family", "genus")) { |
|
Becker <- FALSE |
|
} |
|
Lancefield <- dots$Lancefield |
|
if (is.null(Lancefield) | property %in% c("kingdom", "phylum", "class", "order", "family", "genus")) { |
|
Lancefield <- FALSE |
|
} |
|
has_Becker_or_Lancefield <- Becker %in% c(TRUE, "all") | Lancefield %in% c(TRUE, "all") |
|
|
|
if (tryCatch(all(x[!is.na(x)] %in% MO_lookup$mo) & !has_Becker_or_Lancefield, error = function(e) FALSE)) { |
|
# special case for mo_* functions where class is already <mo> |
|
x <- MO_lookup[match(x, MO_lookup$mo), property, drop = TRUE] |
|
|
|
} else { |
|
# try to catch an error when inputting an invalid argument |
|
# so the 'call.' can be set to FALSE |
|
tryCatch(x[1L] %in% MO_lookup[1, property, drop = TRUE], |
|
error = function(e) stop(e$message, call. = FALSE)) |
|
|
|
if (!all(x[!is.na(x)] %in% MO_lookup[, property, drop = TRUE]) | has_Becker_or_Lancefield) { |
|
x <- exec_as.mo(x, property = property, language = language, ...) |
|
} |
|
} |
|
|
|
if (property == "mo") { |
|
return(set_clean_class(x, new_class = c("mo", "character"))) |
|
} else if (property == "snomed") { |
|
return(as.double(eval(parse(text = x)))) |
|
} else { |
|
return(x) |
|
} |
|
} |
|
|
|
find_mo_col <- function(fn) { |
|
# this function tries to find an mo column in the data the function was called in, |
|
# which is useful when functions are used within dplyr verbs |
|
df <- get_current_data(arg_name = "x", call = -3) # will return an error if not found |
|
mo <- NULL |
|
try({ |
|
mo <- suppressMessages(search_type_in_df(df, "mo")) |
|
}, silent = TRUE) |
|
if (!is.null(df) && !is.null(mo) && is.data.frame(df)) { |
|
if (message_not_thrown_before(fn = fn)) { |
|
message_("Using column '", font_bold(mo), "' as input for `", fn, "()`") |
|
} |
|
return(df[, mo, drop = TRUE]) |
|
} else { |
|
stop_("argument `x` is missing and no column with info about microorganisms could be found.", call = -2) |
|
} |
|
}
|
|
|