Downloads preprocessed datasets to a local cache directory and returns a curated set of genomic biomarkers from multiple sources (CIViC, CGI, MitelmanDB)
The dataset comes as a list object, with three elements:
metadata- a data frame with metadata regarding drug resources useddata- a list with four elements ('civic','cgi','mitelmandb','custom_fusions')fpath- path to cache file
Value
Each entry of the source-specific (e.g. 'civic') entry in the data list contains
a list of three data frames:
variant - list of all biomarker variants, extensively populated according to variant aliases (identifer - column variant_id)
clinical - cross-references between variants recorded in the
variantdata frame and clinical evidence items (identifier - column evidence_id) and underlying literature evidence (identifier - column source_id)literature - lists literature for all source_id's listed in the
clinicaldata frame
