webchem - Chemical Information from the Web
Chemical information from around the web. This package interacts with a suite of web services for chemical information. Sources include: Alan Wood's Compendium of Pesticide Common Names, Chemical Identifier Resolver, ChEBI, Chemical Translation Service, ChemSpider, ETOX, Flavornet, NIST Chemistry WebBook, OPSIN, PubChem, SRS, Wikidata.
Last updated
cas-numberchemical-informationchemspideridentifierropensciwebscraping
11.50 score 184 stars 10 dependents 224 scripts 1.1k downloadshoardr - Manage Cached Files
Suite of tools for managing cached files, targeting use in other R packages. Uses 'rappdirs' for cross-platform paths. Provides utilities to manage cache directories, including targeting files by path or by key; cached directories can be compressed and uncompressed easily to save disk space.
Last updated
cachingdatafilesxmlpdf
7.92 score 23 stars 16 dependents 17 scripts 2.2k downloadstailor - Tool for Adapter-domain Identification and Linking Of RBPs
Bacteriophages (phages) are viruses that infect bacteria. Many phages encode receptor binding proteins (RBPs) that mediate attachment to their bacterial hosts. Most RBPs contain a conserved N-terminal domain (we refer to this as an "adapter") which anchors the protein to the phage particle. This package provides functions to indentify these adapters and group them based on sequence similarity.
Last updated
6.79 score 3 stars 51 dependents 32 scripts 28k downloadsmulea - Enrichment Analysis Using Multiple Ontologies and False Discovery Rate
Background - Traditional gene set enrichment analyses are typically limited to a few ontologies and do not account for the interdependence of gene sets or terms, resulting in overcorrected p-values. To address these challenges, we introduce mulea, an R package offering comprehensive overrepresentation and functional enrichment analysis. Results - mulea employs a progressive empirical false discovery rate (eFDR) method, specifically designed for interconnected biological data, to accurately identify significant terms within diverse ontologies. mulea expands beyond traditional tools by incorporating a wide range of ontologies, encompassing Gene Ontology, pathways, regulatory elements, genomic locations, and protein domains. This flexibility enables researchers to tailor enrichment analysis to their specific questions, such as identifying enriched transcriptional regulators in gene expression data or overrepresented protein domains in protein sets. To facilitate seamless analysis, mulea provides gene sets (in standardised GMT format) for 27 model organisms, covering 22 ontology types from 16 databases and various identifiers resulting in almost 900 files. Additionally, the muleaData ExperimentData Bioconductor package simplifies access to these pre-defined ontologies. Finally, mulea's architecture allows for easy integration of user-defined ontologies, or GMT files from external sources (e.g., MSigDB or Enrichr), expanding its applicability across diverse research areas. Conclusions - mulea is distributed as a CRAN R package. It offers researchers a powerful and flexible toolkit for functional enrichment analysis, addressing limitations of traditional tools with its progressive eFDR and by supporting a variety of ontologies. Overall, mulea fosters the exploration of diverse biological questions across various model organisms.
Last updated
annotationdifferentialexpressiongeneexpressiongenesetenrichmentgographandnetworkmultiplecomparisonpathwaysreactomesoftwaretranscriptionvisualizationenrichmentenrichment-analysisfunctional-enrichment-analysisgene-set-enrichmentontologiestranscriptomicscpp
6.67 score 29 stars 40 scripts 642 downloadstaxizedb - Offline Access to Taxonomic Databases
Download taxonomic databases, convert them into 'SQLite' format, and query them locally for fast, reliable, and reproducible access to taxonomic data.
Last updated
itistaxizetaxonomic-databasestaxonomy
6.42 score 33 stars 132 scripts 313 downloadsbreakpoint - An R Package for Multiple Break-Point Detection via the Cross-Entropy Method
Implements the Cross-Entropy (CE) method, which is a model based stochastic optimization technique to estimate both the number and their corresponding locations of break-points in continuous and discrete measurements (Priyadarshana and Sofronov (2015), Priyadarshana and Sofronov (2012a), Priyadarshana and Sofronov (2012b)).
Last updated
2.30 score 40 scripts 102 downloads