This package analyzes metadata about the repositories currently in review at Bioconductor. It includes code to interrogate the contribution issues, summarize package capabilities, and categorize all packages in review. Summaries and classifications of packages are currently generated with OpenAI LLMs.
The listing below is a demonstration created with the package on Nov 19 2025. The labeling has clear deficiencies, and improved prompting and tooling are needed. Some repositories are excluded because their DESCRIPTION or README.md contains insufficient metadata.
1. Single-Cell RNA-Seq Analysis
- posDemux: Demultiplexing and filtering sequence reads with combinatorial barcodes.
- fourSynergy: Ensemble algorithm for analyzing 4C-seq data.
- CellMentor: Dimensional reduction using supervised cell type-aware non-negative matrix factorization.
- hammers: Tools for single-cell RNA sequencing (scRNA-seq) data analysis.
2. Tumor and Mutational Data Simulation
- ClonalSim: Simulating the clonal evolution of tumors with realistic sequencing noise.
- MutSeqRData: Experimental data package for mutational sequencing analysis.
3. Metabolomics
- MetaboAnnotatoR: Metabolite annotation of features from Liquid Chromatography-Mass Spectrometry datasets.
- MetaProViz: Mechanistic hypotheses in metabolomics data by integrating literature.
- MetabolomicsPipeline: Analyzing metabolomics data with subpathway analysis and other tools.
4. Gene and Protein Expression Analysis
- scrapple: Wrappers for single-cell RNA-Seq analysis.
- TraianProt: Downstream analysis of quantitative proteomic data.
- OAtools: Streamlined analysis of gene expression data on OpenArray platform.
5. Chromatin and DNA Interaction
- annoLinker: Annotation of genomic peaks using DNA interaction data.
- SMTrackR: Visualization of protein-DNA binding states on sequenced DNA molecules.
6. Genomic Visualization and Enrichment Analysis
- GOfan: Visualization of Gene Ontology enrichment results using sunburst layout.
- ClusterGVis: Clustering and visualization of gene expression data.
- ImageArray: Handling large image arrays in digital pathology and microscopy.
7. Data Processing and Pipeline Tools
- tidyprint: Enhancing usability of ‘SummarizedExperiment’ objects within a tidy workflow.
- proBatch: Analysis and correction of batch effects in high-throughput experiments.
- StatescopeR: Discovering cell states from gene expression profiles and bulk RNA.
8. Genetic Variation and Splicing Impact
- fRagmentomics: Extraction of fragmentomic features and mutational status from cfDNA.
- GXwasR: Conducting sex-aware genetic association analyses in complex traits.
- SpliceImpactR: The impact of alternative RNA splicing on protein structure.
9. Machine Learning and Prediction Models
- asuri: Discovery of marker genes for survival and risk prediction.
- dioscRi: Deep learning-based tool for clinical prediction in cytometry data.
- CENTRE: Prediction of cell-type specific enhancer target interactions.
10. Epigenomics and Transcriptomics Analysis
- epiSeeker: Annotation and visualization of multi-omics epigenetic data.
- decemedip: Deconvoluting cell types in MeDIP-seq data using a Bayesian model.
- HumanRetinaLRSData: Dataset package for RNA sequencing and gene expression analysis.
11. Variant Representation and Interoperability
- AnVILVRS: Interface to the AnVIL VRS Toolkit for genomic variation representation.
- immReferent: Interface for immune receptor (TCR/BCR) and HLA gene IMGT reference data.
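
As a rough illustration of how a grouped listing like the one above can be assembled, the following Python sketch takes per-package records (name, one-line summary, category label), drops records with no usable summary, and renders the numbered categories. It is not the package's own code: the function `render_listing`, the record layout, the placeholder repository `examplePkg`, and the rule "an empty summary means insufficient metadata" are all assumptions made for illustration. In the real workflow the summaries and labels would come from the LLM step rather than being hard-coded.

```python
from collections import defaultdict

# Hypothetical input records: (package, one-line summary, category label).
# The first three entries are copied from the listing above; "examplePkg" is
# an invented placeholder standing in for a repository with too little metadata.
records = [
    ("posDemux",
     "Demultiplexing and filtering sequence reads with combinatorial barcodes.",
     "Single-Cell RNA-Seq Analysis"),
    ("ClonalSim",
     "Simulating the clonal evolution of tumors with realistic sequencing noise.",
     "Tumor and Mutational Data Simulation"),
    ("fourSynergy",
     "Ensemble algorithm for analyzing 4C-seq data.",
     "Single-Cell RNA-Seq Analysis"),
    ("examplePkg", "", ""),
]


def render_listing(records):
    """Group records by category and render a numbered plain-text listing.

    Returns the listing text and the names of the records that were skipped.
    Assumption: a blank summary is used as a stand-in for "insufficient metadata".
    """
    groups = defaultdict(list)  # keeps categories in first-seen order
    skipped = []
    for name, summary, category in records:
        if not summary.strip():
            skipped.append(name)
            continue
        groups[category].append((name, summary.strip()))

    lines = []
    for number, (category, members) in enumerate(groups.items(), start=1):
        lines.append(f"{number}. {category}")
        for name, summary in members:
            lines.append(f"- {name}: {summary}")
    return "\n".join(lines), skipped


listing, skipped = render_listing(records)
print(listing)
print("Excluded:", ", ".join(skipped))
```

Categories are numbered in the order they are first encountered, so the ordering of the input records determines the ordering of the listing.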