Introduction

A preprint describes the production of a curated resource of 29 million single cells.

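Metadata interactions

The cell-level metadata object `md` is a lazy table backed by an in-memory DuckDB database. Printing it shows the first ten rows and the first few of its 56 columns: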

## # Source:   SQL [?? x 56]
## # Database: DuckDB v0.10.2 [vincent@Linux 6.5.0-35-generic:R 4.4.0/:memory:]
##    cell_              sample_    cell_type cell_type_harmonised confidence_class
##    <chr>              <chr>      <chr>     <chr>                           <dbl>
##  1 AAACCTGAGAGACGAA_1 689e2fe4f… basal ce… basal_cell                          1
##  2 AAACCTGAGTTGTCGT_1 689e2fe4f… basal ce… basal_cell                          1
##  3 AAACCTGCAGTCGATT_1 689e2fe4f… luminal … luminal_cell_of_pro…                1
##  4 AAACCTGCAGTTCATG_1 689e2fe4f… luminal … luminal_cell_of_pro…                1
##  5 AAACCTGGTCTAAACC_1 689e2fe4f… basal ce… basal_cell                          1
##  6 AAACCTGTCGTACCGG_1 689e2fe4f… basal ce… basal_cell                          1
##  7 AAACCTGTCTTGTACT_1 689e2fe4f… basal ce… basal_cell                          1
##  8 AAACGGGAGTACGTTC_1 689e2fe4f… basal ce… basal_cell                          1
##  9 AAACGGGAGTAGGTGC_1 689e2fe4f… luminal … luminal_cell_of_pro…                1
## 10 AAACGGGAGTGGGATC_1 689e2fe4f… basal ce… basal_cell                          1
## # ℹ more rows
## # ℹ 51 more variables: cell_annotation_azimuth_l2 <chr>,
## #   cell_annotation_blueprint_singler <chr>,
## #   cell_annotation_monaco_singler <chr>, sample_id_db <chr>,
## #   `_sample_name` <chr>, assay <chr>, assay_ontology_term_id <chr>,
## #   file_id_db <chr>, cell_type_ontology_term_id <chr>,
## #   development_stage <chr>, development_stage_ontology_term_id <chr>, …
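The `Source: SQL [?? x 56]` header means the table is a lazy query, so dplyr verbs are translated to SQL and run inside DuckDB. The following is a minimal sketch of that behaviour, not the author's workflow. It assumes the DBI, duckdb, dplyr and dbplyr packages are installed, and it uses a hypothetical toy table `toy_md` with invented rows that borrow two column names from the printout above.

```r
library(DBI)
library(dplyr)

# Hypothetical stand-in for `md`: invented rows, not atlas data
con <- dbConnect(duckdb::duckdb())  # in-memory DuckDB database
dbWriteTable(con, "toy_md", data.frame(
  cell_ = c("c1", "c2", "c3", "c4"),
  cell_type_harmonised = c("basal_cell", "basal_cell", "luminal_cell", "basal_cell"),
  confidence_class = c(1, 1, 1, 3)
))
toy_md <- tbl(con, "toy_md")  # lazy reference; no rows are pulled into R

# Build a filter; this only records the query, it does not execute it
basal_hi <- toy_md |>
  filter(cell_type_harmonised == "basal_cell", confidence_class == 1)

# Print the SQL that dplyr generated for the filter
show_query(basal_hi)

# Run a small aggregate in the database and return a single number
n_basal_hi <- basal_hi |> summarise(n = n()) |> pull(n)
n_basal_hi

dbDisconnect(con, shutdown = TRUE)
```

The same verbs should apply to `md` itself, where filtering before collecting keeps the amount of data moved into R small.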
library(dplyr)
# Count cells per tissue; the summary is computed lazily inside DuckDB
md |> group_by(tissue) |> summarise(n = n())
## # Source:   SQL [?? x 2]
## # Database: DuckDB v0.10.2 [vincent@Linux 6.5.0-35-generic:R 4.4.0/:memory:]
##    tissue                                                   n
##    <chr>                                                <dbl>
##  1 peripheral zone of prostate                          74403
##  2 transition zone of prostate                         191641
##  3 bronchus                                             17521
##  4 renal medulla                                       877939
##  5 epithelial cell of alveolus of lung (cell culture)  143015
##  6 ileum                                                80215
##  7 prostate gland                                       66245
##  8 muscle of abdomen                                    12284
##  9 cerebellum                                         1368270
## 10 breast                                                9770
## # ℹ more rows
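The summary above is still lazy (`[?? x 2]`) and its rows are in no particular order. As a hedged sketch, such a per-tissue count can be sorted and materialised as an ordinary tibble with `collect()`. The example assumes DBI, duckdb, dplyr and dbplyr are installed, and the table `toy_md` is hypothetical, with invented rows rather than atlas metadata.

```r
library(DBI)
library(dplyr)

# Hypothetical stand-in for `md`: invented rows, not atlas data
con <- dbConnect(duckdb::duckdb())  # in-memory DuckDB database
dbWriteTable(con, "toy_md", data.frame(
  cell_  = paste0("cell", 1:6),
  tissue = c("breast", "ileum", "ileum", "bronchus", "ileum", "breast")
))
toy_md <- tbl(con, "toy_md")

# Still lazy: the verbs are translated to SQL, nothing has run yet
lazy_counts <- toy_md |>
  group_by(tissue) |>
  summarise(n = n()) |>
  arrange(desc(n), tissue)

# collect() executes the query in DuckDB and returns a local tibble
tissue_counts <- collect(lazy_counts)
tissue_counts

dbDisconnect(con, shutdown = TRUE)
```

Because only the aggregated rows are collected, this pattern should remain cheap even when the underlying table holds millions of cells.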