Studies were incorporated in a barcode, matrix, feature format. This format can be found on the 
                        
                        10x Genomics website
                         
                        for processing of single cell studies to obtain the gene expression matrices. 
                        For each study, the metadata incorporated in GEO were manually curated into profiles and the samples were separated based on applicable groups and conditions. 
                        The expression data from the cells for the samples within the same profile and condition were aggregated into an expression matrix with the cell barcodes having the sample name appended to it to ensure unique cell names.
                    
                Submission Date: Sep 22, 2022
Summary: Seropositivity for autoantibodies against islet autoantigens is associated with the development of autoimmune type 1 diabetes and B cell targeted therapies are effective in both mouse models and in patients who are affected by or at risk for autoimmune type 1 diabetes. The role of B cell receptor affinity in autoimmune type 1 diabetes is unclear. Here, we employed single cell RNA sequencing to define the relationship between B cell receptor affinity for insulin and B cell phenotype during disease development using immunoglobulin heavy chain (VH125) transgenic mouse model (VH125.NOD) in which insulin binding B cells lose self-tolerance, becoming activated during development of autoimmmun type 1 diabetes.
GEO Accession ID: GSE213973
PMID: 36275764
                    
                        Preprocessing and downstream analysis were computed using the scanpy Python library and the
                        steps of processing followed the Seurat vignette. Cells and genes with no expression or very low expression were 
                        removed from the dataset based on a predefined threshold. The data was then normalized across the expression within the cells and log normalized. The top 2000 highly variable genes were extracted to be used for downstream analysis. 
                        For each of these aggregated data matrices, the clusters were computed using the leiden algorithm. Scanpy was then used to compute the PCA, t-SNE, and UMAPs.
                        The points in the plots are labelled by their corresponding cell type labels. The cell type labels were computed using the wilcoxon method as the differential gene expression method.
                        The top 250 genes were then used for enrichment analysis against the CellMarker library in order to determine the most appropriate cell type label with the lowest p-value.
                    
                Select conditions below to toggle them from the plot:
| Cell Types | Cell Samples | 
|---|---|
                    
                        Differential gene expression can be computed for a single cell type labeled group of cells vs the rest. 
                        These include wilcoxon,
                        DESeq2, or characteristic direction.