Please use this identifier to cite or link to this item:
|Title:||Cellsig plug-in enhances CIBERSORTx signature selection for multi-dataset transcriptomes with sparse multilevel modelling.||Austin Authors:||Al Kamran Khan, Md Abdullah;Wu, Jian;Yuhan, Sun;Barrow, Alexander David;Papenfuss, Anthony T;Mangiola, Stefano||Affiliation:||Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Parkville, Victoria, Australia.
Olivia Newton-John Cancer Research Institute
Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Parkville, Victoria, Australia.
The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia.
Department of Medical Biology, University of Melbourne, Melbourne, Victoria, Australia.
|Issue Date:||11-Nov-2023||Date:||2023||Publication information:||Bioinformatics (Oxford, England) 2023-11-11||Abstract:||The precise characterisation of cell-type transcriptomes is pivotal to understanding cellular lineages, deconvolution of bulk transcriptomes, and clinical applications. Single-cell RNA sequencing resources like the Human Cell Atlas have revolutionised cell-type profiling. However, challenges persist due to data heterogeneity and discrepancies across different studies. One limitation of prevailing tools such as CIBERSORTx is their inability to address hierarchical data structures and handle non-overlapping gene sets across samples, relying on filtering or imputation. Here, we present cellsig, a Bayesian sparse multilevel model designed to improve signature estimation by adjusting data for multilevel effects and modelling for gene-set sparsity. Our model is tailored to large-scale, heterogeneous pseudobulk and bulk RNA sequencing data collections with non-overlapping gene sets. We tested the performances of cellsig on a novel curated Human Bulk Cell-type Catalogue, which harmonises 1,435 samples across 58 datasets. We show that cellsig significantly enhances cell-type marker gene ranking performance. This approach is valuable for cell-type signature selection, with implications for marker gene validation, single-cell annotation, and deconvolution benchmarks. Codes and the interactive app are available at https://github.com/stemangiola/cellsig; and the database is available at https://doi.org/10.5281/zenodo.7582421. Supplementary data are available at Bioinformatics online.||URI:||https://ahro.austin.org.au/austinjspui/handle/1/34231||DOI:||10.1093/bioinformatics/btad685||ORCID:||Journal:||Bioinformatics (Oxford, England)||PubMed URL:||37952182||ISSN:||1367-4811||Type:||Journal Article|
|Appears in Collections:||Journal articles|
Show full item record
checked on Dec 7, 2023
Items in AHRO are protected by copyright, with all rights reserved, unless otherwise indicated.