Cell heterogeneity studies using single-cell sequencing are gaining great significance in the era of personalized medicine. In particular, characterization of tumor heterogeneity is an emergent issue to improve clinical oncology, since both inter- and intra-tumor level heterogeneity influence the utility and application of molecular classifications through specific biomarkers. Majority of studies have exploited gene expression to discriminate cell types. However, to provide a more nuanced view of the underlying differences, isoform expression and alternative splicing events have to be analyzed in detail. In this study, we utilize publicly available single cell and bulk RNA sequencing datasets of breast cancer cells from primary tumors and immortalized cell lines. Breast cancer is very heterogeneous with well defined molecular subtypes and was therefore chosen for this study. RNA-seq data were explored in terms of genes, isoforms abundance and splicing events. The study was conducted from an average based approach (gene level expression) to detailed and deeper ones (isoforms abundance/splicing events) to perform a comparative analysis, and, thus, highlight the importance of the splicing machinery in defining the tumor heterogeneity. Moreover, here we demonstrate how the investigation of gene isoforms expression can help to identify the appropriate in vitro models. We furthermore extracted marker isoforms, and alternatively spliced genes between and within the different single cell populations to improve the classification of the breast cancer subtypes.

Exploiting single-cell RNA sequencing data to link alternative splicing and cancer heterogeneity: A computational approach

Mario Rosario Guarracino
2019-01-01

Abstract

Cell heterogeneity studies using single-cell sequencing are gaining great significance in the era of personalized medicine. In particular, characterization of tumor heterogeneity is an emergent issue to improve clinical oncology, since both inter- and intra-tumor level heterogeneity influence the utility and application of molecular classifications through specific biomarkers. Majority of studies have exploited gene expression to discriminate cell types. However, to provide a more nuanced view of the underlying differences, isoform expression and alternative splicing events have to be analyzed in detail. In this study, we utilize publicly available single cell and bulk RNA sequencing datasets of breast cancer cells from primary tumors and immortalized cell lines. Breast cancer is very heterogeneous with well defined molecular subtypes and was therefore chosen for this study. RNA-seq data were explored in terms of genes, isoforms abundance and splicing events. The study was conducted from an average based approach (gene level expression) to detailed and deeper ones (isoforms abundance/splicing events) to perform a comparative analysis, and, thus, highlight the importance of the splicing machinery in defining the tumor heterogeneity. Moreover, here we demonstrate how the investigation of gene isoforms expression can help to identify the appropriate in vitro models. We furthermore extracted marker isoforms, and alternatively spliced genes between and within the different single cell populations to improve the classification of the breast cancer subtypes.
File in questo prodotto:
File Dimensione Formato  
2019Biochemistry.pdf

solo utenti autorizzati

Descrizione: Articolo in rivista
Tipologia: Versione Editoriale (PDF)
Licenza: Copyright dell'editore
Dimensione 2.41 MB
Formato Adobe PDF
2.41 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11580/84951
Citazioni
  • ???jsp.display-item.citation.pmc??? 4
  • Scopus 7
social impact