Combining GWAS with scRNAseq to better understand asthma

Sarah Djeddi, PhD

Postdoctoral Research Fellow

Affiliation: Boston Children’s Hospital, Harvard Medical School
Division: Immunology
Department: Pediatrics
Lab: Gutierrez-Arcelus Lab

Postdoctoral Research Fellow

Affiliation: Boston Children’s Hospital, Harvard Medical School
Division: Immunology
Department: Pediatrics
Lab: Gutierrez-Arcelus Lab

Getting to know Sarah 

It is no question why we sat down with Sarah for this edition of the CDN Researcher Spotlight…her work, passion and recent success speaks for itself. Before we dive in, let us tell you a little bit about Sarah:

She is a French computational biologist with a PhD in bioinformatics from Strasbourg University. She has worked on a vast amount of research projects in her professional career and recently has published a preprint on the relation between rhinovirus infections and childhood-onset asthma. 

Who or what motivated Sarah to get to where she is? What motivates Sarah now?

Sarah did her PhD in the Institute of Genetics, Molecular and Cellular Biology in France where she worked on integrating omics data to better understand congenital diseases. During that time she developed intellectual curiosity about complex genetic diseases and how to better understand them. This was a key motivating factor to carry out her postdoctoral work at Maria Gutierrez-Arcelus’ lab at BCH since she is a key referent in the functional genomics field which aims to integrate genetics with other data modalities to draw relations between a persons genotype and their phenotype.

What values guide her decision making? 

Her key driving force is to improve patients' lives. She aims to do this by furthering our understanding of disease etiology, the causal mechanisms that develop into a disease, by jointly studying different data modalities.

The study that became the preprint:

 “Could you summarize to the general public why you decided to carry out this study and what are the main takeaways?” 

“We are interested in understanding the molecular mechanisms of immune-mediated diseases by using a combination of genetic and multi-omics approaches.

Asthma is one of the themes we are studying in the lab since we know it is a disease influenced by genetics and the environment, however we only understand some part of the  genetic origin. Only 47% of risk loci co-localize with leukocyte T-cell, a cell type that is part of the immune system, and has been associated with the genetics of asthma.  So we decided to tackle this question by thinking about other cell types that are under-studied in the context of asthma and in functional genomics studies. Epithelial cells for instance, is another main cell type studied for asthma but not usually from the genetic perspective; so we decided to focus on those cell types; because epithelial cells are the first line of contact for respiratory viruses and allergens. Moreover, epidemiological studies have shown that viral infections in early life are associated with childhood-onset asthma development and it remains unclear whether rhinovirus is causal in asthma or whether it is a biomarker for children already predisposed to asthma.”

“Keeping in mind the general public, could you briefly explain what are GWAS and how are you incorporating them into this study?” 

“I will start with an introduction about GWAS, which stands for Genome-wide association study and it aims to identify associations of genotypes with phenotypes by testing for differences in the allele frequency of genetic variants between patients and healthy controls. The summary statistic obtained from a genome-wide association study is what use for our analysis; you can think of millions of variants that will have an associated P-value and then we can assess which of those variants are associated with the disease. 

In our study we are combining single-cell RNA-seq data and GWAS data by looking at genes that are over-expressed in a specific context (ex: genes being unregulated upon rhinovirus infection or genes being upregulated upon asthma). And then we check if those genes carry more risk variants than expected by chance. In this way, we are identifying cell states that are likely mediating genetic risk for a disease; and in our case we found an enrichment of genes in childhood-onset asthma risk loci in epithelial cells infected with rhinovirus. And thanks to single cell RNA-seq, we found that it is the non-ciliated airway epithelial cells that are likely driving the genetic susceptibility to childhood-onset asthma.”

“What do you think is the power of using single-cell technologies?”

“Single-cell technologies are great for many reasons, compared to more ancestral methods where we would analyze bulk populations of cells, single cells allow researchers to study individual cells. Bulk methods provide a snapshot of the overall population, whereas single-cell analysis captures the dynamics of individual cells over time. These single-cells technologies allow to identify cell subset/state in a more fine grain manner, this permit to identify rare cell types and understand better their roles and potential mechanisms associated with them. We can also use single-cell technology better understand cell-to-cell interactions and communication.”

“Could you elaborate on how identifying epithelial cells as key cells expressing asthma-associated genes can lead to better treatment or prevention of asthma?” 

“The identification of airway epithelial cells infected with rhinovirus, and specifically non-ciliated cells as being the one upregulating asthma-associated genes help to better understand the genetic origin of childhood-onset asthma. It has been shown in the literature that drug targets that have genetic association evidence are more likely to be approved, and then move forward to clinical trial.” 

“If you could carry out further analysis, what would you like to have done?”

“Now our results represent preliminary results from cells taken from a few individuals (10-20 individuals), if we could replicate our results in cohorts of larger sample size (hundreds to thousands of individuals) it would be ideal. In that way we could identify the likely causal genes of risk variants and this will help to characterize more precisely the underlying molecular mechanisms. We would also need functional validations to prove and understand better our findings, we could think about CRISPR strategy for example. Finally, if our findings are further validated we could imagine the development of a rhinovirus vaccine or other protective intervention in order to prevent childhood-onset asthma.”

Next
Next

Integrating single cell & spatial transcriptomics