Functional evaluation of the effect of T3 hormone in a transcriptomic study
A. Objective
To functionally characterize by means of different enrichment strategies (overrepresentation and GSEA methods), the results obtained in the differential expression analysis of the experiment where we used RNA-Seq in mice, and in which we were interested in detecting differentially expressed genes between these 2 groups: wild type (WT) and treated with T3 hormone.
B. Data
After performing differential expression analysis between two experimental groups, the following results were obtained:
Top genes (significant and with higher expression in T3 when we compared T3 vs. WT):
128 genes, top list
-
-
C. Work plan
Open the “top list” data file with a notepad or similar and inspect its contents. Inspect the other two files as well.
Step 1. From the home of the tool go to the “Gene List Analysis” tab. Upload this txt file “top list” or copy the ids of the genes in the window indicated for this purpose.
Step 2. Select the organism. In this case “mus musculus”.
Step 3. Choose the type of analysis. Then we check all the available options:
C.1. Functional description
We will start with a description of the functions annotated (GO terms, signaling pathways) to these genes and for this we will use these two options: “Functional classification viewed in gene list” and “Functional classification viewed in graphics charts” that will provide us respectively the list and the graphical summary of functions associated to this group of genes.
We need:
After obtaining the functional classification list of these genes, we will save it in a file (“Send list to file”).
From the option “Functional classification viewed in graphic charts” get a bar chart for each Gene Ontology ontology. The same information, please represent it from a “pie chart”.
C.2. Statistical analysis: Statistical overrepresentation test
This method provides us with the functions that are overrepresented in our gene list versus the rest of the reference genome.
The functional results will characterize the genes included in our list of interest.
We would like to:
Perform an analysis using the PANTHER GO-Slim Biological Processes. The results we obtain will be visualized in a “multiple pie chart”.
Repeat the analysis but this time we will use all the Biological Process. Comment on the results.
Then reproduce the previous two points with the “Bottom list” file.
C.3. Statistical analysis: Statistical enrichment test
This functional enrichment method incorporates information of interest (clinical, biological or statistical) that ranks the genes in the list.
The functional results will characterize all the genes in our experiment, including additional information that weights the genes in this list, in this case according to their differential expression level (the contrast statistic). So input we will need will be: “list of all ranked genes”.
We would like to:
Perform an analysis using the PANTHER GO-Slim Biological Process.
Repeat the analysis but this time we will use all the Biological Process. Comment on the results.
Some questions on the comparison of methods: GSA vs. ORA
What is the difference in input between the two methods?
Do you think you will find more results with GSA than with ORA methods?
Will the Gene Sets that participate in the same function and that we detect as significant include only genes that are differentially significantly expressed or may there be genes that are not significant but have a common expression pattern in that Gene Set?
D. Working from R
D.1. ClusterProfiler: ORA & GSEA methods
ClusterProfiler package implements methods to
analyze and visualize functional profiles GO, KEGG, DisGeNET, Reactome…) of gene and gene clusters.
We are going to work the same previous exercises, but from R using clusterProfiler package. Are you ready? This
script + data could help us.
Questions for you
-
D.2. mdgsa: GSEA method
We continue with the example “Functional evaluation of the effect of T3 hormone in transcriptomic study with RNA-Seq data”. In this case we will perform a functional characterization with a GSEA method using R.
In this zipped folder: you will find:
Run the R script: