1. Open this file and explore the content: tcga_rnaseq.txt. Description:
-
Contains 10 normal samples, 20 tumor samples with 2 subtypes (Basal-like and Her2-enriched).
This dataset was normalized from TMM.
2. Upload your file to Babelomics 5.0.
3. Go to section Expression > Clustering and try several clustering strategies for samples & genes:
UPGMA + Euclidean (square)
UPGMA + Correlation coeff. (Spearman)
Which distance parameter is better for proper clustering?
4. Repeat the analysis using the same distance parameters and SOTA method.
SOTA + Euclidean (square)
SOTA + Correlation coeff. (Spearman)
Do the results change based on the method or the distance parameter?
5. Try to cluster your samples with K-means.
Set k-value 6 and use Correlation coeff. (Spearman)
Repeat the same analysis with k-value 3.
Check the results of K-means.
Are the results acceptable?
Is the dendrogram representing any hierarchy between the samples?
6. Try to cluster your samples with K-means.