Clustered t-SNE analysis

Enables improved visualization of t-SNE plot trends through algorithmic clustering

This tab displays the t-SNE plot after applying a clustering algorithm (selected in the setup, e.g., Louvain) to the 2D t-SNE coordinates. This helps to automatically identify and label distinct groups within the visualization.

Screenshot of the Clustered t-SNE Analysis tab showing colored clusters

What it Shows

  • Clustered t-SNE Plot: The main visualization is the standard t-SNE scatter plot, but now the points (samples) are colored according to the cluster they were assigned to by the chosen algorithm.

  • Cluster Labels: Each distinct cluster is typically labeled directly on the plot (e.g., "1 - 54" indicates Cluster 1 containing 54 points).

  • Legend: A legend maps the colors back to the cluster IDs.

  • Algorithm Information: Text above the plot often specifies the clustering method used (e.g., Louvain) and may mention specific details, such as how outliers are handled (e.g., designated cluster "100").

  • Silhouette Score: An average silhouette score might be displayed, providing a metric for how well-separated the clusters are (values range from -1 to +1, higher values indicate better clustering).

How to Interpret

  • Identify Groups: Use the colors and labels to clearly see the distinct groups identified by the clustering algorithm within the t-SNE map.

  • Evaluate Clustering: Compare the automatically identified clusters to any visual patterns you observed in the standard t-SNE plot or based on known grouping variables. The silhouette score gives a quantitative measure of cluster separation.

  • Foundation for Further Analysis: These cluster assignments form the basis for the analyses performed in the Dataset Analysis tab, which examine the characteristics of these groups using the original high-dimensional data.

This view displays the t-SNE plot where samples are colored based on the clusters identified by the algorithm you selected in the setup (e.g., Louvain, Hierarchical).

  • Visualization: Each point represents a sample, colored according to its assigned cluster ID. This helps you clearly visualize the groups found by the clustering method within the t-SNE map.

  • Silhouette Score: A description above or near the plot usually includes the average silhouette score for the clustering result.

    • This score measures how similar each sample is to its own cluster compared to other clusters.

    • Values range from -1 to +1.

    • A high value (closer to +1) indicates that samples are well-matched to their own cluster and poorly matched to neighboring clusters, suggesting well-defined, distinct clusters.

    • Values near 0 indicate overlapping clusters.

    • Negative values generally indicate that samples might have been assigned to the wrong cluster.

  • Download: You can download this plot as an SVG file or right-click to save it as a PNG directly from PANDORA.

Last updated

Was this helpful?