Hierarchical clustering

Allows users to perform hierarchical clustering on a dataset.

Use the Hierarchical Clustering tab to group similar rows (e.g., samples, observations) and columns (e.g., genes, variables) in your data based on their values. This method creates a hierarchy of clusters, often visualized as a tree structure (dendrogram) alongside a heatmap.

Configure how the clustering is performed and how the results are displayed. For general options like column selection and basic preprocessing, see the main documentation sections. Settings specific to this tab include:

  • Scale (for Heatmap Coloring): Choose how values are scaled before being mapped to colors in the heatmap. This affects visual appearance but not the clustering itself (unless scaling is also selected as a preprocessing step).

    • Row: Scale values within each row independently.

    • Column: Scale values within each column independently.

    • None: Use the original (potentially preprocessed) values without additional scaling for coloring.

  • Display: Toggle the visibility of different plot elements:

    • Legend: Show or hide the color key legend for the heatmap.

    • Rownames: Show or hide the names of the rows along the side of the heatmap.

    • Colnames: Show or hide the names of the columns along the top/bottom of the heatmap.

    • Numbers: Display the actual scaled numeric values on the heatmap cells (useful for smaller heatmaps).

  • Colnames Size & Rownames Size: Adjust the font size for the column and row labels using +/- buttons or by entering a numeric value.

  • Clustering Distance: Select the metric used to measure the distance (dissimilarity) between data points (rows) and between variables (columns). Common options include Euclidean, Manhattan, Correlation-based distances, etc.

  • Clustering Method (Linkage): Choose the algorithm used to merge clusters during the hierarchical process. Common options include ward.D2, complete, average, single. The choice affects the shape of the resulting clusters.

  • Tree Ordering: Apply methods to optimize the ordering of branches within the dendrogram for better visualization, without changing the clustering hierarchy itself (e.g., based on leaf weights).

Last updated

Was this helpful?