# Data overview

Use the **Data Overview** tab for an initial look at your dataset. Here you can inspect, filter, preprocess, and visualize specific columns before starting more complex analyses.

<figure><img src="https://237325963-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F9LdC62ZpkxqvCBTPwVZU%2Fuploads%2Fgit-blob-f6587a88841f264fbda5e734e217bb366f1e60f0%2Fdiscovery-data-overview-tabls-plot.png?alt=media" alt=""><figcaption><p>Main overview</p></figcaption></figure>

{% tabs %}
{% tab title="Table Plot" %}
The table plot displays the data in a tabular layout so you can compare aggregated distribution patterns across multiple variables.

<figure><img src="https://237325963-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F9LdC62ZpkxqvCBTPwVZU%2Fuploads%2FteOmhhqioWl9Q9LRkO1a%2Fimage.png?alt=media&#x26;token=5de63a5e-66f4-428e-a0c5-43c83c0e87e7" alt=""><figcaption><p>Table plot illustrating distribution patterns in the fold-change data of a hemagglutination inhibition (HAI) assay performed with various influenza virus strains.</p></figcaption></figure>
{% endtab %}

{% tab title="Distribution Plot" %}
The distribution plot shows how values are distributed in the selected columns, which helps you identify skewness, outliers, and other patterns in the data.

<figure><img src="https://237325963-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F9LdC62ZpkxqvCBTPwVZU%2Fuploads%2FCHvJQmPBmKL1r5ORd6es%2Fimage.png?alt=media&#x26;token=4311fd53-a995-4b57-bd0d-b5ba2c4eb6db" alt=""><figcaption><p><strong>Distribution Plot</strong></p></figcaption></figure>
{% endtab %}
{% endtabs %}
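
To make "aggregated distribution patterns" more concrete, here is a minimal Python sketch of one common way a table plot can be computed: sort the rows by one column, split them into roughly equal row bins, and average every numeric column within each bin. This is an assumption about the general technique, not a description of how the tool itself builds the plot. The function name `table_plot_bins` and its parameters are hypothetical.

```python
from statistics import mean
from typing import Dict, List


def table_plot_bins(
    rows: List[Dict[str, float]], sort_by: str, n_bins: int
) -> List[Dict[str, float]]:
    """Aggregate rows into row bins (hypothetical helper, not the tool's API).

    Assumes every row has the same numeric columns and no missing values.
    """
    if not rows:
        return []
    # Sort rows by the chosen column, largest values first.
    ordered = sorted(rows, key=lambda r: r[sort_by], reverse=True)
    n = len(ordered)
    # Never create more bins than there are rows.
    n_bins = max(1, min(n_bins, n))
    bins = []
    for i in range(n_bins):
        # Integer arithmetic yields contiguous, nearly equal-sized bins.
        chunk = ordered[i * n // n_bins : (i + 1) * n // n_bins]
        # One aggregated value (the mean) per column and bin.
        bins.append({col: mean(r[col] for r in chunk) for col in chunk[0]})
    return bins


example_rows = [
    {"a": 1, "b": 10},
    {"a": 2, "b": 20},
    {"a": 3, "b": 30},
    {"a": 4, "b": 40},
]
binned = table_plot_bins(example_rows, sort_by="a", n_bins=2)
```

Each entry of `binned` corresponds to one horizontal band of the plot. Drawing the per-bin means of all columns side by side shows how the other variables change along the sorted column.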

#### Key Features

* **Preliminary Inspection:** Get a quick summary and visualize distributions for selected columns.
* **Side Panel Controls:**
  * **Column Selection:** Choose which columns (variables) from your dataset you want to visualize.
  * **Preprocessing:** Apply basic preprocessing steps, such as normalization or handling missing values, directly in this tab (see Preprocessing Options for details).
  * **Theme & Colors:** Customize the plot appearance with different themes and color palettes (see Theme and Display Settings).
* **Download Options:**
  * Download the currently displayed plot as an SVG file (scalable vector graphic, good for publications).
  * Download the raw data subset currently being viewed (after any filtering/preprocessing applied in this tab).
* **Dynamic Activation:** This tab is active only when the loaded dataset contains enough data to produce meaningful visualizations, which prevents errors with empty or very small datasets.
* **Bottom Information Bar:** When you select columns, this bar provides useful metadata about them:
  * Count of unique values.
  * Whether the column is numeric (required for some analyses, such as PCA).
  * Whether the column has zero variance (flagged as `zv`).
  * Percentage of missing values (NA).
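
The four values shown in the bottom information bar can be reproduced outside the tool to sanity-check a column. The following is a minimal Python sketch under stated assumptions, not the tool's implementation: `None` and `NaN` count as missing (NA), a column is numeric when all of its non-missing values are numbers, and zero variance means at most one distinct non-missing value. The names `is_missing` and `column_summary` are hypothetical.

```python
import math
from typing import Any, Dict, List


def is_missing(value: Any) -> bool:
    """Assumption: None and float NaN are the only NA markers."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def column_summary(values: List[Any]) -> Dict[str, Any]:
    """Compute per-column metadata similar to the information bar."""
    present = [v for v in values if not is_missing(v)]
    unique_count = len(set(present))
    # Booleans are excluded even though bool is a subclass of int.
    is_numeric = bool(present) and all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in present
    )
    # Assumption: zero variance means at most one distinct observed value.
    zero_variance = unique_count <= 1
    pct_missing = (
        100.0 * (len(values) - len(present)) / len(values) if values else 0.0
    )
    return {
        "unique": unique_count,
        "numeric": is_numeric,
        "zv": zero_variance,
        "pct_na": pct_missing,
    }


titers = column_summary([1.0, 2.0, 2.0, None])
labels = column_summary(["a", "a", "a"])
```

In this sketch, `titers` describes a numeric column with two unique values and 25% missing values, while `labels` describes a non-numeric column that would be flagged as `zv`.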
