# Variable importance

A **Variable Importance Score** in a machine learning model tells you **how much each input feature (or variable) contributes to the model's predictions.**

Essentially:

* **Higher score = More influential:** The variable has a greater impact on the model's output.
* **Lower score = Less influential:** The variable has less impact.

It helps you understand which factors are the key drivers for your model's outcomes and can be useful for feature selection (deciding which variables are most useful to keep).
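This page does not state how PANDORA calculates its scores, so the following is only a minimal sketch of one common technique, permutation importance. It is not a description of PANDORA's implementation. The idea is to shuffle one feature at a time and measure how much the model's error increases. All names here (`permutation_importance`, `toy_model`, the toy data) are hypothetical and exist only for illustration.

```python
import random


def permutation_importance(predict, rows, targets, n_repeats=10, seed=0):
    """Return one score per feature: the mean increase in mean squared
    error when that feature's column is shuffled across rows."""
    rng = random.Random(seed)

    def mse(data):
        return sum((predict(r) - t) ** 2 for r, t in zip(data, targets)) / len(targets)

    baseline = mse(rows)
    scores = []
    for j in range(len(rows[0])):
        increases = []
        for _ in range(n_repeats):
            # Shuffle only column j, leaving every other feature untouched.
            column = [r[j] for r in rows]
            rng.shuffle(column)
            shuffled = [r[:j] + [v] + r[j + 1:] for r, v in zip(rows, column)]
            increases.append(mse(shuffled) - baseline)
        scores.append(sum(increases) / n_repeats)
    return scores


# Hypothetical toy data: the target depends strongly on feature 0,
# weakly on feature 1, and not at all on feature 2.
data_rng = random.Random(42)
rows = [[data_rng.random(), data_rng.random(), data_rng.random()] for _ in range(200)]
targets = [5.0 * r[0] + 0.5 * r[1] for r in rows]


def toy_model(row):
    # Stand-in for a trained model; it ignores feature 2 entirely.
    return 5.0 * row[0] + 0.5 * row[1]


scores = permutation_importance(toy_model, rows, targets)

# Rank feature indices from most to least influential (descending score).
ranking = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
```

In this sketch, a feature the model never uses receives a score of zero, while features the model relies on more heavily receive higher scores. This matches the "higher score = more influential" reading described above.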

<figure><img src="https://237325963-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F9LdC62ZpkxqvCBTPwVZU%2Fuploads%2Fgit-blob-52b31721603ce50b30814767f1cb60c446eefb8f%2FExploration_Variable%20Importance_Main_v2-min.png?alt=media" alt=""><figcaption></figcaption></figure>

{% tabs %}
{% tab title="1. Feature Filtering" %}
**Feature filtering** lets you filter and select the features you want to investigate, both across the dataset and in terms of their contribution to the model's variance. The following filtering options are available:

* **Class:** Show only the features associated with predictions for the selected binary outcomes.
* **Order:** Sort the feature table by one of several options, including rank, name, and feature variance score.
  * Use the adjacent toggle switch to sort in ascending or descending order.
* **Download:** Download an Excel sheet containing the table information for all features.
  {% endtab %}

{% tab title="2. Features across dataset" %}
The **Features across the dataset** sub-tab shows the distribution of feature values for each outcome in the base dataset (prior to training and testing). The feature values for each outcome are shown in a dot plot, and up to 25 features can be selected for plotting.

The plot display can be customized; see the [side panel options](https://atomic-lab.gitbook.io/pandora/data-analysis/discovery/side-panel) for more information.

The dot plots can be downloaded as SVG files or right-clicked and saved as a PNG in PANDORA.

<figure><img src="https://237325963-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F9LdC62ZpkxqvCBTPwVZU%2Fuploads%2Fzfox9LQVd9wRqTIjlKs6%2FVaraible%20importance_Features%20across%20dataset%20plots_v3.png?alt=media&#x26;token=67d108dc-a0c9-4f48-aa74-300fd3c44e65" alt=""><figcaption></figcaption></figure>
{% endtab %}

{% tab title="3. Variable Importance" %}
The **Variable importance** sub-tab provides a bar plot showing feature importance in descending order. For clarity, users can hover over a bar to view the associated feature name and its exact variable importance score.

This bar plot can be right-clicked and saved as a PNG in PANDORA.

<figure><img src="https://237325963-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F9LdC62ZpkxqvCBTPwVZU%2Fuploads%2F1i4QtAwts5gB4fHEqWrt%2FVaraible%20importance_Bar%20plot_whitebackground.png?alt=media&#x26;token=efd771cc-e5d1-4e2d-88e0-6d27055c37ec" alt=""><figcaption></figcaption></figure>
{% endtab %}
{% endtabs %}
