datexplore.visualise

Module Contents

Functions

visualise(df[, display])

This function generates visualizations for a pandas DataFrame to identify patterns in missing values, correlation between variables, and distribution of variables and variable pairs.

datexplore.visualise.visualise(df, display=False)[source]

This function generates visualizations for a pandas DataFrame to identify patterns in missing values, correlation between variables, and distribution of variables and variable pairs.

This function creates three types of plots: 1. A heatmap of missing values: Each cell in the heatmap represents a value in the DataFrame. Cells are colored differently to indicate whether the value is missing or not.

This helps in identifying patterns or areas with missing data.
  1. A correlation heatmap: This heatmap shows the correlation coefficients between all pairs of columns in the DataFrame. High positive or negative values indicate strong relationships, while values close to zero suggest weak relationship. This is useful for understanding the relationships between variables.

  2. A pairplot: This creates a grid of scatter plots for each pair of variables in the DataFrame. It helps in visualizing the distribution of individual variables and the relationships between them.

dfpandas.DataFrame

The DataFrame for which the visualizations are to be generated.

None

This function does not return any value. Instead, it displays the generated plots on the screen.

  • The function utilizes seaborn and matplotlib libraries for plotting. Ensure these libraries are imported as ‘sns’ for seaborn and ‘plt’ for matplotlib.pyplot.

>> from dataexplore import visualise >> visualise(df)