datexplore.clean_names

Module Contents

Functions

clean_names(data[, case])

Clean all column names of a dataframe.

datexplore.clean_names.clean_names(data, case='snake_case')[source]

Clean all column names of a dataframe.

Rewrite every column name in a dataframe so that it uses only letters, numbers, and underscores. The capitalization format is specified with the case parameter. For best results with the case format, the words in an “unclean” column label should be separated by spaces.

Parameters:
  • data (pandas.core.frame.DataFrame) – Dataframe whose column names are to be cleaned.

  • case (str, optional) – Capitalization format of the clean names: snake_case, CamelCase, or lowerCamelCase. Defaults to 'snake_case'.

Returns:

Dataframe with clean column names.

Return type:

pandas.core.frame.DataFrame

Examples

>>> from datexplore.clean_names import clean_names
>>> import pandas as pd
>>> data = pd.DataFrame({'Even Numbers': [2, 4, 6, 8], 'odd numbers': [1, 3, 5, 7]})
>>> clean_data = clean_names(data)
>>> print(clean_data.columns)
Index(['even_numbers', 'odd_numbers'], dtype='object')
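
The following is a minimal sketch of how the cleaning described above might work for a single column label. It is not the datexplore implementation: the helper name sketch_clean_label and its splitting rule (break on any run of characters other than letters and digits) are assumptions made for illustration, and the real function may treat edge cases differently.

```python
import re


def sketch_clean_label(label, case="snake_case"):
    """Hypothetical helper: reduce one label to letters, digits, and underscores."""
    # Assumption: words are separated by any run of non-alphanumeric characters.
    words = [w for w in re.split(r"[^0-9A-Za-z]+", str(label)) if w]
    if case == "snake_case":
        return "_".join(w.lower() for w in words)
    if case == "CamelCase":
        return "".join(w.capitalize() for w in words)
    if case == "lowerCamelCase":
        head, *tail = words or [""]
        return head.lower() + "".join(w.capitalize() for w in tail)
    raise ValueError(f"unsupported case: {case!r}")


labels = ["Even Numbers", "odd numbers"]
print([sketch_clean_label(name) for name in labels])
print([sketch_clean_label(name, case="CamelCase") for name in labels])
print([sketch_clean_label(name, case="lowerCamelCase") for name in labels])
```

Under these assumptions, the three print calls show ['even_numbers', 'odd_numbers'], ['EvenNumbers', 'OddNumbers'], and ['evenNumbers', 'oddNumbers']. A helper like this could be applied to a whole dataframe with pandas' DataFrame.rename(columns=...), which accepts a function that maps each column label to its new name.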