tabensemb.data.dataprocessor

Functions

get_data_processor(name)

Classes

CategoricalOrdinalEncoder(**kwargs)

A categorical feature encoder that transforms string values to unique integer values.
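The idea of mapping strings to unique integers can be illustrated with a minimal, library-independent sketch. It is not the tabensemb implementation; the helper names `fit_ordinal_mapping` and `encode` are hypothetical, and assigning integers in order of first appearance is an assumption made for the example.

```python
def fit_ordinal_mapping(values):
    """Assign each distinct string an integer, in order of first appearance."""
    mapping = {}
    for value in values:
        if value not in mapping:
            mapping[value] = len(mapping)
    return mapping


def encode(values, mapping):
    """Replace every string by the integer assigned to it."""
    return [mapping[value] for value in values]


colors = ["red", "blue", "red", "green"]
mapping = fit_ordinal_mapping(colors)
encoded = encode(colors, mapping)
print(mapping)
print(encoded)
```

Keeping the mapping separate from the encoding step lets the same mapping be reused on new data.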

CorrFeatureSelector(**kwargs)

Select features that are not correlated (in the sense of Pearson correlation).
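One way to picture correlation-based selection is a greedy pass that keeps a feature only if its absolute Pearson correlation with every feature already kept stays below a threshold. The sketch below assumes that greedy rule and a threshold of 0.9; both are illustrative choices, not the behaviour documented for `CorrFeatureSelector`, and the function names are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length, non-constant sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def select_uncorrelated(features, threshold=0.9):
    """Greedily keep features whose |r| with all kept features is below threshold."""
    kept = []
    for name, column in features.items():
        if all(abs(pearson(column, features[k])) < threshold for k in kept):
            kept.append(name)
    return kept


features = {
    "a": [1.0, 2.0, 3.0, 4.0],
    "b": [2.0, 4.0, 6.0, 8.0],    # a scaled copy of "a", so it is redundant
    "c": [1.0, -1.0, 1.0, -1.0],  # only weakly correlated with "a"
}
selected = select_uncorrelated(features)
print(selected)
```

The result of a greedy pass depends on the order in which features are visited, so a real selector may rank features before filtering.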

FeatureValueSelector(**kwargs)

Select data with the specified feature value.

IQRRemover(**kwargs)

Remove outliers using the IQR strategy.
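The IQR strategy is commonly stated as: compute the first and third quartiles Q1 and Q3, then discard values outside [Q1 - k*IQR, Q3 + k*IQR], where IQR = Q3 - Q1. The sketch below assumes the conventional multiplier k = 1.5 and the standard-library quartile estimate; the function name `remove_iqr_outliers` is hypothetical and the parameters used by `IQRRemover` itself may differ.

```python
import statistics


def remove_iqr_outliers(values, k=1.5):
    """Keep only values inside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if lower <= v <= upper]


data = [1, 2, 3, 4, 5, 6, 7, 8, 100]
filtered = remove_iqr_outliers(data)
print(filtered)
```

Because quartiles are barely affected by a single extreme value, the bounds stay tight even when the outlier is very large.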

NaNFeatureRemover(**kwargs)

Remove features that contain no valid value.

RFEFeatureSelector(**kwargs)

Select features using recursive feature elimination, adapted from the implementation of RFECV in sklearn.

SampleDataAugmenter(**kwargs)

A sample data augmenter that is not meaningful in practice and is only used to test data augmentation.

StandardScaler(**kwargs)

A standard scaler implemented using StandardScaler from sklearn.

StdRemover(**kwargs)

Remove outliers using the standard error strategy.
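As a rough illustration, the sketch below assumes the strategy means dropping values that lie more than a chosen number of sample standard deviations from the mean. That reading, the multiplier of 2 used in the example, and the function name `remove_std_outliers` are all assumptions; the actual criterion used by `StdRemover` is not stated here.

```python
import statistics


def remove_std_outliers(values, n_sigma=3.0):
    """Keep only values within n_sigma sample standard deviations of the mean."""
    mean = statistics.mean(values)
    std = statistics.stdev(values)
    return [v for v in values if abs(v - mean) <= n_sigma * std]


data = [10, 11, 9, 10, 12, 10, 11, 9, 10, 50]
# A single large outlier inflates the standard deviation of a small sample,
# so a tighter multiplier is used here than the default.
filtered = remove_std_outliers(data, n_sigma=2.0)
print(filtered)
```

Unlike quartile-based bounds, the mean and standard deviation are themselves pulled by outliers, which is why the choice of multiplier matters for small samples.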

VarianceFeatureSelector(**kwargs)

Remove features in which a single value accounts for almost all samples (more than a certain fraction).
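A simple way to express this filter is to measure, for each feature, the fraction of samples that share its most frequent value and to drop the feature when that fraction reaches a limit. The sketch below assumes a limit of 0.95; the limit and the names `dominant_fraction` and `drop_near_constant` are hypothetical and not taken from `VarianceFeatureSelector`.

```python
from collections import Counter


def dominant_fraction(column):
    """Fraction of samples that take the most frequent value in the column."""
    most_common_count = Counter(column).most_common(1)[0][1]
    return most_common_count / len(column)


def drop_near_constant(features, max_fraction=0.95):
    """Keep only features whose most frequent value covers less than max_fraction."""
    return {
        name: column
        for name, column in features.items()
        if dominant_fraction(column) < max_fraction
    }


features = {
    "mostly_zero": [0] * 99 + [1],  # 99% of samples are identical
    "varied": list(range(100)),     # every sample is different
}
kept = drop_near_constant(features)
print(list(kept))
```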