tabensemb.data#

AbstractDataStep(**kwargs)

By inheriting this class, the input kwargs will be used to update default values defined in _defaults().

DataModule(config[, initialize])

The dataset manager that provides loading, processing, and analyzing utilities.

Data Derivation#

AbstractDeriver(**kwargs)

dataderiver

Data Imputation#

AbstractImputer(**kwargs)

The base class for all data-imputers.

AbstractSklearnImputer(**kwargs)

A base class for sklearn-style imputers that has fit_transform and transform methods that return np.ndarray.

dataimputer

Data Processing#

AbstractProcessor(**kwargs)

The base class for data-processors that change the content of the tabular dataset.

AbstractAugmenter(**kwargs)

A kind of data processor that increases the number of data points.

AbstractTransformer(**kwargs)

The base class for data-processors that change values of the tabular dataset.

AbstractFeatureSelector(**kwargs)

The base class for data-processors that reduce the number of features.

AbstractScaler(**kwargs)

This is a marker for scaling processors like a standard scaler or a normalizer.

dataprocessor

Data Splitting#

AbstractSplitter([train_val_test, cv])

The base class for data-splitters that split the dataset and return training, validation, and testing indices.

datasplitter

Utils#

utils