armlet.data.cleaning package#
Subpackages#
Module contents#
This module contains the components for performing data cleaning.
- class armlet.data.cleaning.DataCleaningMethod(sensitive_attributes: list[str], error_type)#
Bases:
ABC- clean_client_errors(clients_data_dict)#
- clean_errors(all_data)#
- abstractmethod clean_errors_dataloader(data: tuple[DataFrame, DataFrame]) tuple[DataFrame, DataFrame]#
- clean_server_errors(server_data)#
- compute_cleaning_metrics()#
- count_n_samples_by_sensitive_attributes(X)#
- init_counters()#
- armlet.data.cleaning.compute_global_stat_values(clients_data_dict: dict[str, tuple[DataFrame, DataFrame]], cfg)#
- armlet.data.cleaning.data_cleaning_pipeline(data, cfg_cleaning, sensitive_attributes)#
Clean the data by following the data cleaning pipeline.
Warning
Label errors and missing values cleaning methods are only applicable to tabular data. Missing values cleaning methods can also be used with image datasets.