arche.rules.compare

Module Contents

arche.rules.compare.MAX_MISSING_VALUES = 6
arche.rules.compare.fields(source_df: pd.DataFrame, target_df: pd.DataFrame, names: List[str], normalize: bool = False, err_thr: float = 0.25) → Result

Finds fields values difference between dataframes.

Parameters
  • - a list of field names (names) –

  • - if set, all fields converted to str and processed with lower (normalize) –

  • - sets the failure threshold for missing values (err_thr) –

Returns

Result with same, missing and new values.

arche.rules.compare.tagged_fields(source_df: pd.DataFrame, target_df: pd.DataFrame, tagged_fields: TaggedFields, tags: List[str]) → Result

Compare fields tagged with tags between two dataframes.