CompareΒΆ

Arche allows you to compare two data sets. All you have to do is call the Arche with a source (that is the data source to validate) and a target (a data source to compare with)

[1]:
from arche import *
import pandas as pd
[2]:
url = "https://raw.githubusercontent.com/scrapinghub/arche/master/docs/source/nbs/data"
[3]:
a = Arche(source=pd.read_csv(f"{url}/items_products_8.csv"), target=pd.read_csv(f"{url}/items_products_7.csv"),
          schema=f"{url}/products.json")
WARNING
Pandas stores `NA` (missing) data differently, which might affect schema validation. Should you care, consider passing raw data in array-like types.
For more details, see https://pandas.pydata.org/pandas-docs/stable/user_guide/gotchas.html#nan-integer-na-values-and-na-type-promotions
[4]:
a.report_all()
[ ]: