githubEdit

Which records matched with each other in a cluster

Ready to audit a cluster? Plug in the Zingg ID and go. Get pair-level traceability for clusters.

Zingg Enterprise Feature

Coming Soon!

Many times, records end up being in a cluster even when they may not have directly matched. For example Recod A matched with B, and B matched with C. Some records may match a lot more to other records in the cluster, while one record may just match with one andhence get included in the cluster. If you want to audit clusters, investigate anomalies, present rationale to stakeholders, or refine your matching strategy with confidence, it is important to understand pair wise matching.

The explain phase helps us learn this.

Configuration

The explain configuration references the original matching config and defines where to write the results.

{
  "config": "path_to_original_matching_config/config.json",
  "explainOutput": [
    {
      "name": "outputExplain",
      "format": "csv",
      "props": {
        "location": "/tmp/zinggOutput_explain",
        "delimiter": ",",
        "header": true
      }
    }
  ]
}  

Run the explain phase from your Zingg Enterprise Installation and pass the Zingg ID of the cluster you want to inspect:

Example:

This will provide a pair-level view (pk1, pk2) for the selected Zingg ID

  • pk1: Primary key (record ID) of the first record in the pair

  • pk2: Primary key (record ID) of the second record in the pair

Last updated

Was this helpful?