Lookup Data

Sometimes we want to lookup certain records in match output, The lookup feature helps to achieve so. For given lookup records, it assigns zingg-id which decribes which entity cluster it belongs to.

The lookup phase is run as follows:

./scripts/zingg.sh --phase runLookup --conf <location to lookupConf.json>

Example lookupConf.json:

{
  "config" : "config.json",
  "lookupData": [{
    "name":"lookup-test-data",
    "format":"inMemory"
  }
  ],
  "lookupOutput": [
    {
      "name":"lookup-output",
      "format":"csv",
      "props": {
        "location": "/tmp/zinggOutput/lookup",
        "delimiter": ",",
        "header":true
      }
    }
  ]
} 

Last updated

Was this helpful?