Comment on page
Linking across datasets
To match two datasets against each other
In many cases like reference data mastering, enrichment, etc, two individual datasets are free of duplicates but they need to be matched against each other. The link phase is used for such scenarios.
./zingg.sh --phase link --conf config.json
The sample output is given in the image below. The linked records are given the same z_cluster id. The last column (z_source) in the output tells the source dataset of that record.