Comment on page
Using pre-existing training data
If you already have some training data that you want to start with, you can use that as well with Zingg. Add an attribute trainingSamples to the config and define the training pairs.
The training data supplied to Zingg should have a z_cluster column that groups the records together. It also needs the z_isMatch column which is 1 if the pairs match or 0 if they do not match.