Improving Accuracy By Defining Own Functions
You can add your own blocking functions, which Zingg evaluates when it builds the blocking tree.
These business-specific blocking functions are implemented as hash functions. Each one must be added to the HashFunctionRegistry and to the hash functions config.
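As a rough illustration of what a business-specific blocking function computes, the sketch below maps a field value to a short key so that records sharing a key land in the same block. The class name `FirstCharsHash` and its `apply` method are hypothetical and stand alone; the sketch does not extend Zingg's hash function base class or register itself in HashFunctionRegistry, so check the Zingg source for the actual contract before adapting it.

```java
import java.util.Locale;

// Hypothetical, standalone sketch. This is NOT Zingg's hash function API;
// it only shows the kind of key a custom blocking function might produce.
final class FirstCharsHash {
    private final int length;

    FirstCharsHash(int length) {
        if (length <= 0) {
            throw new IllegalArgumentException("length must be positive");
        }
        this.length = length;
    }

    // Returns the first `length` characters of the trimmed, lower-cased value.
    // Null or blank input yields null, so such values produce no key.
    String apply(String value) {
        if (value == null) {
            return null;
        }
        String normalized = value.trim().toLowerCase(Locale.ROOT);
        if (normalized.isEmpty()) {
            return null;
        }
        return normalized.substring(0, Math.min(length, normalized.length()));
    }
}
```

A cheap, deterministic key like this is the usual design choice for blocking: it should be fast to compute and tolerant of formatting noise such as case and surrounding whitespace.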
You can also define your own similarity measures. Each data type comes with predefined features; for example, the fuzzy type for String fields is configured with the affine and Jaro measures.
You can define your own comparisons and use them in their place or alongside them.
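To make the idea of a custom comparison concrete, here is a minimal sketch of a similarity measure that scores two strings by the overlap of their word tokens (Jaccard similarity). The class `TokenJaccardSimilarity` and its `score` method are hypothetical names chosen for illustration; how such a measure is plugged into Zingg's feature configuration is not shown here and should be taken from the Zingg source.

```java
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical, standalone sketch of a custom similarity measure.
// It does not implement any Zingg interface.
final class TokenJaccardSimilarity {

    // Returns |A ∩ B| / |A ∪ B| over lower-cased whitespace-separated tokens,
    // a value in [0, 1]. Null or token-free input on both sides scores 0.
    static double score(String a, String b) {
        if (a == null || b == null) {
            return 0.0;
        }
        Set<String> left = tokens(a);
        Set<String> right = tokens(b);
        if (left.isEmpty() && right.isEmpty()) {
            return 0.0;
        }
        Set<String> intersection = new HashSet<>(left);
        intersection.retainAll(right);
        Set<String> union = new HashSet<>(left);
        union.addAll(right);
        return (double) intersection.size() / union.size();
    }

    // Splits on whitespace and drops empty tokens.
    private static Set<String> tokens(String s) {
        Set<String> out = new HashSet<>();
        for (String t : s.toLowerCase(Locale.ROOT).split("\\s+")) {
            if (!t.isEmpty()) {
                out.add(t);
            }
        }
        return out;
    }
}
```

A token-based measure like this complements character-level measures, because it ignores word order: "Acme Corp" and "Corp Acme" score as identical.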