Zingg-0.3.3
  • Welcome to Zingg
  • Step By Step Guide
    • Installation
      • Working with Docker Image
    • Hardware Sizing
    • Configuration
    • Creating training data
      • findTrainingData
      • label
      • findAndLabel
      • Using preexisting training data
      • Exporting labeled data as csv
    • Building and saving the model
    • Finding the matches
    • Linking across datasets
  • Data Sources and Sinks
    • Zingg Pipes
    • Snowflake
    • Cassandra
    • MongoDB
    • Neo4j
    • Parquet
  • Running Zingg on Cloud
    • Running on AWS
    • Running on Azure
    • Running on Databricks
  • Zingg Models
    • Pretrained models
  • Improving Accuracy By Defining Own Functions
  • Generating Documentation
  • Output Scores
  • Security And Privacy
  • Updating Labeled Pairs
  • Reporting bugs and contributing
  • Community
  • Frequently Asked Questions
  • Reading Material
Powered by GitBook
On this page

Improving Accuracy By Defining Own Functions

You can add your own blocking functions which will be evaluated by Zingg to build the blocking tree.

These business specific blocking functions go into Hash Functions and need to be added to HashFunctionRegistry and hash functions config

Also, for similarity, you can define your own measures. Each dataType has predefined features, for example String fuzzy type is configured for affine and jaro

You can define your own comparisons and use them

PreviousPretrained modelsNextGenerating Documentation

Last updated 2 years ago