# Using pre-existing training data

## Supplementing Zingg with existing training data

If you already have some training data that you want to start with, you can use that as well with Zingg. Add an attribute trainingSamples to the config and define the training pairs.

The training data supplied to Zingg should have a z\_cluster column that groups the records together. The z\_cluster uniquely identies the group. We also need to add the z\_isMatch column which is 1 if the pairs match or 0 if they do not match. The z\_isMatch value has to be same for all the records in the z\_cluster group. They either match with each other or they dont.

An example is provided in [Github training data](https://github.com/zinggAI/zingg/blob/0.4.0/examples/febrl/training.csv).

The above training data can be specified using [trainingSamples attribute in the configuration.](https://github.com/zinggAI/zingg/blob/0.4.0/examples/febrl/configWithTrainingSamples.json)

In addition, labeled data of one model can also be exported and used as training data for another model. For details, check out [exporting labeled data](https://docs.zingg.ai/0.4.0/stepbystep/createtrainingdata/exportlabeleddata).

Please note: It is advisable to still run [findTrainingData](https://docs.zingg.ai/0.4.0/stepbystep/createtrainingdata/findtrainingdata) and [label](https://docs.zingg.ai/0.4.0/stepbystep/createtrainingdata/label) a few rounds to tune Zingg with the supplied training data as well as patterns it needs to learn independently.
