AWS S3
Zingg can use AWS S3 as a source and sink
Steps to run zingg on S3
Set a bucket e.g. zingg28032023 and a folder inside it e.g. zingg
Create aws access key and export via env vars (ensure that the user with below keys has read/write access to above) export AWS_ACCESS_KEY_ID= export AWS_SECRET_ACCESS_KEY= (if mfa is enabled AWS_SESSION_TOKEN env var would also be needed )
Download hadoop-aws-3.1.0.jar and aws-java-sdk-bundle-1.11.271.jar via maven
Set above in zingg.conf spark.jars=//hadoop-aws-3.1.0.jar,//aws-java-sdk-bundle-1.11.271.jar
Run using below commands
Model location
Last updated