For standalone local Spark without a cluster
Zingg can be easily run on a single machine to process up to a few million records.
To prepare your machine, complete the following steps:
A) Install the specified JDK version
B) Install Apache Spark: download the specified version from spark.apache.org and unzip it into a folder under your home directory.
Add the following entries to your shell startup file: ~/.bash_aliases on Linux, ~/.zshrc on macOS.
export JAVA_HOME=<path to JDK>
export SPARK_HOME=<path to the Apache Spark folder>
export SPARK_MASTER=local[*]
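After reloading the shell, it can help to confirm that the three variables are actually visible. The following is a minimal sketch, not part of Zingg: check_spark_env is a hypothetical helper name, and the rule that SPARK_MASTER should start with "local" is an assumption based on the single-machine setup described here.

```shell
# Hypothetical helper (not shipped with Zingg or Spark): verifies the
# three environment variables set in the shell startup file.
check_spark_env() {
  status=0
  # JAVA_HOME and SPARK_HOME must be set and point at existing directories.
  for name in JAVA_HOME SPARK_HOME; do
    eval "value=\${$name:-}"
    if [ -z "$value" ]; then
      echo "$name is not set"
      status=1
    elif [ ! -d "$value" ]; then
      echo "$name points to a missing directory: $value"
      status=1
    fi
  done
  # Assumption: a standalone local run uses a master such as local or local[*].
  case "${SPARK_MASTER:-}" in
    local|local\[*\]) ;;
    *) echo "SPARK_MASTER is not a local master: ${SPARK_MASTER:-unset}"
       status=1 ;;
  esac
  return "$status"
}
```

Paste the function into a terminal and run check_spark_env. It prints one line per problem and returns a non-zero status if anything is missing.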
C) Add the correct host entry to /etc/hosts
Run ifconfig to find the machine's IP address, and make sure that address is listed in /etc/hosts as an entry for localhost.
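The check on the hosts file can also be scripted. The sketch below assumes the standard hosts format (an address followed by one or more names, with # starting a comment). hosts_maps is a hypothetical helper name, and the address in the usage note is a placeholder for whatever your machine reports.

```shell
# Hypothetical helper: succeeds if hosts file $1 maps address $2 to name $3.
hosts_maps() {
  awk -v ip="$2" -v name="$3" '
    { sub(/#.*/, "") }        # strip comments before inspecting fields
    $1 == ip {
      # Any field after the address is a name for that address.
      for (i = 2; i <= NF; i++) if ($i == name) found = 1
    }
    END { exit (found ? 0 : 1) }
  ' "$1"
}
```

For example, if the machine's address is 192.168.1.10 (a placeholder), hosts_maps /etc/hosts 192.168.1.10 localhost returns success only when that line is present. On Linux systems where ifconfig is not installed, ip addr reports the same address information.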