Few useful links and information for learning Apache Spark.
Example of standalone Docker cluster for learning purposes:
Fast-paced course for beginners:
https://www.packtpub.com/application-development/apache-spark-7-days-video
On Mac I had to:
- install pyton 3.7 and pip3 with Homebrew (I’m not using conda)
- install: pip3 install jupyter, pip3 install findspark
- add system variable for Jupyter Notebook from course: export SPARK_HOME=~/Tutorials/spark-2.4.1-bin-hadoop2.7/