We just released a new open source boilerplate template to help any Spark user run spark-submit commands smoothly, covering tasks such as adding dependencies, project source code, and more.

TLDR: Here is an open source template to help you get started.

At Soluto, as part of our everyday Data Science work, we create ETL (Extract, Transform, Load) jobs. Our main tool for this is Spark, specifically PySpark, with spark-submit. Spark is used for distributed computing on large-scale data sets…
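
To make the spark-submit part concrete, here is a minimal sketch of assembling a spark-submit command line that ships zipped dependencies and project source alongside the main script. The helper `build_spark_submit_args` and the file names (`main.py`, `dependencies.zip`, `src.zip`, `--job`) are hypothetical and chosen for illustration only; they are not taken from the template. The only assumptions about Spark itself are the standard `--master` and `--py-files` options of spark-submit.

```python
import shlex
from typing import List, Optional, Sequence


def build_spark_submit_args(
    main_script: str,
    py_files: Sequence[str] = (),
    master: Optional[str] = None,
    app_args: Sequence[str] = (),
) -> List[str]:
    """Assemble the argument list for a spark-submit invocation.

    Hypothetical helper for illustration; not part of the template.
    """
    args = ["spark-submit"]
    if master:
        # e.g. "local[*]" or "yarn"
        args += ["--master", master]
    if py_files:
        # --py-files takes a comma-separated list of .zip, .egg, or .py files
        args += ["--py-files", ",".join(py_files)]
    # Spark options go before the main script; application arguments go after it
    args.append(main_script)
    args.extend(app_args)
    return args


if __name__ == "__main__":
    cmd = build_spark_submit_args(
        "main.py",                                 # hypothetical entry point
        py_files=["dependencies.zip", "src.zip"],  # hypothetical archives
        master="local[*]",
        app_args=["--job", "example_job"],         # hypothetical job argument
    )
    # Print a shell-safe version of the command instead of running it
    print(" ".join(shlex.quote(a) for a in cmd))
```

Returning a list rather than a single string means the result can be passed directly to `subprocess.run` without shell quoting issues, if you choose to launch the job from Python.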