Don’t collect large RDDs | Apache Spark - Best Practices and Tuning