Use TreeReduce/TreeAggregate instead of Reduce/Aggregate | Apache Spark - Best Practices and Tuning