hdp-task There are 2 subtasks: generator: A MapReduce job for creating test csv data processor: A Spark job for analyzing the test csv data. The scripts can be run in the HDP 2.6 sandbox.