Pilot Flying J interview question

How would you handle distributed data processing in pySpark.