Apache Spark [http://spark.apache.org/] is definitely one of the hottest topics
in the Data Science community at the moment. Last month when we visited PyData
Amsterdam 2016 [http://pydata.org/amsterdam2016/] we witnessed a great example
of Spark's immense popularity. The speakers at PyData talking about Spark had