One of the great features of Apache Spark is to allow you to write code in popular languages in data processing field, Python and R. But you could face poor performance when using Spark from those language for some reason.
Since Apache Spark 1.4, using DataFrame API would give you almost same performance as when using JVM languages like Scala or Java.
In this talk, I will show the background, examples, pitfalls of DataFrame API and how to get the best performance from Apache Spark from non-JVM languages.