I come from pandas background and am used to reading data from CSV files into a dataframe and then simply changing the column names to something useful using the...
I’m just wondering what is the difference between an RDD and DataFrame (Spark 2.0.0 DataFrame is a mere type alias for Dataset...