A DataFrame in Spark SQL is a Dataset organized into named columns. It is conceptually similar to a table in a relational database.
In Java and Scala, a DataFrame is represented by a Dataset of Row objects (Dataset<Row> in Java, Dataset[Row] in Scala).
We can create a DataFrame from an existing RDD, from a Hive table, or from other Spark data sources.