A DataFrame in Spark SQL is a distributed collection of data organized into named columns, similar to a table in a relational database. A DataFrame can be constructed from a variety of data sources, including structured data files, Hive tables, and external databases.