0 votes
in Apache Spark by
What is a Parquet file and what are its advantages?

1 Answer

0 votes
by

Parquet is a columnar format that is supported by several data processing systems. With the Parquet file, Spark can perform both read and write operations. 

Some of the advantages of having a Parquet file are:

  1. It enables you to fetch specific columns for access.
  2. It consumes less space
  3. It follows the type-specific encoding
  4. It supports limited I/O operations
...