0 votes
in Apache Spark by

What are the data formats supported by Spark?

1 Answer

0 votes
by

Spark supports a variety of data formats, including but not limited to:

Text Files: Plain text files (e.g., CSV, JSON).

SequenceFiles: A Hadoop data format.

Parquet: A columnar storage format.

ORC: Optimized Row Columnar format.

Avro: A binary format used for serializing data.

Image Files: For processing images.

LibSVM: Common format for support vector machine algorithms.

...