Jan 13, 2020 in Big Data | Hadoop
Q: What are the two ways to create an RDD in Spark?

1 Answer

Jan 13, 2020

We can create an RDD in Spark in the following two ways:

1. Internal: We can parallelize an existing collection that lives in the Spark driver program, which distributes its elements across the cluster as an RDD.

2. External: We can also create an RDD by referencing a dataset in an external data source such as AWS S3, HDFS, or HBase.
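
Both approaches can be sketched in a few lines of PySpark. This is only a minimal illustration, not a definitive implementation: the helper names (rdd_from_collection, rdd_from_external), the application name, and the HDFS path are hypothetical and chosen for this example. The only Spark calls used are SparkContext.parallelize and SparkContext.textFile.

```python
# Hypothetical path, used only for illustration; replace it with a real
# location (for example an hdfs:// or s3a:// URI) before running main().
EXAMPLE_PATH = "hdfs://namenode:8020/data/input.txt"


def rdd_from_collection(sc, data):
    """Internal: distribute a collection held in the driver as an RDD."""
    return sc.parallelize(data)


def rdd_from_external(sc, path):
    """External: reference a text dataset in external storage as an RDD."""
    return sc.textFile(path)


def main():
    # Imported here so the helpers above can be loaded without Spark installed.
    from pyspark import SparkContext

    # "local[*]" runs Spark inside this process, using all available cores.
    sc = SparkContext("local[*]", "rdd-creation-demo")
    try:
        numbers = rdd_from_collection(sc, [1, 2, 3, 4, 5])
        print(numbers.count())

        # textFile is lazy: nothing is read until an action such as
        # count() or collect() is called on the returned RDD.
        lines = rdd_from_external(sc, EXAMPLE_PATH)
        print(lines.name())
    finally:
        sc.stop()
```

Calling main() on a machine with PySpark installed runs the first case end to end. The external case only declares the RDD, so the hypothetical path is not actually read. Sources such as HBase are usually reached through a connector or an input format rather than textFile, so that part is not covered by this sketch.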

 
