What is the significance of using the --split-by clause in Apache Sqoop?

Question

What is the significance of using the --split-by clause in Apache Sqoop?

1 Answer

SakshiSharma · Answer 1 · 2022-12-22T15:37:02+0000

--split-by specifies the table column that Sqoop uses to divide an import into splits when loading data into the Hadoop cluster. Each split is handed to a separate mapper, so the import runs in parallel and performance improves. Sqoop splits on the table's primary key by default, so the clause matters most when the table has no primary key or when the key's values are unevenly distributed. Choose a column whose values are evenly distributed, so that each mapper imports a similar share of the rows.
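To make the idea of splits concrete, here is a rough Python sketch of how a numeric split column could be carved into one range per mapper. It is only an illustration, not Sqoop's actual splitter code: the function names, the column name `id`, and the exact boundary arithmetic are assumptions. It takes the minimum and maximum of the split column (Sqoop obtains these with a boundary query on the column) and cuts that interval into contiguous pieces.

```python
def split_ranges(min_val, max_val, num_mappers):
    """Cut the inclusive integer range [min_val, max_val] into contiguous
    sub-ranges, one per mapper. Illustrative only; not Sqoop's own code."""
    if num_mappers < 1:
        raise ValueError("num_mappers must be at least 1")
    if min_val > max_val:
        raise ValueError("min_val must not exceed max_val")

    total = max_val - min_val + 1          # number of distinct key values
    n = min(num_mappers, total)            # never create an empty range
    base, extra = divmod(total, n)         # first `extra` ranges get one more value

    ranges = []
    lo = min_val
    for i in range(n):
        size = base + (1 if i < extra else 0)
        hi = lo + size - 1
        ranges.append((lo, hi))
        lo = hi + 1
    return ranges


def split_conditions(column, min_val, max_val, num_mappers):
    """Render each range as the WHERE condition a mapper might run."""
    return [
        f"{column} >= {lo} AND {column} <= {hi}"
        for lo, hi in split_ranges(min_val, max_val, num_mappers)
    ]


if __name__ == "__main__":
    # Hypothetical table whose `id` column runs from 1 to 100, with 4 mappers.
    for condition in split_conditions("id", 1, 100, 4):
        print(condition)
```

The ranges are equal in width, not in row count. If most rows cluster in one part of the column's range, one mapper ends up doing most of the work, which is why an evenly distributed column makes the best --split-by choice.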
