How do I take the first 10 records and the last 10 records of a file and combine them into one DataFrame in Spark?

1 Answer

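import org.apache.spark.sql.types.{StringType, StructField, StructType}

// Read the CSV with no header option: every column is a string, and a header line, if present, becomes the first row.
val loadDF = spark.read.format("csv").option("path", "file:///home/maria_dev/Files/assignment_table.csv").load()

// Collect the rows to the driver once, then keep the first 10 and the last 10.
val rows = loadDF.collect()
val combineDF = rows.take(10) ++ rows.takeRight(10)

// Use the trimmed values of the first row as column names, all typed as nullable strings.
val schemaDesign = loadDF.first.toSeq.map(c => c.toString.trim).map(col => StructField(col, StringType, true))

val schema = StructType(schemaDesign)

// Rebuild a DataFrame from the selected rows.
val createDF = spark.createDataFrame(spark.sparkContext.parallelize(combineDF), schema)

createDF.show()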
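
One edge case worth noting: concatenating the first 10 and the last 10 rows repeats rows whenever the file has fewer than 20 of them. The sketch below shows one way to guard against that on rows that are already collected to the driver. The object `FirstAndLast` and its method `firstAndLast` are hypothetical names introduced here for illustration; they are not part of Spark or of the answer above.

```scala
// Hypothetical helper, not part of the original answer or of the Spark API.
object FirstAndLast {

  // Returns the first n and last n elements of rows.
  // When the two ranges touch or overlap (length <= 2 * n), the whole
  // sequence is returned once instead of repeating the shared elements.
  def firstAndLast[T](rows: Seq[T], n: Int): Seq[T] = {
    require(n >= 0, "n must be non-negative")
    if (rows.length <= 2 * n) rows
    else rows.take(n) ++ rows.takeRight(n)
  }

  def main(args: Array[String]): Unit = {
    // 25 elements: the head and tail ranges are disjoint.
    println(firstAndLast((1 to 25).toList, 10))
    // 15 elements: the ranges overlap, so every element appears exactly once.
    println(firstAndLast((1 to 15).toList, 10))
  }
}
```

In a Spark session this could be applied as FirstAndLast.firstAndLast(loadDF.collect().toSeq, 10). Like the answer, it assumes the data is small enough to collect to the driver.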
