Consider sales dataset with column names as CustomerID, Location, Merchant, Amount. Requirement is to create a paired RDD with CustomerID as key and Amount as value. Which of the below code snippet is correct to create a paired RDD.

val SalesData = sc.textFile("HDFS Path") 

val PairedSalesData = SalesData.map{record => (record.split(",")(0),record.split(",")(3).toLong)}

val SalesData = sc.textFile("HDFS Path") 

val PairedSalesData = SalesData.flatMap{record => (record.split(",")(1),record.split(",")(4).toLong)}

val SalesData = sc.textFile("HDFS Path")

val PairedSalesData = SalesData.reduce{record => (record.split(",")(0),record.split(",")(3).toLong)}

val SalesData = sc.textFile("HDFS Path")

val PairedSalesData = SalesData.filter{record => (record.split(",")(0),record.split(",")(3).toLong)}

Verified Answer
Correct Option - a

To get all Infosys Certified Spark Professional Exam questions Join Telegram Group https://rebrand.ly/lex-telegram-236dee

Telegram