Consider sales dataset with column names as CustomerID, Location, Merchant, Amount. Requirement is to create a paired RDD with CustomerID as key and Amount as value. Which of the below code snippet is correct to create a paired RDD.
val SalesData = sc.textFile("HDFS Path")
val PairedSalesData = SalesData.map{record => (record.split(",")(0),record.split(",")(3).toLong)}
val SalesData = sc.textFile("HDFS Path")
val PairedSalesData = SalesData.flatMap{record => (record.split(",")(1),record.split(",")(4).toLong)}
val SalesData = sc.textFile("HDFS Path")
val PairedSalesData = SalesData.reduce{record => (record.split(",")(0),record.split(",")(3).toLong)}
val SalesData = sc.textFile("HDFS Path")
val PairedSalesData = SalesData.filter{record => (record.split(",")(0),record.split(",")(3).toLong)}
To get all Infosys Certified Spark Professional Exam questions Join Telegram Group https://rebrand.ly/lex-telegram-236dee