Question

Consider the code snippet below.

rdd = sc.parallelize(["user1,password1", "user2,password2", "user2,password2", "user4,password4", "user1,password1"])

The above dataset indicates the login details of each user.

The schema is (username, password).

Choose the code snippet that generates the following output: [("user4",1),("user2",2),("user1",2)]

anonymous · Accepted Answer

Correct Solution - a
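
The answer options are not reproduced here, so the following is only a sketch of one way to get the requested output, not necessarily what option a contains. It assumes the records are split on the comma, each username is paired with a count of 1, the counts are summed with reduceByKey, and the result is sorted by username in descending order. The helper name count_logins is hypothetical, and the descending sort is an assumption: the expected output is also consistent with sorting by count in ascending order.

```python
from operator import add


def count_logins(rdd):
    """Count login records per username, sorted by username descending.

    `rdd` is assumed to be a PySpark RDD of "username,password" strings,
    such as the one built with sc.parallelize in the question.
    """
    return (
        rdd.map(lambda line: (line.split(",")[0], 1))  # (username, 1) per record
        .reduceByKey(add)                               # sum the 1s per username
        .sortByKey(ascending=False)                     # assumed order: user4, user2, user1
        .collect()                                      # bring the pairs back to the driver
    )
```

With the rdd from the question, count_logins(rdd) returns [('user4', 1), ('user2', 2), ('user1', 2)]. The explicit sort matters because reduceByKey alone gives no guarantee about the order of the collected pairs.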