More RDD Ops - If we have to find count of each unique word... | Automated hands-on| CloudxLab

Apache Spark Basics


We have an RDD containing a list of words. Each record of this RDD is a word. You can assume that the RDD is similar to what gets created when we run the following code:

var rdd = sc.parallelize(Array("this", "this", "is", "good", "is"))

Suppose we need to find the count of each unique word in an RDD in which each record is a word. Which approach gives correct results?

rdd.map((_, 1)).reduceByKey(_ + _).collect()
rdd.countByValue()
Both
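
One way to check the options yourself is to run both expressions on the sample data and compare the results. The following is a minimal sketch, assuming Spark is on the classpath and a local master is acceptable; the object name WordCountCheck and the helper countBothWays are hypothetical names introduced only for this illustration. Note that the two expressions return different types: collect() on the reduceByKey result gives an Array[(String, Int)], while countByValue() gives a Map[String, Long] on the driver, so the sketch converts both to Map[String, Long] before comparing.

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Hypothetical helper object, not part of the original exercise.
object WordCountCheck {

  // Runs both approaches on a local SparkContext and returns
  // each result as an immutable Map[String, Long] for easy comparison.
  def countBothWays(words: Seq[String]): (Map[String, Long], Map[String, Long]) = {
    // Assumption: a local master is fine for this small check.
    val conf = new SparkConf().setAppName("WordCountCheck").setMaster("local[*]")
    val sc = new SparkContext(conf)
    try {
      val rdd = sc.parallelize(words)

      // Approach 1: pair each word with 1, then sum the 1s per word.
      val viaReduceByKey: Map[String, Long] =
        rdd.map((_, 1)).reduceByKey(_ + _).collect()
          .map { case (word, n) => (word, n.toLong) }
          .toMap

      // Approach 2: countByValue returns the per-value counts to the driver.
      val viaCountByValue: Map[String, Long] = rdd.countByValue().toMap

      (viaReduceByKey, viaCountByValue)
    } finally {
      sc.stop()
    }
  }

  def main(args: Array[String]): Unit = {
    val (a, b) = countBothWays(Seq("this", "this", "is", "good", "is"))
    println(s"map + reduceByKey: $a")
    println(s"countByValue:      $b")
    println(s"same counts:       ${a == b}")
  }
}
```

For the sample data, the expected counts are 2 for "this", 2 for "is", and 1 for "good". The order in which the pairs are printed is not guaranteed. Because countByValue() brings every distinct value back to the driver, it is best suited to RDDs with a modest number of distinct values.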
