Return An Rdd Of Grouped Items

[Solved] Return An Rdd Of Grouped Items | Abap - Code Explorer | yomemimo.com
Question : Return an RDD of grouped items.

Answered by : abdoulaye

rdd = sc.parallelize([1, 1, 2, 3, 5, 8])
result = rdd.groupBy(lambda x: x % 2).collect()
sorted([(x, sorted(y)) for (x,y) in result])
# [(0, [2, 8]), (1, [1, 1, 3, 5])]

Source : http://spark.apache.org/docs/2.1.0/api/python/pyspark.html#pyspark.RDD.groupBy | Last Update : Wed, 18 Mar 20

Answers related to return an rdd of grouped items

Code Explorer Popular Question For Abap