Question : pyspark dense
Answered by : nasty-narwhal-p25lfch62w2l
import pyspark.sql.functions as F
import pyspark.sql.types as T
#or: to_array = F.udf(lambda v: list([float(x) for x in v]), T.ArrayType(T.FloatType()))
to_array = F.udf(lambda v: v.toArray().tolist(), T.ArrayType(T.FloatType()))
df = df.withColumn('features', to_array('features'))
Source : https://stackoverflow.com/questions/58490770/convert-pyspark-densevector-to-array | Last Update : Wed, 21 Oct 20