Please suggest a PySpark DataFrame alternative to Pandas df['col'].unique().
I want to list out all the unique values in a pyspark dataframe column.
Not the SQL way (registerTempTable, then a SQL query for distinct values).
Also, I don't need groupby->countDistinct; instead I want to see the distinct VALUES in that column.