Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
446 views
in Technique by (71.8m points)

apache spark - Datatype error when comparing rows of a dataframe (Python)

I have a dataframe with the following schema:

root
 |-- distanceValue: integer (nullable = true)
 |-- timeOfMeasurement: timestamp (nullable = true)
 |-- EventProcessedUtcTime: timestamp (nullable = true)
 |-- latency: interval (nullable = true)

And the dataframe looks something like this:

distance |timeOfMeasurement           |EventProcessedUtcTime       |latency
---------+----------------------------+----------------------------+---------------------------------
15       |2021-01-04T07:07:45.098+0000|2021-01-04T07:07:45.676+0000|{"months": 0, "days": 0, "microseconds": 578885}
26       |2021-01-04T07:07:46.098+0000|2021-01-04T07:07:46.301+0000|{"months": 0, "days": 0, "microseconds": 203909}
23       |2021-01-04T07:07:47.113+0000|2021-01-04T07:07:47.353+0000|{"months": 0, "days": 0, "microseconds": 240287}

When trying to compare the distance with the distance from the previous row:

import pandas as pd
df['same'] = df.distance.eq(df.distance.shift()) 
    
# OR

import numpy as np
df['same'] = np.where(df.distance == df.distance.shift(), True, False)

I get an error saying: Could not parse datatype: interval

The distance, however, is an integer... Is the value getting mixed up with the latency, which is an interval? Thank you for any help.



1 Answer

0 votes
by (71.8m points)

Your df is a Spark DataFrame, not a pandas one, so pandas operations won't work on it directly (and the "Could not parse datatype: interval" error suggests the conversion to pandas chokes on the interval column, not on distance). You can instead use the lag window function in Spark:

from pyspark.sql import functions as F, Window

df = df.withColumn(
    'same',
    F.col('distance') == F.lag('distance').over(Window.orderBy('timeOfMeasurement'))
)
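Note that lag returns null for the first row, so same will be null there; wrap the comparison in F.coalesce(..., F.lit(False)) if you want a plain boolean. If you really need pandas, drop the interval column first (e.g. df.drop('latency').toPandas()) so the conversion succeeds; the row-to-row comparison itself works fine in plain pandas. A minimal sketch using the sample distances from the question (no Spark needed, and the trailing duplicate 23 is added just to show a True match):

```python
import pandas as pd

# Sample distances from the question, with one repeated value appended
pdf = pd.DataFrame({'distance': [15, 26, 23, 23]})

# shift() moves each value down one row; the first row compares
# against NaN and therefore yields False
pdf['same'] = pdf['distance'].eq(pdf['distance'].shift())
# → same is [False, False, False, True]
```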
