Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
2.7k views
in Technique[技术] by (71.8m points)

python 3.x - How to efficiently run a query on a large dataframe in apply's function?

I have a code that runs on a large dataframe with about ~15k rows. it'll create a series of objects from each row. in order to create the object it'll have to query the dataframe which takes a long time. is there any cleaner and more performant way of achieving this?

import pandas as pd
foos: pd.Series = df.apply(get_foo, axis=1, args=(df, ))
def get_foo(row: pd.Series, df: pd.DataFrame):
    ...
    bar = df.query("col == @row.loc['colA']")
    ...
    return MyObject(bar=bar)

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Answer

0 votes
by (71.8m points)
等待大神解答

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...