Remove duplicates by columns A, keeping the row with the highest value in column B

May 20, 2022 by IT Nursery

I have a dataframe with repeat values in column A. I want to drop duplicates, keeping the row with the highest value in column B.

So this:

Should turn into this:

I’m guessing there’s probably an easy way to do this—maybe as easy as sorting the DataFrame before dropping duplicates—but I don’t know groupby’s internal logic well enough to figure it out. Any suggestions?

13 Answers
13

Leave a Comment Cancel reply