pandas-groupby – IT Nursery

Multiple aggregations of the same column using pandas GroupBy.agg()

May 28, 2022 by IT Nursery

Is there a pandas built-in way to apply two different aggregating functions f1, f2 to the same column df[“returns”], without having to call agg() multiple times? Example dataframe: import pandas as pd import datetime as dt import numpy as np pd.np.random.seed(0) df = pd.DataFrame({ “date” : [dt.date(2012, x, 1) for x in range(1, 11)], “returns” … Read more

How to loop over grouped Pandas dataframe?

May 25, 2022 by IT Nursery

DataFrame: c_os_family_ss c_os_major_is l_customer_id_i 0 Windows 7 90418 1 Windows 7 90418 2 Windows 7 90418 Code: print df for name, group in df.groupby(‘l_customer_id_i’).agg(lambda x: ‘,’.join(x)): print name print group I’m trying to just loop over the aggregated data, but I get the error: ValueError: too many values to unpack @EdChum, here’s the expected output: … Read more

Get the row(s) which have the max value in groups using groupby

May 9, 2022 by IT Nursery

How do I find all rows in a pandas DataFrame which have the max value for count column, after grouping by [‘Sp’,’Mt’] columns? Example 1: the following DataFrame, which I group by [‘Sp’,’Mt’]: Sp Mt Value count 0 MM1 S1 a **3** 1 MM1 S1 n 2 2 MM1 S3 cb **5** 3 MM2 S3 … Read more

Converting a Pandas GroupBy output from Series to DataFrame

April 29, 2022 by IT Nursery

I’m starting with input data like this df1 = pandas.DataFrame( { “Name” : [“Alice”, “Bob”, “Mallory”, “Mallory”, “Bob” , “Mallory”] , “City” : [“Seattle”, “Seattle”, “Portland”, “Seattle”, “Seattle”, “Portland”] } ) Which when printed appears as this: City Name 0 Seattle Alice 1 Seattle Bob 2 Portland Mallory 3 Seattle Mallory 4 Seattle Bob 5 … Read more