Group list-of-tuples by second element, take average of first element

Question

I have a list of tuples (x,y) like:

l = [(2,1), (4,6), (3,1), (2,7), (7,10)]

Now I want to make a new list:

l = [(2.5,1), (4,6), (2,7), (7,10)]

In the new list, tuples that share the same second value (y) are merged into a single tuple whose first value (x) is the average of their x values.

Here, (2,1) and (3,1) share y=1, so the new list contains the average of x=2 and x=3, which is 2.5. No other y value occurs more than once, so the other tuples remain unchanged.
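For reference, the grouping described in the question can be sketched in plain Python, without pandas. This is only a minimal illustration, not part of the original post: the names groups and result are made up for the example, and the averaged x values come out as floats (4.0 rather than 4).

```python
from collections import defaultdict

l = [(2, 1), (4, 6), (3, 1), (2, 7), (7, 10)]

# Collect every x under its y; dicts preserve insertion order in Python 3.7+
groups = defaultdict(list)
for x, y in l:
    groups[y].append(x)

# One tuple per distinct y, holding the mean of its x values
result = [(sum(xs) / len(xs), y) for y, xs in groups.items()]
print(result)  # [(2.5, 1), (4.0, 6), (2.0, 7), (7.0, 10)]
```

Because the dictionary keeps keys in first-seen order, the output follows the order in which each y value first appears in the input.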

Quang Hoang · Accepted Answer

Since you tagged pandas:

import pandas as pd

l = [(2,1), (4,6), (3,1), (2,7), (7,10)]
df = pd.DataFrame(l)

Then df is a DataFrame with two columns, labeled 0 (the x values) and 1 (the y values).

Now you want to compute the average of the numbers in column 0 with the same value in column 1:

(df.groupby(1).mean()     # compute mean on each group
   .reset_index()[[0,1]]  # restore the column order
   .values                # return the underlying numpy array
 )

Output:

array([[ 2.5,  1. ],
       [ 4. ,  6. ],
       [ 2. ,  7. ],
       [ 7. , 10. ]])
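The result above is a float NumPy array, whereas the question asked for a list of tuples. One possible way to get that shape is sketched below. The column names "x" and "y" and the variable names means and result are assumptions introduced for readability, and sort=False is used so the groups keep their first-seen order instead of being sorted by y.

```python
import pandas as pd

l = [(2, 1), (4, 6), (3, 1), (2, 7), (7, 10)]
df = pd.DataFrame(l, columns=["x", "y"])  # column names are illustrative

# Average x within each y group; sort=False keeps groups in first-seen order
means = df.groupby("y", sort=False)["x"].mean()

# means is a Series indexed by y, so items() yields (y, mean_x) pairs
result = [(float(x), int(y)) for y, x in means.items()]
print(result)  # [(2.5, 1), (4.0, 6), (2.0, 7), (7.0, 10)]
```

The explicit float() and int() calls convert NumPy scalars to plain Python numbers, so y stays an integer in the final list.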

Tags: python, python-3.x, pandas, numpy, pandas-groupby

ubuntu_noob

