parallel_mean#

parallel_mean(mean_a, count_a, mean_b, count_b)[source]#

Compute the mean based on stats from two partitions of the data.

See “Parallel Algorithm” in https://en.wikipedia.org/wiki/Algorithms_for_calculating_variance

Parameters

mean_a – the mean of partition a
count_a – the number of elements in partition a
mean_b – the mean of partition b
count_b – the number of elements in partition b

Returns

the mean of the two partitions if they were combined