Which of the following cluster diagrams and dendrograms depicts the use of the group average proximity function in hierarchical clustering?

A. Q 21, Option A

B. Q 21, Option B

C. Q 21, Option C

D. Q 21, Option D

1 Answer

Answer: (C)

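For the group average version of hierarchical clustering, the proximity of two clusters is defined as the average of the pairwise proximities between all pairs of points drawn from the two different clusters. This is an intermediate approach between MIN (single link) and MAX (complete link). For clusters C_i and C_j of sizes m_i and m_j, it is expressed by the following equation:

$$\mathrm{proximity}(C_i, C_j) = \frac{\sum_{x \in C_i,\; y \in C_j} \mathrm{proximity}(x, y)}{m_i \cdot m_j}$$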

Here, the distances between the three candidate clusters are:

dist({3, 6, 4}, {1}) = (0.2218 + 0.3688 + 0.2347)/(3 ∗ 1) = 0.2751
dist({2, 5}, {1}) = (0.2357 + 0.3421)/(2 ∗ 1) = 0.2889
dist({3, 6, 4}, {2, 5}) = (0.1483 + 0.2843 + 0.2540 + 0.3921 + 0.2042 + 0.2932)/(3 ∗ 2) = 0.2627

Because dist({3, 6, 4}, {2, 5}) is smaller than both dist({3, 6, 4}, {1}) and dist({2, 5}, {1}), clusters {3, 6, 4} and {2, 5} are merged at the fourth stage.
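
The averages can be checked with a short Python sketch. It is only an illustration: the names `PAIR_DIST`, `pair_distance`, and `group_average` are hypothetical, and the assignment of each quoted distance to a specific pair of points is an assumption (the answer lists the values without labelling them). That assignment does not change the averages, since each cluster-to-cluster sum uses the same set of values.

```python
from itertools import product

# Pairwise distances quoted in the answer. Which value belongs to which
# pair within a group is assumed; it does not affect the group averages.
PAIR_DIST = {
    (3, 1): 0.2218, (4, 1): 0.3688, (6, 1): 0.2347,
    (2, 1): 0.2357, (5, 1): 0.3421,
    (3, 2): 0.1483, (3, 5): 0.2843,
    (6, 2): 0.2540, (6, 5): 0.3921,
    (4, 2): 0.2042, (4, 5): 0.2932,
}


def pair_distance(a, b):
    """Look up a symmetric point-to-point distance."""
    if (a, b) in PAIR_DIST:
        return PAIR_DIST[(a, b)]
    return PAIR_DIST[(b, a)]


def group_average(cluster_a, cluster_b):
    """Average of all pairwise distances between two clusters."""
    total = sum(pair_distance(a, b) for a, b in product(cluster_a, cluster_b))
    return total / (len(cluster_a) * len(cluster_b))


# The three clusters that exist before the fourth merge.
clusters = [(3, 6, 4), (2, 5), (1,)]

# Group average distance for every pair of clusters.
candidates = {
    (ca, cb): group_average(ca, cb)
    for i, ca in enumerate(clusters)
    for cb in clusters[i + 1:]
}

# The pair with the smallest group average distance is merged next.
closest = min(candidates, key=candidates.get)

for (ca, cb), d in candidates.items():
    print(ca, cb, round(d, 4))
print("merge next:", closest)
```

The pair selected by `min` is {3, 6, 4} and {2, 5}, which is the merge described for the fourth stage.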
