How does Analyzr optimize the number of clusters when clustering a dataset?

When optimizing the number of clusters for a dataset, Analyzr will pick the number of clusters associated with the highest Silhouette score. Note that the Silhouette score is most relevant when dealing with well-behaved, i.e. convex-shaped, clusters. In many real-life cases clusters are not convex and the Silhouette score may no longer be relevant. In this case you are best served relying on your domain expertise to identify the most relevant number of clusters. 

Did you find it helpful? Yes No

Send feedback
Sorry we couldn't be helpful. Help us improve this article with your feedback.