Imagine you work with patient data from our HyperClinic. We'd like to cluster our patients into groups, so we may gain new insights about them. You find the random initialization of k-means too naive, so you'd like to try out k-means++. You've selected a data point for the first centroid. Which data point should be selected for the second centroid initialization? Below you find data points together with their distances to the first centroid.
K-Means
k-means++
Report a typo
Select one option from the list
___
By continuing, you agree to the JetBrains Academy Terms of Service as well as Hyperskill Terms of Service and Privacy Policy.
Create a free account to access the full topic
By continuing, you agree to the JetBrains Academy Terms of Service as well as Hyperskill Terms of Service and Privacy Policy.