Top 100+ questions in Clustering - The Data Ensemble
Clustering - The Data Ensemble
Q: Which of the following methods is used for finding the optimal number of clusters in the K-Means algorithm?
Mar 20
john ganales
Q: Feature scaling is an important step before applying the K-Means algorithm. What is the reason behind this?
Mar 20
john ganales
Q: Two variables, V1 and V2, are used for clustering. Which of the following are true for K-Means clustering with k = 3?
Mar 20
john ganales
Q: Assume you want to cluster 7 observations into 3 clusters using the K-Means clustering algorithm. After the first iteration, clusters C1, C2, C3 have the following observations:
Mar 20
john ganales
Q: Assume you want to cluster 7 observations into 3 clusters using the K-Means clustering algorithm. After the first iteration, clusters C1, C2, C3 have the following observations:
Mar 20
john ganales
Q: The K-Means algorithm has some limitations. For example, it makes hard assignments of points to clusters (a point either belongs completely to a cluster or does not belong to it at all).
Mar 20
john ganales
Q: Which of the following is/are valid iterative strategies for treating missing values before clustering analysis?
Mar 20
john ganales
Q: What is the best choice for the number of clusters based on the following results?
Mar 20
john ganales
Q: Which of the following clustering representations and dendrograms depicts the use of the Ward's method proximity function in hierarchical clustering?
Mar 20
john ganales
Q: Which of the following clustering representations and dendrograms depicts the use of the group average proximity function in hierarchical clustering?
Mar 20
john ganales
Q: Which of the following clustering representations and dendrograms depicts the use of the MAX (complete link) proximity function in hierarchical clustering?
Mar 20
john ganales
Q: Which of the following clustering representations and dendrograms depicts the use of the MIN (single link) proximity function in hierarchical clustering?
Mar 20
john ganales
Q: Which of the following is/are true?
Mar 20
john ganales
Q: Which of the following metrics do we have for finding dissimilarity between two clusters in hierarchical clustering?
Mar 20
john ganales
Q: In which of the following cases will K-Means clustering fail to give good results?
Mar 20
john ganales
Q: What is the most appropriate number of clusters for the data points represented by the following dendrogram?
Mar 19
Robin
Q: In the figure below, if you draw a horizontal line at y = 2, how many clusters will be formed?
Mar 19
Robin
Q: What could be the possible reason(s) for producing two different dendrograms using an agglomerative clustering algorithm for the same dataset?
Mar 19
Robin
Q: How can clustering (unsupervised learning) be used to improve the accuracy of a linear regression model (supervised learning)?
Mar 19
Robin
Q: After performing K-Means clustering analysis on a dataset, you observed the following dendrogram. Which of the following conclusions can be drawn from the dendrogram?
Mar 19
Robin
Q: Which of the following algorithms is most sensitive to outliers?
Mar 19
Robin
Q: Which of the following clustering algorithms suffers from the problem of convergence at local optima?
Mar 19
Robin
Q: Which of the following can act as possible termination conditions in K-Means?
Mar 19
Robin
Q: Is it possible that the assignment of observations to clusters does not change between successive iterations in K-Means?
Mar 19
Robin
Q: For two runs of K-Means clustering, is it expected to get the same clustering results?
Mar 19
Robin
Q: What is the minimum number of variables/features required to perform clustering?
Mar 19
Robin
Q: Which of the following is the most appropriate strategy for data cleaning before performing clustering analysis, given less than the desirable number of data points? Capping and flooring of variables
Mar 19
Robin
Q: Can decision trees be used for performing clustering?
Mar 19
Robin
Q: Sentiment Analysis is an example of:
Mar 19
Robin
Q: Movie recommendation systems are an example of:
Mar 19
Robin
...