Data Science Questions and Answers Part-19

1. Hierarchical clustering should be primarily used for exploration.
a) True
b) False

Answer: a
Explanation: Hierarchical clustering is deterministic.

2. Which of the following function is used for k-means clustering?
a) k-means
b) k-mean
c) heatmap
d) none of the mentioned

Answer: a
Explanation: K-means requires a number of clusters.

3. Which of the following clustering requires merging approach?
a) Partitional
b) Hierarchical
c) Naive Bayes
d) None of the mentioned

Answer: b
Explanation: Hierarchical clustering requires a defined distance as well.

4. K-means is not deterministic and it also consists of number of iterations.
a) True
b) False

Answer: a
Explanation: K-means clustering produces the final estimate of cluster centroids

5. Which of the following gave rise to need of graphs in data analysis?
a) Data visualization
b) Communicating results
c) Decision making
d) All of the mentioned

Answer: d
Explanation: A picture can tell better story than data.

6. Which of the following is characteristic of exploratory graph?
a) Made slowly
b) Axes are not cleaned up
c) Color is used for personal information
d) All of the mentioned

Answer: c
Explanation: A large number of exploratory graphs are made.

7. Point out the correct statement.
a) coplots are one dimensional data graph
b) Exploratory graphs are made quickly
c) Exploratory graphs are made relatively less in number
d) All of the mentioned

Answer: a
Explanation: coplot is used for two dimensional representation.

8. Which of the following graph can be used for simple summarization of data?
a) Scatterplot
b) Overlaying
c) Barplot
d) All of the mentioned

Answer: c
Explanation: A bar chart or bar graph is a chart that presents Grouped data with rectangular bars with lengths proportional to the values that they represent.

9. Color and shape are used to add dimensions to graph data.
a) True
b) False

Answer: a
Explanation: Graphs are commonly used by print and electronic media.

10. Which of the following information is not given by five-number summary?
a) Mean
b) Median
c) Mode
d) All of the mentioned

Answer: c
Explanation: The mode is the value that appears most often in a set of data.