Data Science Questions and Answers Part-18

1. Which of the following parameters defines the line type, such as dashed or dotted?
a) lty
b) pch
c) lwd
d) all of the mentioned

Answer: a
Explanation: lty sets the line type (solid, dashed, dotted, and so on). lwd sets the line width and pch sets the plotting symbol.

2. The core plotting engine is encapsulated in the graphics package.
a) True
b) False

Answer: a
Explanation: The graphics package contains the plotting functions of the base system, such as plot, hist, and boxplot.

3. Which of the following arguments to the par function specifies the margin size?
a) las
b) bg
c) mar
d) all of the mentioned

Answer: c
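Explanation: The par function sets global graphics parameters. mar sets the margin size, las sets the orientation of the axis labels, and bg sets the background color.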

4. How many stages commonly occur in the creation of a plot?
a) 2
b) 5
c) 8
d) All of the mentioned

Answer: a
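Explanation: In the base plotting system, a plot is created in two stages: initializing a new plot, then annotating (adding to) the existing plot. This makes the system highly flexible.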

5. Base graphics are used most commonly for creating 2D graphics.
a) True
b) False

Answer: a
Explanation: Base graphics is a very powerful system for creating 2D graphics.

6. Point out the correct statement.
a) The choice of an appropriate metric will influence the shape of the clusters
b) Hierarchical clustering is also called HCA
c) In general, the merges and splits are determined in a greedy manner
d) All of the mentioned

Answer: d
Explanation: All three statements hold. In particular, the metric matters because some elements may be close to one another according to one distance and farther apart according to another.
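
To make the point about metrics concrete, here is a minimal sketch using only the Python standard library. The points and the helper names (euclidean, manhattan) are illustrative assumptions, not part of the quiz. It shows two points whose order of closeness to the origin flips when the metric changes.

```python
import math

def euclidean(p, q):
    # Straight-line distance between two points.
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def manhattan(p, q):
    # Sum of absolute coordinate differences.
    return sum(abs(a - b) for a, b in zip(p, q))

# Hypothetical example points.
origin = (0.0, 0.0)
a = (3.0, 3.0)  # Euclidean about 4.24, Manhattan 6
b = (0.0, 5.0)  # Euclidean 5, Manhattan 5

# The nearest point to the origin depends on the metric.
nearest_euclidean = min([a, b], key=lambda p: euclidean(origin, p))
nearest_manhattan = min([a, b], key=lambda p: manhattan(origin, p))

print(nearest_euclidean)
print(nearest_manhattan)
```

Because a clustering algorithm groups points by closeness, a flip like this can change which points end up together.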

7. Which of the following is finally produced by Hierarchical Clustering?
a) final estimate of cluster centroids
b) tree showing how close things are to each other
c) assignment of each point to clusters
d) all of the mentioned

Answer: b
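Explanation: Hierarchical clustering produces a tree (dendrogram) showing how close things are to each other. Its most common form is agglomerative (bottom-up). Centroid estimates and cluster assignments are the outputs of K-means.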
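
The bottom-up process can be sketched in a few lines. This is a minimal illustration, assuming 1-D points and single linkage (cluster distance is the smallest gap between members). The function name single_linkage and the data are hypothetical. The recorded merge history is the information a dendrogram would draw.

```python
def single_linkage(points):
    """Return the merge history for 1-D points using single linkage."""
    clusters = [[p] for p in points]  # start with one cluster per point
    merges = []
    while len(clusters) > 1:
        best = None
        # Greedy step: find the closest pair of clusters.
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = min(abs(x - y) for x in clusters[i] for y in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((sorted(clusters[i]), sorted(clusters[j]), d))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return merges

# Hypothetical data: two tight pairs and one outlier.
merges = single_linkage([1.0, 2.0, 6.0, 7.5, 20.0])
for left, right, height in merges:
    print(left, right, height)
```

Each merge is chosen greedily from the current clusters, and the merge height (the distance at which two clusters join) grows as the tree is built upward.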

8. Which of the following is required by K-means clustering?
a) defined distance metric
b) number of clusters
c) initial guess as to cluster centroids
d) all of the mentioned

Answer: d
Explanation: K-means clustering follows a partitioning approach: it needs a distance metric, a fixed number of clusters, and starting centroids to refine.
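
The three requirements appear directly as inputs in the following minimal sketch. It assumes Euclidean distance via math.dist and a fixed number of iterations. The function name kmeans, the data, and the initial guesses are illustrative assumptions.

```python
import math

def kmeans(points, centroids, distance, iterations=10):
    """Plain k-means: k is the number of initial centroids supplied."""
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        groups = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: distance(p, centroids[i]))
            groups[nearest].append(p)
        # Update step: move each centroid to the mean of its group
        # (an empty group keeps its previous centroid).
        centroids = [
            tuple(sum(c) / len(g) for c in zip(*g)) if g else centroids[i]
            for i, g in enumerate(groups)
        ]
    return centroids, groups

# Hypothetical data and initial centroid guesses (k = 2).
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.0),
          (8.0, 8.0), (9.0, 9.0), (8.0, 9.5)]
initial = [(0.0, 0.0), (10.0, 10.0)]

centroids, groups = kmeans(points, initial, math.dist)
print(centroids)
print(groups)
```

The outputs are the final centroid estimates and the assignment of each point to a cluster. Different initial guesses can lead to different results.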

9. Point out the wrong statement.
a) k-means clustering is a method of vector quantization
b) k-means clustering aims to partition n observations into k clusters
c) k-nearest neighbor is same as k-means
d) none of the mentioned

Answer: c
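Explanation: k-nearest neighbor is a supervised method that predicts a label from the k closest labeled examples, whereas k-means is an unsupervised clustering method. The two share only the letter k.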
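
For contrast with k-means, here is a minimal k-nearest-neighbor sketch. It needs labeled training data, which k-means never uses. The function name knn_predict, the labels, and the points are hypothetical, and Euclidean distance is an assumption.

```python
import math
from collections import Counter

def knn_predict(train, query, k):
    """Predict a label by majority vote among the k nearest labeled points."""
    neighbors = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]

# Hypothetical labeled examples: (point, label).
train = [
    ((1.0, 1.0), "red"),
    ((1.2, 0.8), "red"),
    ((1.1, 1.3), "red"),
    ((5.0, 5.0), "blue"),
    ((5.5, 4.5), "blue"),
]

prediction = knn_predict(train, (1.0, 1.2), k=3)
print(prediction)
```

Here k counts the neighbors that vote, while in k-means it counts clusters.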

10. Which of the following combinations is incorrect?
a) Continuous – euclidean distance
b) Continuous – correlation similarity
c) Binary – manhattan distance
d) None of the mentioned

Answer: d
Explanation: You should choose a distance/similarity that makes sense for your problem.
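
The three pairings from the options can be sketched as follows, using only the standard library. The vectors and the helper name pearson are illustrative assumptions. For 0/1 vectors, the Manhattan distance simply counts the positions where the two vectors differ.

```python
import math

def pearson(x, y):
    # Correlation similarity for continuous vectors (1.0 means perfectly aligned).
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def manhattan(u, v):
    # Sum of absolute differences.
    return sum(abs(a - b) for a, b in zip(u, v))

# Hypothetical continuous vectors: y is exactly 2 * x.
x = [1.0, 2.0, 3.0, 4.0]
y = [2.0, 4.0, 6.0, 8.0]
euclidean_distance = math.dist(x, y)  # large, because the scales differ
correlation = pearson(x, y)           # close to 1, because the shapes match

# Hypothetical binary vectors.
u = [1, 0, 1, 1, 0]
v = [1, 1, 0, 1, 0]
binary_distance = manhattan(u, v)     # number of mismatched positions

print(euclidean_distance, correlation, binary_distance)
```

The continuous pair is far apart by Euclidean distance yet almost perfectly correlated, which is why the measure should be chosen to match what similarity means in the problem at hand.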