Hadoop Questions and Answers Part-16

1. Point out the correct statement.
a) Another limitation of the Hadoop MapReduce framework is its pull-based scheduling model
b) The MapReduce framework sorts the outputs of the maps, which are then input to the reduce tasks
c) The MapReduce framework takes care of scheduling tasks, monitoring them, and re-executing failed tasks
d) All of the mentioned

Answer: d
Explanation: Typically both the input and the output of the job are stored in a file-system.

2. Hadoop __________ is a utility which allows users to create and run jobs with any executable as the mapper and/or the reducer.
a) Streaming
b) Pipes
c) Orchestration
d) All of the mentioned

Answer: a
Explanation: Applications specify the input/output locations and supply map and reduce functions.
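
As a rough illustration of question 2: Hadoop Streaming exchanges records with the mapper and reducer executables as lines of text on standard input and output, with key and value separated by a tab by default. The sketch below is a hypothetical word-count pair written as plain Python functions (the names map_lines and reduce_lines are illustrative, not part of Hadoop). In a real job each would sit in its own script that reads sys.stdin and prints its results.

```python
from itertools import groupby


def map_lines(lines):
    """Mapper: emit one 'word<TAB>1' line per word in the input lines."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reduce_lines(sorted_lines):
    """Reducer: sum the counts for each word.

    Assumes the input is already sorted by key, which is what the
    framework provides between the map and reduce phases.
    """
    pairs = (line.rstrip("\n").split("\t", 1) for line in sorted_lines)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


if __name__ == "__main__":
    # Local stand-in for the framework: map, sort by key, then reduce.
    mapped = sorted(map_lines(["to be or not", "to be"]))
    for out in reduce_lines(mapped):
        print(out)
```

Sorting the mapper output locally stands in for the framework's sort step, so the pair can be tried without a cluster.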

3. Hadoop _________ is a SWIG-compatible C++ API to implement MapReduce applications.
a) Streaming
b) Pipes
c) Orchestration
d) All of the mentioned

Answer: b
Explanation: The MapReduce framework operates exclusively on <key, value> pairs.

4. Point out the wrong statement.
a) MapReduce configuration allows the framework to effectively schedule tasks on the nodes where data is already present
b) Typically the compute nodes and the storage nodes are different
c) The MapReduce framework consists of a single master JobTracker and one slave TaskTracker per cluster-node
d) None of the mentioned

Answer: b
Explanation: MapReduce framework and the Hadoop Distributed File System are running on the same set of nodes.

5. The key and value classes have to be _________ by the MapReduce framework.
a) collected
b) serializable
c) compacted
d) none of the mentioned

Answer: b
Explanation: Both key and value classes need to implement the Writable interface so that the framework can serialize them.
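
Serializable here means the framework can turn a key or value into bytes and back. In Hadoop this goes through the Writable interface's write and readFields methods. The Python class below is only an analogy under that assumption: the class name IntPairKey and its byte layout are made up for illustration. The ordering method hints at what question 6 asks about, since sortable keys need a comparison on top of serialization.

```python
import struct


class IntPairKey:
    """Hypothetical key holding two ints, with a Writable-like round trip."""

    _FORMAT = ">ii"  # two big-endian 32-bit signed integers

    def __init__(self, first=0, second=0):
        self.first = first
        self.second = second

    def write(self):
        """Serialize the fields to bytes (analogous to Writable.write)."""
        return struct.pack(self._FORMAT, self.first, self.second)

    def read_fields(self, data):
        """Restore the fields from bytes (analogous to Writable.readFields)."""
        self.first, self.second = struct.unpack(self._FORMAT, data)

    def __eq__(self, other):
        return (self.first, self.second) == (other.first, other.second)

    def __lt__(self, other):
        # Ordering is what a sortable key adds on top of serialization.
        return (self.first, self.second) < (other.first, other.second)
```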

6. Key classes have to implement the __________ interface to facilitate sorting by the framework.
a) Writable
b) Comparable
c) WritableComparable
d) None of the mentioned

Answer: c
Explanation: Input and output types of a MapReduce job: (input) <k1, v1> -> map -> <k2, v2> -> combine -> <k2, v2> -> reduce -> <k3, v3> (output).
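
The type flow above can be traced with a small in-memory word count. This is a sketch, not Hadoop code: the function names are hypothetical, each inner list stands in for one input split, and a line index stands in for the byte offset that usually serves as k1.

```python
from collections import defaultdict


def map_fn(offset, line):
    """<k1, v1> = (line index, line text) -> list of <k2, v2> = (word, 1)."""
    return [(word, 1) for word in line.split()]


def combine_fn(word, counts):
    """Local pre-aggregation on one split's map output: <k2, [v2]> -> <k2, v2>."""
    return (word, sum(counts))


def reduce_fn(word, counts):
    """<k2, [v2]> -> <k3, v3> = (word, total count)."""
    return (word, sum(counts))


def group_by_key(pairs):
    """Group values by key and return the groups in sorted key order."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return sorted(groups.items())


def run_job(splits):
    """Run map and combine per split, then one reduce over all splits."""
    combined = []
    for split in splits:
        mapped = []
        for offset, line in enumerate(split):
            mapped.extend(map_fn(offset, line))
        combined.extend(combine_fn(k, vs) for k, vs in group_by_key(mapped))
    return [reduce_fn(k, vs) for k, vs in group_by_key(combined)]
```

Because the combiner and the reducer both just sum, the combiner can shrink each split's output without changing the final counts.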

7. The ________ option allows applications to add jars to the classpaths of the maps and reduces.
a) optionname
b) -libjars
c) -archives
d) all of the mentioned

Answer: b
Explanation: Applications can specify a comma-separated list of paths which would be present in the current working directory of the task using the -files option.

8. The ___________ option allows applications to pass a comma-separated list of archives as arguments.
a) optionname
b) -libjars
c) -archives
d) none of the mentioned

Answer: c
Explanation: These archives are unarchived and a link with the name of the archive is created in the current working directory of tasks.

9. Users can specify a different symbolic name for files and archives passed through the -files and -archives options, using __________
a) $
b) @
c) #
d) $

Answer: c
Explanation: The symbolic name follows the # in each entry; for example, -files dir1/dict.txt#dict1 makes the file available to tasks under the name dict1.
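
To make the # convention from question 9 concrete, the sketch below splits a comma-separated -files or -archives value into paths and symbolic names. It only illustrates the syntax and is not how Hadoop itself parses the option. The helper names are hypothetical, and falling back to the last path component when no # is given is an assumption of this sketch.

```python
from pathlib import PurePosixPath


def link_name(entry):
    """Return (path, symbolic name) for one -files or -archives entry.

    Assumption for this sketch: without a '#name' suffix, the link is
    named after the last component of the path.
    """
    path, _, name = entry.partition("#")
    return path, name or PurePosixPath(path).name


def parse_option(value):
    """Split a comma-separated option value into (path, name) pairs."""
    return [link_name(entry) for entry in value.split(",")]
```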

10. The Mapper implementation processes one line at a time via _________ method.
a) map
b) reduce
c) mapper
d) reducer

Answer: a
Explanation: The Mapper outputs are sorted and then partitioned per Reducer.
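
A minimal sketch of the sort-and-partition step described above, assuming hash partitioning on the key. The function names are hypothetical, and zlib.crc32 is chosen only because it gives the same value on every run; it is not the hash Hadoop uses.

```python
import zlib


def partition(key, num_reducers):
    """Pick a reducer index for a key by hashing it.

    zlib.crc32 is used only because it is stable across Python runs;
    it is an assumption of this sketch, not the hash Hadoop uses.
    """
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def shuffle(pairs, num_reducers):
    """Sort map output pairs and route each one to its reducer's bucket."""
    buckets = [[] for _ in range(num_reducers)]
    for key, value in sorted(pairs):
        buckets[partition(key, num_reducers)].append((key, value))
    return buckets
```

Every pair with the same key lands in the same bucket, which is what lets a single reducer see all the values for that key.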