Hadoop Questions and Answers Part-10

1. Applications can use the _________ provided to report progress or just indicate that they are alive.
a) Collector
b) Reporter
c) Dashboard
d) None of the mentioned

Answer: b
Explanation: Reporting progress is crucial when the application takes a significant amount of time to process individual key/value pairs, because otherwise the framework might assume that the task has timed out and kill it.
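
As a rough illustration of why this matters, the following Python sketch models a timeout watchdog. It is a toy model, not Hadoop's implementation: the TaskWatchdog class, its method names, the 60-second step, and the 600-second timeout are all assumptions made for the example.

```python
class TaskWatchdog:
    """Toy model of a framework that kills tasks it has not heard from."""

    def __init__(self, timeout_s):
        self.timeout_s = timeout_s
        self.last_heard = 0.0

    def progress(self, now):
        # Analogous to a task reporting that it is still alive.
        self.last_heard = now

    def should_kill(self, now):
        return now - self.last_heard > self.timeout_s


def run_slow_task(watchdog, record_costs_s, report):
    """Process records costing `record_costs_s` seconds each.

    Returns False if the watchdog would have killed the task.
    """
    now = 0.0
    for cost in record_costs_s:
        remaining = cost
        while remaining > 0:
            advance = min(60.0, remaining)  # simulate work in 60 s steps
            now += advance
            remaining -= advance
            if report:
                watchdog.progress(now)
            elif watchdog.should_kill(now):
                return False
        # Finishing a record counts as progress in this toy model.
        watchdog.progress(now)
    return True
```

With a 600-second timeout, a single record that takes 900 seconds survives only when the task reports progress while working on it.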

2. Which of the following parameters is used to collect keys and combined values?
a) key
b) values
c) reporter
d) output

Answer: d
Explanation: The output parameter collects keys and combined values; the reporter parameter is a facility for reporting progress.

3. ________ is the name of the archive you would like to create.
a) archive
b) archiveName
c) name
d) none of the mentioned

Answer: b
Explanation: The name should have a *.har extension.

4. Point out the correct statement.
a) A Hadoop archive maps to a file system directory
b) Hadoop archives are special format archives
c) A Hadoop archive always has a *.har extension
d) All of the mentioned

Answer: d
Explanation: A Hadoop archive directory contains metadata (in the form of _index and _masterindex) and data (part-*) files.
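
The layout described above can be sketched as a small classifier over the file names found in an archive directory. The function name classify_har_entry and the "other" category are assumptions made for this example; only the _index, _masterindex and part-* names come from the explanation.

```python
from fnmatch import fnmatch

# Metadata file names as listed in the explanation above.
METADATA_FILES = {"_index", "_masterindex"}


def classify_har_entry(name):
    """Label a file name from a *.har directory as metadata, data, or other."""
    if name in METADATA_FILES:
        return "metadata"
    if fnmatch(name, "part-*"):
        return "data"
    return "other"
```

For instance, classify_har_entry("part-0") yields "data", while classify_har_entry("_masterindex") yields "metadata".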

5. Using Hadoop Archives in __________ is as easy as specifying a different input filesystem than the default file system.
a) Hive
b) Pig
c) MapReduce
d) All of the mentioned

Answer: c
Explanation: Because Hadoop Archives are exposed as a file system, MapReduce can use all the logical input files in an archive as input.

6. The __________ guarantees that excess resources taken from a queue will be restored to it within N minutes of its need for them.
a) capacitor
b) scheduler
c) datanode
d) none of the mentioned

Answer: b
Explanation: Free resources can be allocated to any queue beyond its guaranteed capacity.
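
The idea of guaranteed capacity plus borrowing can be sketched with a toy allocator. This is a simplified model and not the scheduler's actual algorithm: the allocate function, the slot counts, the queue names, and the name-order tie-breaking are all assumptions, and it presumes the guaranteed capacities sum to no more than the total.

```python
def allocate(total_slots, guaranteed, demand):
    """Toy two-pass allocation of slots to queues.

    guaranteed and demand map queue name -> number of slots.
    """
    # Pass 1: every queue receives up to its guaranteed capacity.
    alloc = {q: min(demand[q], guaranteed[q]) for q in guaranteed}
    free = total_slots - sum(alloc.values())
    # Pass 2: leftover slots go to queues that still want more (name order).
    for q in sorted(guaranteed):
        extra = min(free, demand[q] - alloc[q])
        alloc[q] += extra
        free -= extra
    return alloc
```

When queue "a" is mostly idle, queue "b" can run beyond its guarantee; once "a" needs its capacity again, recomputing the allocation gives it back.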

7. Point out the wrong statement.
a) The Hadoop archive exposes itself as a file system layer
b) Hadoop archives are immutable
c) Archive rename, deletes and creates return an error
d) None of the mentioned

Answer: d
Explanation: All the fs shell commands in the archives work but with a different URI.
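
A minimal sketch of how such a URI is put together, assuming the har://scheme-hostname:port/archivepath/fileinarchive form (or har:///archivepath/fileinarchive when the underlying file system is left implicit). The helper name har_uri and the host, port, and path values used with it are hypothetical.

```python
def har_uri(archive_path, file_in_archive="", scheme=None, host=None, port=None):
    """Build a har:// URI for an archive, optionally pointing at a file inside it."""
    if not archive_path.startswith("/"):
        raise ValueError("archive_path must be absolute")
    if scheme is None:
        # No scheme given: leave the authority empty (har:///...).
        authority = ""
    else:
        authority = f"{scheme}-{host}:{port}"
    uri = f"har://{authority}{archive_path}"
    if file_in_archive:
        uri += "/" + file_in_archive.lstrip("/")
    return uri
```

The resulting string is what would be handed to an fs shell command, or to a job as its input path, in place of an ordinary path.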

8. _________ is a pluggable Map/Reduce scheduler for Hadoop which provides a way to share large clusters.
a) Flow Scheduler
b) Data Scheduler
c) Capacity Scheduler
d) None of the mentioned

Answer: c
Explanation: The Capacity Scheduler supports multiple queues, where a job is submitted to a queue.

9. Which of the following parameters describes the destination directory that will contain the archive?
a) -archiveName <name>
b) <source>
c) <destination>
d) none of the mentioned

Answer: c
Explanation: -archiveName <name> is the name of the archive to be created.

10. _________ identifies filesystem path names which work as usual with regular expressions.
a) -archiveName <name>
b) <source>
c) <destination>
d) none of the mentioned

Answer: b
Explanation: <source> identifies the input path names, while <destination> identifies the destination directory that will contain the archive.
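
The three parameters from questions 9 and 10 can be put together in a short sketch that assembles the argument list for the archive tool. The helper archive_command is hypothetical, the paths are made up, and some Hadoop releases also expect a -p <parent> argument that this sketch omits.

```python
def archive_command(archive_name, sources, destination):
    """Assemble: hadoop archive -archiveName <name> <source>... <destination>"""
    if not archive_name.endswith(".har"):
        raise ValueError("archive name should have a .har extension")
    if not sources:
        raise ValueError("at least one source path is required")
    return ["hadoop", "archive", "-archiveName", archive_name, *sources, destination]
```

Returning a list rather than a single string keeps each path as a separate argument, which avoids quoting problems if the list is later passed to a process launcher.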