Hadoop Questions and Answers Part-23

1. The __________ is a framework-specific entity that negotiates resources from the ResourceManager.
a) NodeManager
b) ResourceManager
c) ApplicationMaster
d) All of the mentioned

Answer: c
Explanation: Each ApplicationMaster has the responsibility for negotiating appropriate resource containers from the schedule.

2. Point out the wrong statement.
a) From the system perspective, the ApplicationMaster runs as a normal container
b) The ResourceManager is the per-machine slave, which is responsible for launching the applications’ containers
c) The NodeManager is the per-machine slave, which is responsible for launching the applications’ containers, monitoring their resource usage
d) None of the mentioned

Answer: b
Explanation: ResourceManager has a scheduler, which is responsible for allocating resources to the various applications running in the cluster, according to constraints such as queue capacities and user limits.

3. Apache Hadoop YARN stands for _________
a) Yet Another Reserve Negotiator
b) Yet Another Resource Network
c) Yet Another Resource Negotiator
d) All of the mentioned

Answer: c
Explanation: YARN is a cluster management technology.

4. MapReduce has undergone a complete overhaul in hadoop is _________
a) 0.21
b) 0.23
c) 0.24
d) 0.26

Answer: b
Explanation: The fundamental idea of MRv2 is to split up the two major functionalities of the JobTracker.

5. The ____________ is the ultimate authority that arbitrates resources among all the applications in the system.
a) NodeManager
b) ResourceManager
c) ApplicationMaster
d) All of the mentioned

Answer: b
Explanation: The ResourceManager and per-node slave, the NodeManager (NM), form the data-computation framework.

6. The __________ is responsible for allocating resources to the various running applications subject to familiar constraints of capacities, queues etc.
a) Manager
b) Master
c) Scheduler
d) None of the mentioned

Answer: c
Explanation: The Scheduler is a pure scheduler in the sense that it performs no monitoring or tracking of status for the application.

7. The CapacityScheduler supports _____________ queues to allow for more predictable sharing of cluster resources.
a) Networked
b) Hierarchical
c) Partition
d) None of the mentioned

Answer: b
Explanation: The Scheduler has a pluggable policy plugin, which is responsible for partitioning the cluster resources among the various queues, applications etc.

8. ___________ generates keys of type LongWritable and values of type Text.
a) TextOutputFormat
b) TextInputFormat
c) OutputInputFormat
d) None of the mentioned

Answer: b
Explanation: If K2 and K3 are the same, you don’t need to call setMapOutputKeyClass().

9. Point out the correct statement.
a) The reduce input must have the same types as the map output, although the reduce output types may be different again
b) The map input key and value types (K1 and V1) are different from the map output types
c) The partition function operates on the intermediate key
d) All of the mentioned

Answer: d
Explanation: In practice, the partition is determined solely by the key (the value is ignored).

10. In _____________ the default job is similar, but not identical, to the Java equivalent.
a) Mapreduce
b) Streaming
c) Orchestration
d) All of the mentioned

Answer: b
Explanation: MapReduce Types and Formats MapReduce has a simple model of data processing.