RAPID FIRE: APACHE SPARK / HADOOP
Sanjeev Kumar
Senior Software Engineer @ Wipro Technologies | IBM Certified, AWS Cloud Practitioner
1. Faster cousin of Apache Hadoop.
Ans: Apache Spark
2. "Hello World" of Big Data programming.
Ans: Word Count Problem
3. A programming paradigm that is the heart of Apache Hadoop.
Ans: MapReduce
4. Split, Map, Shuffle and Reduce - Which of these MapReduce stages is the costliest?
Ans: Shuffle
5. Python, R, Scala and Java - Which one of these Spark-supported languages is Apache Spark itself written in?
Ans: Scala
6. Basic abstraction that is used to represent incoming data in Apache Spark.
Ans: RDD (Resilient Distributed Dataset)
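
Questions 2 to 4 fit together nicely, so here is a rough sketch of the Word Count problem written as four explicit stages: Split, Map, Shuffle and Reduce. This is plain Python running in a single process, not Hadoop or Spark code, and the function names (split, map_chunk, shuffle, reduce_groups, word_count) are made up for illustration. In a real cluster the shuffle step is the one that moves data between machines over the network, which is why it tends to be the costliest; here it is only an in-memory grouping.

```python
from collections import defaultdict


def split(text, num_chunks):
    """Split: divide the input lines into num_chunks chunks (round-robin)."""
    lines = text.splitlines()
    return [lines[i::num_chunks] for i in range(num_chunks)]


def map_chunk(lines):
    """Map: emit a (word, 1) pair for every word in one chunk."""
    return [(word.lower(), 1) for line in lines for word in line.split()]


def shuffle(mapped_chunks):
    """Shuffle: bring all values for the same key together, across chunks.

    On a cluster this is where records cross the network; here it is
    simulated with a dictionary in memory.
    """
    groups = defaultdict(list)
    for pairs in mapped_chunks:
        for word, count in pairs:
            groups[word].append(count)
    return groups


def reduce_groups(groups):
    """Reduce: collapse each key's list of values into a single total."""
    return {word: sum(counts) for word, counts in groups.items()}


def word_count(text, num_chunks=2):
    """Run the four stages in order and return a word -> count dictionary."""
    chunks = split(text, num_chunks)
    mapped = [map_chunk(chunk) for chunk in chunks]
    grouped = shuffle(mapped)
    return reduce_groups(grouped)


if __name__ == "__main__":
    sample = "spark is fast\nhadoop is reliable\nspark is popular"
    print(word_count(sample))
```

Note that every (word, 1) pair passes through shuffle before anything is summed. Real frameworks often cut this cost by pre-aggregating on the map side, which this sketch leaves out to keep the stages separate.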