DEPARTMENT OF COMPUTER ENGINEERING
UNIT TEST - I (AY 2024-25)
Class: BE - A/B/C Time: 2:00 pm to 3:30 pm Semester: VII
Date: 20/08/2024 Subject: Big Data Analytics Max marks: 40
Note the following instructions:
1. Attempt all questions.
2. Draw neat diagrams wherever necessary.
3. Write everything in ink only (no pencil).
4. Assume suitable data where it is missing, with justification.
_________________________________________________________________________________
Q1 Attempt any two Marks CO BL
a) Compare big data analytics with traditional data mining. [05] CO1 L2
b) Explain Hadoop ecosystem with core components [05] CO1 L2
c) What is the difference between structured, semi-structured and [05] CO1 L2
unstructured data?
d) Describe any five characteristics of Big Data. [05] CO1 L2
Q2
a) Demonstrate, with the help of pseudocode, matrix-vector multiplication [10] CO2 L3
using MapReduce.
| 5 6 | * | 1 2 |
| 7 8 |   | 3 4 |
OR
b) Write MapReduce pseudocode for the word count problem. Show the [10] CO2 L3
working of MapReduce on the following document: “This is a school.
Students study in this school.”
AND
c) Outline the architecture of HDFS in brief. [05] CO2 L2
OR
d) Demonstrate the MapReduce execution pipeline with suitable examples. [05] CO2 L2
Q3
a) Demonstrate how business problems have been solved faster, cheaper [10] CO3 L2
and more effectively, with reference to the NoSQL case study of Google’s
MapReduce. Also illustrate the business drivers and the findings of the case study.
OR
b) Explain the different architectural patterns in NoSQL. [10] CO3 L2
AND
c) Describe the CAP theorem for NoSQL databases. Can NoSQL be adopted for [05] CO3 L2
traditional banking applications? Justify your answer.
OR
d) Explain what you understand by BASE properties in NoSQL databases. [05] CO3 L2
Discuss the situations where the BASE transaction model can be applied.