0% found this document useful (0 votes)

24 views2 pages

Big Data Analytics Exam Questions

The document outlines the Continuous Assessment Test for Big Data Analytics, detailing the structure, including Part A and Part B questions focused on key concepts like Hadoop, NoSQL databases, and case studies related to smart cities and customer retention. It emphasizes the importance of understanding big data's 4Vs and the application of various database models in real-world scenarios. The test aims to assess students' comprehension of big data concepts and their practical implications in different industries.

Uploaded by

Ravi Prakash

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views2 pages

Big Data Analytics Exam Questions

Uploaded by

Ravi Prakash

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Reg. No.

Continuous Assessment Test- I [CAT - I]

Year : III
Semester : 06
Branch :B.E/[Link].- CSE/CSBS/AI&DS
Sub. Code : CCS334
Subject Name : BIG DATA ANALYTICS
QP Code : 226507

[Regulations 2021]

Date: 14-02-25 Time: 90 Min Marks: 50

Answer ALL Questions
Part A [6 x 2 = 12 Marks]
1.1 Explain the role of Hadoop in big data processing. [A2] CO1
1.2 Propose a scenario where the analysis of unstructured data such as social media [B2] CO1
posts could benefit a business.
1.3 Define Big Data. [A1] CO1
1.4 Explain the convergence of key trends that have contributed to the growth of [A2] CO1
Big Data.
1.5 Compare and contrast traditional data and unstructured data. [B2] CO1
1.6 Can you recall an industry that extensively uses big data? [A1] CO1
Part B [ 1x13=13 Marks]
1.7 a Case Study: Big Data Analytics in a Smart City [C1] CO1
Scenario: A Smart City initiative is collecting data from various sources,
such as traffic sensors, public transportation systems, environmental
monitoring stations, and social media platforms. This data is being used to
optimize traffic management, improve public health, enhance city services,
and support better decision-making by city planners.
Question:
How can the 4Vs of Big Data (Volume, Velocity, Variety, and Veracity) be
effectively managed and utilized in a Smart City project to improve urban
living conditions?
[OR]

b Case Study: Improving Customer Retention with Big Data Analytics [C1] CO1
A retail company has been experiencing a decline in customer retention over
the past year. The company collects vast amounts of data from its online and
in-store interactions with customers. This includes purchase histories,
website browsing behavior, social media interactions, and feedback surveys.
Question:
How can Big Data Analytics be used to identify patterns and insights from
this vast array of customer data to improve customer retention? Outline the
steps, tools, and techniques you would apply in analyzing the data.
Additionally, discuss potential challenges the company may face when
working with this data, and propose solutions for overcoming them.
Part A [6 x 2 = 12 Marks]
2.1 Explain the concept of NoSQL databases and highlight the main reasons behind [B1] CO2
their emergence in the field of data management.
2.2 Compare and contrast the key-value data model and the document data model [A2] CO2
used in NoSQL databases. Provide examples of scenarios where each model
might be more suitable.
2.3 Assess the advantages and disadvantages of schemaless databases. [B1] CO2
2.4 Discuss the concept of graph database? [B2] CO2
2.5 Differentiate between schema less databases and traditional relational databases. [A2] CO2
2.6 Describe the types of data and relationships that would benefit from Graph [A1] CO2
database model.
Part B [1x13=13 Marks]
2.7 a Case Study: Implementing a Graph-Based Database in Healthcare for [C1] CO2
Disease Diagnosis and Treatment
Scenario: A Healthcare Organization is looking to enhance its capabilities
for disease diagnosis, patient care, and treatment planning by using a graph-
based database to analyze complex relationships between patients, medical
conditions, symptoms, treatments, and healthcare providers. This system
aims to help in identifying potential disease patterns, providing personalized
treatment recommendations, and improving clinical decision-making.
Question:
How can a graph-based database be used in healthcare to improve disease
diagnosis, personalize treatment plans, and enhance medical research?
[OR]
b Case Study: Choosing the Right NoSQL Database for a Social Media [C1] CO2
Platform

A startup is building a new social media platform aimed at connecting users

globally. The platform must handle massive amounts of data, including user
profiles, real-time posts, comments, likes, multimedia uploads, and activity
logs. The company is considering implementing a NoSQL database solution
due to the scalability needs and diverse data types.

The startup's requirements are as follows:

1. User Profile Data: Highly structured data with a focus on
relationships between users (e.g., friends, followers, connections).
2. Real-Time Posts and Comments: Unstructured, high-volume data
with frequent reads and writes.
3. Multimedia Files: Large binary data (e.g., images, videos) stored and
retrieved.
4. Activity Logs: Time-series data generated by user interactions (e.g.,
clicks, views, shares).
Question:
Given the startup's diverse data needs, which types of NoSQL databases
would you recommend for each of the four requirements? Explain why you
would choose the specific NoSQL database type (Document Store, Key-
Value Store, Column-Family Store, or Graph Database) for each use case,
and how the features of these databases align with the company’s needs for
scalability, performance, and flexibility.

Common questions

By analyzing unstructured data, such as social media posts, businesses can gain insights into consumer sentiments, preferences, and trends in real-time. This can help in tailoring marketing strategies, improving customer engagement, and rapidly responding to market changes or customer feedback. Additionally, such analysis can identify potential areas for product improvement and innovation, ultimately leading to enhanced customer satisfaction and competitive advantage .

The emergence of NoSQL databases is primarily driven by the need to handle large volumes of unstructured data, scalability requirements, and the need for faster real-time data processing. Unlike traditional relational databases which are schema-bound and usually less scalable, NoSQL databases are designed for flexible schema, horizontal scaling, and are optimized for specific types of data models such as key-value pairs, documents, or graph structures. This makes NoSQL databases more suitable for modern applications that require high performance and agility in handling diverse data types .

In a Smart City project, managing Volume involves using scalable storage solutions and distributed systems like Hadoop to handle the vast amounts of data collected from multiple sources. Velocity is addressed by implementing real-time data processing frameworks, enabling timely decision-making and interventions in urban management. Variety is managed through integrated data platforms that can analyze both structured and unstructured data, facilitating a holistic view of the urban environment. Ensuring Veracity involves deploying robust data quality checks and integrating trustworthy data sources to enhance the reliability of insights generated, thereby improving urban planning and services .

Graph databases are ideal for healthcare systems because they naturally represent and analyze complex networks and relationships between various entities such as patients, treatments, and symptoms. They provide a flexible data model enabling seamless integration and exploration of related data points, essential for identifying disease patterns and correlations. This insight supports personalized treatment plans and enhanced clinical decision-making, thereby improving patient outcomes and healthcare services efficiency .

Key trends contributing to Big Data growth include the exponential increase in data generated from digital activities, advancements in storage technologies, improved data processing capabilities, and the proliferation of Internet of Things (IoT) devices. The convergence of these trends has revolutionized data management practices by enabling efficient storage, real-time data processing, and the seamless integration of diverse data types. As a result, organizations are now able to utilize big data analytics to drive strategic decision-making, operational efficiencies, and customer personalization .

Schemaless databases, like those used in NoSQL systems, offer flexibility in handling data by allowing each document to have a potentially different structure, which is advantageous in environments where data models frequently change. They are well-suited for managing large-scale, varied datasets. However, challenges include increased complexity in data integrity maintenance and query performance, as the absence of a fixed schema may lead to difficulties in ensuring consistency and efficient data retrieval. Thus, careful design and indexing strategies are necessary to overcome these challenges .

Graph-based databases offer numerous benefits in healthcare, including improved data interoperability, enhanced ability to model complex relationships, and efficient querying of connected data, facilitating better disease diagnosis and personalized treatment recommendations. They support comprehensive analysis of patterns and trends in patient data which can improve clinical decisions. However, potential risks include data privacy concerns due to handling sensitive patient information, increased complexity in database management, and the need for specialized skills to implement and maintain the system effectively .

Hadoop is an open-source framework that enables the processing and storage of large datasets in a distributed computing environment. Its key components include Hadoop Distributed File System (HDFS) for storing data across multiple machines, and MapReduce for processing data in parallel across clusters. HDFS provides scalability and fault tolerance by distributing data across multiple nodes, while MapReduce simplifies the task of processing large volumes of data by breaking it down into smaller tasks that can be executed in parallel. These features effectively handle the challenges posed by big data, such as volume and variety .

Traditional structured data is organized in fixed schemas, typically in tables with rows and columns, making it straightforward to store, query, and analyze using relational database systems. Unstructured data, on the other hand, lacks a consistent format or schema, encompassing varied content like text, images, video, and more. Handling them differently is crucial in big data analytics because structured data lends itself to straightforward querying and analysis, whereas unstructured data requires more advanced tools and techniques for processing, such as natural language processing and image recognition, to extract actionable insights .

For user profile data, which is highly structured and relational, a Graph Database would be suitable for efficiently managing relationships such as friends and followers. Real-time posts and comments, which are high-volume and require quick reads and writes, can benefit from a Document Store like MongoDB, supporting unstructured data with high read/write throughput. Multimedia files require a Key-Value Store such as Amazon S3 for scalable storage and fast retrieval of large binary data. Activity logs, being time-series data, find an ideal match in a Column-Family Store like Cassandra, which optimizes for sequential data ingestion and retrieval .

231cse917t Fundamentals of Big Data Analytics Final
No ratings yet
231cse917t Fundamentals of Big Data Analytics Final
26 pages
Big Data Analytics and NoSQL Insights
No ratings yet
Big Data Analytics and NoSQL Insights
13 pages
Big Data Analytics Exam Questions 2025
No ratings yet
Big Data Analytics Exam Questions 2025
4 pages
Understanding Big Data and NoSQL Concepts
No ratings yet
Understanding Big Data and NoSQL Concepts
5 pages
Bda Question Bank Ay 25-26
No ratings yet
Bda Question Bank Ay 25-26
7 pages
Understanding Big Data vs. Small Data
No ratings yet
Understanding Big Data vs. Small Data
22 pages
Big Data and Analytics Question Bank
No ratings yet
Big Data and Analytics Question Bank
5 pages
Big Data Concepts and Applications Explained
No ratings yet
Big Data Concepts and Applications Explained
50 pages
Big Data Analytics Overview and Applications
No ratings yet
Big Data Analytics Overview and Applications
8 pages
Big Data Concepts and Applications Guide
No ratings yet
Big Data Concepts and Applications Guide
17 pages
Big Data Analytics: Key Concepts & Applications
No ratings yet
Big Data Analytics: Key Concepts & Applications
10 pages
BDA QB
No ratings yet
BDA QB
19 pages
Big Data Analytics Course Overview
No ratings yet
Big Data Analytics Course Overview
2 pages
Bda Unit 1 QB
No ratings yet
Bda Unit 1 QB
9 pages
Pyq Paper Solution Bda
No ratings yet
Pyq Paper Solution Bda
44 pages
MapReduce and SQL in Big Data Analytics
No ratings yet
MapReduce and SQL in Big Data Analytics
13 pages
Big Data Analytics Course Details
No ratings yet
Big Data Analytics Course Details
4 pages
Big Data Concepts and Applications
No ratings yet
Big Data Concepts and Applications
11 pages
Big Data Analytics Overview and Applications
No ratings yet
Big Data Analytics Overview and Applications
2 pages
Big Data Analytics Question Bank
No ratings yet
Big Data Analytics Question Bank
2 pages
Big Data Analytics Exam Paper
No ratings yet
Big Data Analytics Exam Paper
38 pages
Understanding Unified Storage Concepts
No ratings yet
Understanding Unified Storage Concepts
27 pages
Understanding Unified Storage in Data Science
No ratings yet
Understanding Unified Storage in Data Science
27 pages
Big Data Analytics Exam Paper Set B
No ratings yet
Big Data Analytics Exam Paper Set B
2 pages
Big Data Analytics Assignment Overview
No ratings yet
Big Data Analytics Assignment Overview
1 page
Big Data Analytics Question Bank 2024
No ratings yet
Big Data Analytics Question Bank 2024
5 pages
Big Data Characteristics and Technologies
No ratings yet
Big Data Characteristics and Technologies
18 pages
Data Analytics Practice Questions
No ratings yet
Data Analytics Practice Questions
5 pages
Big Data Analytics Overview and Concepts
No ratings yet
Big Data Analytics Overview and Concepts
8 pages
Big Data Exam Answers Guide
No ratings yet
Big Data Exam Answers Guide
5 pages
Question Bank Big Data
No ratings yet
Question Bank Big Data
7 pages
Big Data and NoSQL Management Overview
No ratings yet
Big Data and NoSQL Management Overview
4 pages
Big Data Analytics Model Question Paper
No ratings yet
Big Data Analytics Model Question Paper
6 pages
Big Data Question Bank for B.Tech CSE
No ratings yet
Big Data Question Bank for B.Tech CSE
2 pages
Big Data Analytics Question Bank
No ratings yet
Big Data Analytics Question Bank
10 pages
Big Data Analytics Question Bank 2024
No ratings yet
Big Data Analytics Question Bank 2024
13 pages
Big Data Analytics Model Test II
No ratings yet
Big Data Analytics Model Test II
1 page
Big Data Analytics Mid Term Exam 2025
No ratings yet
Big Data Analytics Mid Term Exam 2025
4 pages
Big Data Analytics Question Bank CSE
No ratings yet
Big Data Analytics Question Bank CSE
10 pages
Wa0001
No ratings yet
Wa0001
5 pages
BD 2024
No ratings yet
BD 2024
3 pages
Big Data Analytics Question Bank
No ratings yet
Big Data Analytics Question Bank
3 pages
Big Data Analytics Exam Model Questions
100% (1)
Big Data Analytics Exam Model Questions
6 pages
Understanding Structured Data in Big Data
No ratings yet
Understanding Structured Data in Big Data
3 pages
CSE704 Data Analytics Question Bank
No ratings yet
CSE704 Data Analytics Question Bank
4 pages
CCS334 Big Data Analytics Question Bank
No ratings yet
CCS334 Big Data Analytics Question Bank
12 pages
Pmc304 Big Data Analytics
No ratings yet
Pmc304 Big Data Analytics
12 pages
Bda Iat - 3
No ratings yet
Bda Iat - 3
2 pages
Big Data Analytics Question Bank
No ratings yet
Big Data Analytics Question Bank
17 pages
Big Data Acquisition Question Bank
No ratings yet
Big Data Acquisition Question Bank
2 pages
Big Data Analytics Overview and Insights
No ratings yet
Big Data Analytics Overview and Insights
54 pages
Big Data Analytics MCQ Question Bank
No ratings yet
Big Data Analytics MCQ Question Bank
22 pages
Big Data Analytics Exam Paper 2023-24
No ratings yet
Big Data Analytics Exam Paper 2023-24
2 pages
Big Data Analytics Assignments (3170722)
No ratings yet
Big Data Analytics Assignments (3170722)
7 pages
Bda 2024
No ratings yet
Bda 2024
3 pages
Big Data Analytics Assignment Overview
No ratings yet
Big Data Analytics Assignment Overview
4 pages
Question Bank Bdal Iso Format - Updated
No ratings yet
Question Bank Bdal Iso Format - Updated
11 pages
Big Data Analytics Exam Questions
No ratings yet
Big Data Analytics Exam Questions
3 pages
Placement Preparation Question Paper
No ratings yet
Placement Preparation Question Paper
16 pages
Python Interview Questions Overview
No ratings yet
Python Interview Questions Overview
26 pages
Full-Stack Developer Interview Insights
No ratings yet
Full-Stack Developer Interview Insights
6 pages
CCS356 Object-Oriented Software Engineering
No ratings yet
CCS356 Object-Oriented Software Engineering
11 pages
Data Analysis and Visualization Techniques
No ratings yet
Data Analysis and Visualization Techniques
38 pages
Introduction to Network Communication
No ratings yet
Introduction to Network Communication
42 pages
Network Layer: Packet Switching & IP Addressing
No ratings yet
Network Layer: Packet Switching & IP Addressing
30 pages
Komodo Dragon Population Demographics
No ratings yet
Komodo Dragon Population Demographics
15 pages
Theories of Physical Development in Early Childhood
No ratings yet
Theories of Physical Development in Early Childhood
15 pages
ENI Myanmar Human Rights Report
No ratings yet
ENI Myanmar Human Rights Report
65 pages
Biomarker Screening Test Form
No ratings yet
Biomarker Screening Test Form
1 page
Calcium's Role in Animal Behavior
100% (14)
Calcium's Role in Animal Behavior
16 pages
Class 10 Science: Asexual Reproduction Guide
No ratings yet
Class 10 Science: Asexual Reproduction Guide
4 pages
Metrology Unit1 Transducers FULL DETAILED
No ratings yet
Metrology Unit1 Transducers FULL DETAILED
3 pages
IGCSE Science Revision Checklist Guide
No ratings yet
IGCSE Science Revision Checklist Guide
12 pages
Understanding Soft Skills and Their Importance
No ratings yet
Understanding Soft Skills and Their Importance
10 pages
Neem Pesticide Guide for USAID Partners
No ratings yet
Neem Pesticide Guide for USAID Partners
10 pages
TOFD For Weld Root Corrosion and Erosion
100% (1)
TOFD For Weld Root Corrosion and Erosion
7 pages
Harter Ralph 1956 India
No ratings yet
Harter Ralph 1956 India
124 pages
OOS Investigation SOP for Quality Control
No ratings yet
OOS Investigation SOP for Quality Control
28 pages
PHDI and HDI Rankings Overview
No ratings yet
PHDI and HDI Rankings Overview
4 pages
Black and White Photography Essentials
No ratings yet
Black and White Photography Essentials
32 pages
Science Questions for 10th Grade Students
No ratings yet
Science Questions for 10th Grade Students
12 pages
Culinary Vegetable Identification Quiz
No ratings yet
Culinary Vegetable Identification Quiz
5 pages
Rate Confirmation for Freight Shipment
No ratings yet
Rate Confirmation for Freight Shipment
1 page
Karnataka HC: Marriage Not a Rape License
No ratings yet
Karnataka HC: Marriage Not a Rape License
4 pages
Thermal Effects On Human Performance in of Ce Environment
No ratings yet
Thermal Effects On Human Performance in of Ce Environment
6 pages
Engaging Picture Interpretation Activities
No ratings yet
Engaging Picture Interpretation Activities
15 pages
Assa Abloy - SL500 - Eng
No ratings yet
Assa Abloy - SL500 - Eng
34 pages
Lupin Limited Nagpur Maharastra India 9.8-9.16.25 483 Redacted
No ratings yet
Lupin Limited Nagpur Maharastra India 9.8-9.16.25 483 Redacted
8 pages
Indian & Sri Lankan Chef Resume
No ratings yet
Indian & Sri Lankan Chef Resume
4 pages
IndiGo Flight Booking Confirmation
No ratings yet
IndiGo Flight Booking Confirmation
2 pages
Australian Healthcare Rights Charter
No ratings yet
Australian Healthcare Rights Charter
1 page
Health and Safety Study Material
No ratings yet
Health and Safety Study Material
121 pages
Overview of Performance Management Systems
No ratings yet
Overview of Performance Management Systems
4 pages
Install and Explore Python Libraries
No ratings yet
Install and Explore Python Libraries
67 pages
Deep Well Dewatering System Overview
100% (3)
Deep Well Dewatering System Overview
36 pages

Big Data Analytics Exam Questions

Uploaded by

Big Data Analytics Exam Questions

Uploaded by

Reg. No.

Continuous Assessment Test- I [CAT - I]

Date: 14-02-25 Time: 90 Min Marks: 50

A startup is building a new social media platform aimed at connecting users

The startup's requirements are as follows:

Common questions

In what ways can the analysis of unstructured data, like social media posts, offer a competitive advantage to businesses?

What are the main reasons behind the emergence of NoSQL databases, and how do they compare to traditional relational databases?

How can the 4Vs of Big Data (Volume, Velocity, Variety, Veracity) be managed in a Smart City project to improve urban living?

Why are graph databases preferable for analyzing complex relationships in healthcare systems?

What are the key trends that have contributed to the rapid growth of Big Data, and how has their convergence influenced data management practices?

What advantages do schemaless databases offer, and what challenges may arise from their use?

What are the benefits and potential risks associated with implementing a graph-based database for disease diagnosis in healthcare?

How does the Hadoop framework facilitate big data processing, and what are its key components?

How do traditional structured data and unstructured data differ, and why is it important to handle them differently in big data analytics?

How would you recommend a NoSQL database architecture for different types of data requirements in a social media platform?

You might also like