0% found this document useful (0 votes)
6 views21 pages

Data Analytics: Types and Importance

The document provides an overview of data, its types (structured, unstructured, and semi-structured), and the concept of data analytics, including its importance and applications across various industries. It outlines the steps involved in data analytics, such as data collection, cleaning, transformation, validation, analysis, and visualization, as well as different types of data analytics like descriptive, predictive, prescriptive, and diagnostic analysis. Additionally, it highlights the significance of data analytics in enhancing decision-making, operational efficiency, customer experience, risk management, and competitive advantage.

Uploaded by

priya
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views21 pages

Data Analytics: Types and Importance

The document provides an overview of data, its types (structured, unstructured, and semi-structured), and the concept of data analytics, including its importance and applications across various industries. It outlines the steps involved in data analytics, such as data collection, cleaning, transformation, validation, analysis, and visualization, as well as different types of data analytics like descriptive, predictive, prescriptive, and diagnostic analysis. Additionally, it highlights the significance of data analytics in enhancing decision-making, operational efficiency, customer experience, risk management, and competitive advantage.

Uploaded by

priya
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd

UNIT-1

1. What is Data?
2. Types of Data
3. What is Data Analytics?
4. Types of Data Analytics
5. Importance of Data Analytics
6. Applications of Data Analytics

1. What is Data?
i. Data is raw unprocessed facts and figures.
ii. Data is processed by computational operations to
retrieve meaningful information.
iii. This information becomes helpful in Decision-making in
Organizations/Companies.

2. Types of Data
i. Structured Data
 Structured Data is stored in row and Columns
format.
 Each Column has Defined Data Type
 Mapping of Data(i.e. Relationship between data)
can be maintained by using various keys(e.g.
Primary key, Foreign Key).
 Data can be retrieved (accessed) easily as per your
need using Queries or (‘Find & Replace’) Function
in Spread sheet.
E.g. SQL, Spread sheet(Excel Files), CSV Files(Comma
Delimiter/Comma Separated Values)

Sr. No Roll No Course Subject


1 101 BCA-II Pyhon
2 102 BCA-II DAUS
3 103 BCA-II DBMS

Sql> Select * from Student where RollNo=103;


Output:

3 103 BCA-II DBMS

In structured Data, In Spread sheet various operations are


available like mean, average, frequency, etc to extract meaningful
information.

ii. Unstructured Data.


 It is typically raw data that can be of any type and
doesn't fit neatly into rows and columns like
structured data.
 It cannot be converted into Structured Data.

o E.g. Text data: Emails, social media posts(e.g. Instagram,


facebook), reviews, articles.
o Multimedia data: Images, videos, and audio files.
iii. Semi-Structured Data
 Semi-structured data is a combination of structured and
unstructured data.
 It doesn’t comes in row & column format.
 It often comes in formats that include tags or markers to
separate elements
 It is converted into Structured Data as per requirement.

E.g. XML,JSON, NoSQL

XML File
<note date="2008-01-10">
<to>Tove</to>
<from>Jani</from>
<heading>Reminder</heading>
<body>Don't forget me this weekend!</body>
</note>

XML File is used to Data Exchange between Systems.


3. What is Data Analytics?

Data analytics refers to a broad range of data-related activities and


concepts. It is a process for translating basic facts and figures into
meaningful information. The purpose is to help businesses make
better decisions and achieve greater success.
It processes data using various technologies such as Tableau
Public, Python, Excel, Google Analytics, SAS, Apache Spark and
others.
Steps involved in Data Analytics
 Data Collecting
 Data Cleaning
 Data Transformation
 Data Validation
 Data Analysis
 Data Visualization

[Link] Collecting:
Gather relevant data from various sources.
 Data Collection Methods
Primary Data Collection
 Surveys :(Structured Tool for Data Collection)

 Questionnaires

 Interviews (Structured, Semi-Structured, Unstructured)

 Focus Groups

 Observations

 Experiments

 Field Studies

Secondary Data Collection


 Reviewing Existing Datasets

 Reports

 Journals

 Books

 Database Access

 Literature Review
2. Data Cleaning
Removing Duplicates is used for Data Cleaning And
Handling missing values using
 Mean/Median/Mode Imputation
 Forward/Backward Fill (Time-Series Imputation)
 Manual Imputation
 Regression Imputation
 Find and Replace:
 Formulas:
 Row-wise Deletion
 Column-wise Deletion

3. Data Transformation
Transform Text Data into Columns by Delimited commas or
tabs

4. Data Validation
 Numbers(Check Input Data within specific range)
 Text(is Text Length valid)
 Date(is Date between specified valid Range)
 Drop – Down List(Displaying only Data entered in Drop-Down
List)
 Input Message
 Error Alert

5. Data Analysis
Statistical analysis in data analytics using the spreadsheet's Data
Analysis Toolpak involves using built-in tools to perform various statistical
calculations and generate reports. This add-in provides functions

Key Statistical Analysis Functions:


The Data Analysis Toolpak provides a wide range of statistical
analysis tools, including:
 Descriptive Statistics: Calculate summary statistics like mean,
median, mode, standard deviation, etc.
 Regression Analysis: Analyze the relationship between a dependent
variable and one or more independent variables.
 Correlation: Determine the strength and direction of the relationship
between two or more variables.
 ANOVA: Perform analysis of variance to compare means of two or
more samples.
 t-Test: Compare the means of two groups to see if they are
significantly different.
 Histograms: Visualize the distribution of data.
 Moving Average: Smooth out data fluctuations to identify trends.
 Exponential Smoothing: Similar to moving average but assigns more
weight to recent data.

How to use it:


1. Enable the Toolpak:
Go to File > Options > Add-ins > Manage Excel Add-ins > Go > Check
"Analysis ToolPak" and click OK.

2. Access the Toolpak:


On the Data tab, you'll find the Data Analysis button in the Analysis
group.
3. Select an Analysis Tool:
Choose the desired analysis tool from the list (e.g., Regression, ANOVA,
etc.).
4. Specify Input Range and Parameters:
Provide the input data and any required parameters for the analysis.
5. Choose Output Options:
Specify where you want the results to be displayed (e.g., new worksheet,
existing worksheet).
6. View Results:
The Toolpak will generate an output table and/or charts based on your
analysis.

6. Data Visualization

Data visualization is very common in your day to day life; they


often appear in the form of charts and graphs and Histograms. In other words,
data shown graphically so that it will be easier for the human brain to
understand and process it. Data visualization often used to discover unknown
facts and trends. By observing relationships and comparing datasets, you can
find a way to find out meaningful information.

Types of Data Analytics


Descriptive Analysis
It is used for documenting characteristics of Data. It analyses
complete data or a sample of summarized numerical data. It shows
mean and deviation for continuous data whereas percentage and
frequency for categorical data.

Predictive Analysis
Predictive Analysis shows “what is likely to happen” by using
previous data. The simplest data analysis example is like if last year I
bought two dresses based on my savings and if this year my salary is
increasing double then I can buy four dresses. But of course it’s not
easy like this because you have to think about other circumstances
like chances of prices of clothes is increased this year or maybe
instead of dresses you want to buy a new bike, or you need to buy a
house!

So here, this Analysis makes predictions about future outcomes


based on current or past data. Forecasting is just an estimate. Its
accuracy is based on how much detailed information you have and
how much you dig in it.

Prescriptive Analysis
Prescriptive Analysis combines the insight from all previous Analysis
to determine which action to take in a current problem or decision.
Most data-driven companies are utilizing Prescriptive Analysis
because predictive and descriptive Analysis is not enough to improve
data performance. Based on current situations and problems, they
analyze the data and make decisions.

Diagnostic Analysis
Once you know what happened, diagnostic analysis helps explain
why. Say a hospital notices more patients than usual. By looking
deeper into the data, you might find that many of them had the
same symptoms, helping you figure out the cause. This analysis
answers: Why did it happen?

Diagnostic Analysis shows “Why did it happen?” by finding the cause


from the insight found in Statistical Analysis. This Analysis is useful to
identify behaviour patterns of data. If a new problem arrives in your
business process, then you can look into this Analysis to find similar
patterns of that problem. And it may have chances to use similar
prescriptions for the new problems.

Importance of Data Analytics

Enhanced Decision-Making:

• Data analytics provides evidence-based insights, reducing


guesswork.
• It helps businesses understand customer behavior, market
trends, and operational performance.
• This allows for more accurate strategic planning and resource
allocation.

Improved Operational Efficiency:

 Data analytics can identify areas where processes can be


streamlined and costs can be reduced.

 It helps optimize inventory management, supply chain logistics,


and resource utilization.

 This leads to increased productivity and profitability.

Enhanced Customer Experience:


 Data analytics enables businesses to understand customer
preferences, personalize interactions, and tailor products or
services.

 It helps in identifying customer needs and addressing pain


points, leading to increased satisfaction and loyalty.

 Personalized marketing campaigns and targeted product


recommendations become possible.

Risk Management:

 Data analytics helps in identifying potential risks and


vulnerabilities early on.

 It allows businesses to develop proactive strategies to mitigate


risks and minimize potential losses.

 This could involve fraud detection, security breach prevention,


or supply chain disruptions.

Competitive Advantage:

 By leveraging data analytics, businesses can gain a deeper


understanding of their market and competitors.

 It helps in identifying emerging trends and opportunities,


enabling businesses to stay ahead of the competition.

 This can lead to innovation, new product development, and


increased market share.
 In essence, data analytics empowers businesses to make better
decisions, operate more efficiently, and ultimately achieve
sustainable growth and success in today's data-driven world

Applications of Data Analytics

Data Analytics is widely applied across various industries and sectors


to optimize processes, improve decision-making, and uncover
valuable insights. Below are some of the key applications of data
analytics, with examples and use cases in each domain.

1. Business and Marketing

Data analytics is heavily used in business and marketing to improve


strategies, understand consumer behavior, and drive revenue.

 Customer Segmentation: Analyzing purchasing patterns,


demographics, and behaviors to segment customers into
distinct groups. This helps in tailoring marketing efforts.
o Example:
 Amazon uses customer purchase data to create
personalized recommendations and targeted
advertising.
 Sentiment Analysis: Analyzing customer reviews, social media
posts, and feedback to gauge public sentiment about a product,
brand, or service.
o Example:
 Brandwatch and Hootsuite analyze social media
mentions to help companies understand how their
brand is perceived by consumers.
 Predictive Analytics: Predicting future sales, customer demand,
or market trends by analyzing historical data.
o Example:
 Netflix uses predictive analytics to suggest movies
and TV shows based on user preferences and
viewing history.
 Campaign Effectiveness: Evaluating the success of marketing
campaigns in real-time and adjusting strategies to maximize
ROI.
o Example:
 Google Ads uses data analytics to adjust bids and
optimize ad campaigns based on user behavior and
performance metrics.

2. Healthcare

Data analytics plays a transformative role in improving patient


outcomes, reducing costs, and advancing medical research.

 Predictive Healthcare: Predicting patient health outcomes,


potential diseases, or complications based on historical health
data and medical records.
o Example:
 Predictive algorithms help healthcare providers
identify at-risk patients and intervene early, such as
predicting sepsis in ICU patients.
 Personalized Medicine: Tailoring treatments and therapies
based on the genetic profile and health data of individual
patients.
o Example:
 Oncology treatments can be personalized by
analyzing cancer data and identifying the most
effective drug treatments for specific genetic
mutations.
 Drug Discovery: Accelerating drug development by analyzing
clinical trial data and identifying potential drug candidates.
o Example:
 DeepMind's AI has been used to predict protein
structures, which can significantly accelerate drug
discovery.
 Healthcare Operations Optimization: Improving hospital
management by optimizing resources, staffing, patient
scheduling, and reducing wait times.
o Example:
 Predictive models help hospitals manage patient
flow and reduce wait times in emergency
departments.

3. Financial Services and Banking

The financial industry relies on data analytics for risk management,


fraud detection, and improving customer services.

 Fraud Detection: Analyzing transaction patterns to detect


suspicious or fraudulent activity.
o Example:
 Credit card companies like Visa and Mastercard use
machine learning models to detect unusual
transactions and prevent fraud in real time.
 Credit Scoring: Using data analytics to evaluate an individual's
creditworthiness based on transaction history, credit behavior,
and financial activity.
o Example:
 FICO scores and alternative data sources are used
to determine whether a person is eligible for a loan.
 Risk Management: Identifying and mitigating financial risks by
analyzing market trends, asset performance, and customer
behavior.
o Example:
 Hedge funds use quantitative analysis to optimize
trading strategies and manage portfolio risks.
 Algorithmic Trading: Using complex algorithms to make high-
speed trading decisions based on data-driven insights.
o Example:
 Goldman Sachs and other investment banks use
data analytics for automated trading systems that
respond to market changes in real time.

4. Retail and E-commerce

Retailers and e-commerce platforms use data analytics to enhance


customer experience, optimize inventory, and drive sales.

 Inventory Management: Optimizing inventory levels by


predicting demand and adjusting stock accordingly.
o Example:
 Walmart uses predictive analytics to ensure that
products are stocked at optimal levels based on
customer demand and seasonality.
 Customer Behavior Analysis: Understanding shopping habits,
product preferences, and browsing patterns to personalize the
shopping experience.
o Example:
 Target uses customer purchase data to send
personalized offers to shoppers based on their
buying history.
 Dynamic Pricing: Adjusting product prices in real time based on
market conditions, demand, and competition.
o Example:
 Amazon uses dynamic pricing models to
automatically adjust prices for millions of products
based on competitor prices and market conditions.
 Supply Chain Optimization: Analyzing data from suppliers,
logistics, and transportation to streamline supply chain
operations and reduce costs.
o Example:
 Amazon’s fulfillment centers use advanced analytics
to track products, optimize warehouse operations,
and improve delivery speed.

5. Sports Analytics

Sports organizations and teams use data analytics to enhance


performance, optimize team strategy, and engage fans.

 Player Performance Analysis: Using data from wearables,


sensors, and match statistics to evaluate player performance
and prevent injuries.
o Example:
 Basketball teams like the Golden State Warriors use
player tracking data to analyze movement and
optimize player strategies.
 Game Strategy Optimization: Analyzing past games to identify
patterns, strengths, and weaknesses in both the team’s
performance and the opponent's strategy.
o Example:
 Football teams use analytics to adjust tactics,
optimize formations, and predict opposing team
strategies.
 Fan Engagement: Using data analytics to understand fan
preferences and create personalized experiences, offers, and
promotions.
o Example:
 MLB and other sports leagues use fan data to
improve stadium experiences and deliver targeted
advertisements.

6. Manufacturing and Supply Chain

In manufacturing, data analytics is used to improve production


efficiency, reduce downtime, and optimize the supply chain.

 Predictive Maintenance: Predicting when equipment or


machinery will fail, enabling timely repairs and minimizing
downtime.
o Example:
 General Electric uses sensor data and analytics to
monitor machinery in real-time, reducing downtime
and repair costs.
 Quality Control: Analyzing production data to identify defects
or inefficiencies in the manufacturing process and improve
product quality.
o Example:
 Toyota uses data analytics to optimize production
lines and reduce defects using Six Sigma
methodology.
 Supply Chain Optimization: Using data to track materials,
products, and shipments in real time, optimizing routes, and
improving delivery times.
o Example:
 UPS uses data analytics to optimize delivery routes,
saving millions in fuel costs and reducing delivery
times.

7. Government and Public Policy

Governments use data analytics to improve policy, enhance public


services, and ensure better governance.

 Crime Analysis: Analyzing patterns in crime data to identify


hotspots and allocate police resources effectively.
o Example:
 Predictive policing uses crime data to forecast
where and when crimes are likely to occur, helping
law enforcement prevent criminal activities.
 Traffic Management: Analyzing traffic data to optimize signal
timing, reduce congestion, and improve road safety.
o Example:
 Smart cities like Singapore use data from traffic
cameras, sensors, and GPS devices to optimize
traffic flow and reduce congestion.
 Public Health Surveillance: Using data analytics to track and
predict the spread of diseases, allocate resources, and manage
public health crises.
o Example:
 COVID-19 tracking dashboards used data analytics
to monitor the spread of the virus, forecast hospital
needs, and allocate vaccines efficiently.

8. Education
Data analytics in education improves teaching strategies, learning
outcomes, and institutional management.

 Student Performance Analysis: Analyzing student data to


identify learning patterns, predict outcomes, and offer
personalized interventions.
o Example:
 Khan Academy and other e-learning platforms use
data analytics to track student progress and
recommend customized learning resources.
 Curriculum Optimization: Using feedback, test scores, and
student performance data to refine curricula and teaching
methods.
o Example:
 Harvard University uses data analytics to analyze
student feedback and improve courses and teaching
methodologies.
 Dropout Prevention: Identifying students at risk of dropping
out and providing targeted support or interventions.
o Example:
 Georgia State University uses data to predict which
students are at risk of dropping out and proactively
engages them to ensure retention.

9. Energy and Utilities

Energy companies use data analytics to optimize energy production,


improve efficiency, and ensure sustainability.

 Smart Grids: Analyzing real-time data from smart meters and


sensors to optimize energy distribution, prevent outages, and
manage peak demand.
o Example:
 Pacific Gas and Electric uses data analytics to
monitor energy use and predict system failures
before they occur.
 Energy Consumption Optimization: Helping consumers or
businesses optimize energy usage based on real-time data.
o Example:
 Nest Thermostat uses analytics to learn user
preferences and adjust heating/cooling schedules to
reduce energy consumption.
 Renewable Energy Forecasting: Analyzing weather data and
historical patterns to predict energy output from renewable
sources (e.g., solar, wind).
o Example:
 Solar power plants use predictive analytics to
forecast energy generation based on weather
conditions.

10. Travel and Tourism

Data analytics in travel enhances customer experience, optimizes


pricing, and improves operational efficiency.

 Dynamic Pricing: Adjusting flight or hotel prices in real time


based on demand, competition, and availability.
o Example:
 Airlines use data analytics to adjust ticket prices
based on real-time demand and booking patterns.
 Personalized Travel Recommendations: Offering tailored travel
experiences based on customer preferences, past behavior,
and reviews.
o Example:
 Expedia uses data analytics to recommend flights,
hotels, and vacation packages based on customer
browsing and booking history.
 Operational Efficiency: Optimizing flight schedules, staffing,
and inventory based on historical and real-time data.
o Example:
 Delta Airlines uses data analytics to optimize crew
scheduling, reducing delays and improving customer
satisfaction.

Conclusion

Data analytics has a vast array of applications across different


industries and sectors. By extracting valuable insights from data,
businesses and organizations can improve decision-making, reduce
costs, optimize operations, and create more personalized
experiences for their customers.

The ability to leverage data effectively has become a competitive


advantage in almost every field, driving innovation, efficiency, and
growth.

Common questions

Powered by AI

Data analytics facilitates enhanced customer experiences in retail by enabling the understanding of customer behavior, supporting personalized interactions, and tailoring products or services accordingly . Analytics helps identify customer needs, inform personalized marketing campaigns, and optimize inventory management to ensure product availability, all of which lead to increased customer satisfaction and loyalty .

Predictive analysis is more complex as it involves forecasting future outcomes based on historical data, which requires sophisticated models to account for various potential scenarios and factors influencing future trends . This contrasts with descriptive analysis, which only summarizes past data to identify patterns or characteristics without forecasting . Developing accurate predictive models often involves complex algorithms and extensive data processing .

Data analytics optimizes supply chain operations by providing insights into material tracking, predicting demand, and improving inventory management through real-time data analysis . It helps identify inefficiencies, optimize delivery routes, and forecast demand, ultimately reducing costs and improving delivery speed . Companies like Amazon and UPS utilize analytics to streamline operations and enhance efficiency across their supply chains .

Data cleaning enhances data accuracy and reliability by removing duplicate entries, handling missing values through methods like mean/median/mode imputation or regression imputation, and deleting irrelevant rows or columns . Proper data cleaning ensures that the datasets used in analysis are consistent and error-free, which is essential for valid and precise analytics outcomes .

Data visualization techniques transform complex data sets into graphical representations such as charts and graphs, making it easier to identify and understand trends and patterns . These visual tools aid in discovering hidden relationships and anomalies within the data that might not be readily apparent through raw data analysis. This ability to visually analyze information supports better decision-making by providing clearer insights into the dynamics of the data .

Diagnostic analysis assists businesses in identifying the origins of recurring problems by examining data patterns and trends to uncover cause-and-effect relationships . It focuses on understanding why specific events occur by tracing back through historical data and identifying common factors or conditions linked to problems . This analysis provides insights that help businesses address root causes rather than symptoms, fostering long-term solutions .

Structured data is organized in rows and columns with predefined data types, allowing for easy access and manipulation using queries . Unstructured data lacks this format and includes text and multimedia, making it harder to process using traditional databases . Consequently, structured data can be analyzed using SQL and data analytics tools, while unstructured data requires more advanced methods like natural language processing or machine learning for analysis .

Semi-structured data contains elements of both structured and unstructured data, typically not organized into a tabular row-and-column format, but often containing tags or markers (e.g., XML, JSON). Unlike structured data with defined schemas, semi-structured data is more flexible but still needs parsing to extract specific elements for analysis, making its management challenging . The inherent lack of consistent structure complicates integration and processing using traditional data management tools .

Data validation ensures the integrity of data analytics by confirming the accuracy and quality of data through checks on data types, ranges, and formats . This process prevents errors in data entry and ensures meaningful results, as invalid data can skew analyses and lead to incorrect conclusions . Effective validation verifies that the data aligns with the requirements for specific analytical models, maintaining the credibility and reliability of findings .

Data analytics enhances risk management in financial services by analyzing market trends, transaction patterns, and financial behaviors to identify potential risks early . For instance, credit card companies use machine learning to detect and prevent fraudulent activities in real time by identifying unusual transactions . Hedge funds employ quantitative analysis to manage portfolio risks, optimizing trading strategies based on market analysis . These applications enable financial institutions to mitigate risks efficiently and safeguard their operations .

You might also like