Types of Digital Data
➢ The amount of digital data generated every day, by individuals as well as by
organizations, is immense. It can become so vast and complex that traditional
data processing applications are no longer sufficient, either to store the data
or to extract insights that drive meaningful business results.
➢ Data analytics technologies and techniques provide a means of analyzing data
sets and drawing conclusions about them to make informed business decisions.
➢ Big data analytics applications enable data scientists, predictive modelers,
statisticians and other analytics professionals to analyze growing volumes of
structured transaction data, plus other forms of data that are often left untapped
by conventional business intelligence (BI) and analytics programs.
➢ These other forms include a mix of semi-structured and unstructured data, for
example internet clickstream data, web server logs, social media content, text from
customer emails and survey responses, mobile phone records and machine data
captured by sensors connected to the Internet of Things.
Pawan Kumar Singh, AP, CSE, GLBITM, Gr Noida 1
• Structured
• Structured data is one of the types of big data. By structured data, we mean data
that can be stored, processed, and retrieved in a fixed format.
• It refers to highly organized information that can be readily stored in a
database and accessed with simple queries or search algorithms.
• For instance, the employee table in a company database is structured: the
employee details, their job positions, their salaries, etc., are stored in
an organized manner.
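The employee table mentioned above can be sketched with Python's built-in sqlite3 module. This is only an illustration of a fixed format: the table name, column names, and sample rows are assumptions made for this example, not data from any real company database.

```python
import sqlite3


def build_employee_table():
    """Create an in-memory database with a fixed-format employee table."""
    conn = sqlite3.connect(":memory:")
    # Every row must fit this schema: that is what makes the data structured.
    conn.execute(
        "CREATE TABLE employee ("
        " id INTEGER PRIMARY KEY,"
        " name TEXT NOT NULL,"
        " position TEXT NOT NULL,"
        " salary REAL NOT NULL)"
    )
    # Hypothetical sample rows, invented for illustration.
    conn.executemany(
        "INSERT INTO employee (name, position, salary) VALUES (?, ?, ?)",
        [
            ("Asha", "Engineer", 75000),
            ("Ravi", "Analyst", 52000),
            ("Meera", "Manager", 90000),
        ],
    )
    conn.commit()
    return conn


def employees_earning_at_least(conn, threshold):
    """Return the names of employees whose salary is >= threshold, sorted."""
    rows = conn.execute(
        "SELECT name FROM employee WHERE salary >= ? ORDER BY name",
        (threshold,),
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = build_employee_table()
    print(employees_earning_at_least(conn, 60000))
```

Because every row follows the same schema, a one-line query is enough to retrieve exactly the records of interest.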
• Unstructured
• Unstructured data refers to data that lacks any specific form or
structure. This makes it difficult and time-consuming to process and
analyze. The free-text body of an email is an example of unstructured
data. Along with structured data, it is one of the important types of
big data.
• Semi-structured
• Semi-structured data is the third type of big data. It combines
aspects of both formats mentioned above, that is, structured and
unstructured data.
• To be precise, it refers to data that is not organized under a fixed
schema in a repository such as a relational database, yet contains
tags or markers that separate individual elements within the data.
XML and JSON documents are common examples.
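A small sketch may help show what "tags that separate individual elements" means in practice. It uses JSON and Python's standard json module; the records and the helper names (field_names, city_of) are hypothetical, chosen only for this illustration.

```python
import json

# Hypothetical records: each one is tagged with keys, but no two records
# share exactly the same set of fields, so there is no fixed schema.
RAW = """[
  {"id": 1, "name": "Asha", "email": "asha@example.com"},
  {"id": 2, "name": "Ravi", "phones": ["555-0100", "555-0101"]},
  {"id": 3, "name": "Meera", "address": {"city": "Noida"}}
]"""


def field_names(raw):
    """Collect every key that appears in at least one record, sorted."""
    records = json.loads(raw)
    return sorted({key for record in records for key in record})


def city_of(record):
    """Return the city if the record has an address, otherwise None."""
    return record.get("address", {}).get("city")


if __name__ == "__main__":
    print(field_names(RAW))
    print([city_of(record) for record in json.loads(RAW)])
```

The keys let a program pick out individual elements, while the missing and nested fields show why such data does not fit a fixed table without extra work.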
Online transaction processing (OLTP) is database software designed to support transaction-oriented
applications, including those on the Internet. OLTP database systems are commonly used for order entry,
financial transactions, customer relationship management and retail sales via the Internet.
OLAP is an acronym for Online Analytical Processing. OLAP performs multidimensional analysis of
business data and provides the capability for complex calculations, trend analysis, and
sophisticated data modeling.
OLTP systems usually store transactional data in a normalized entity-relationship schema
(usually 3NF), whereas OLAP systems usually store data in a star schema.
Because OLTP databases hold live operational data, they are backed up regularly. OLAP data is
typically backed up less often, since it can usually be reloaded from the source systems.
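The contrast between the two workloads can be sketched with a tiny star schema in sqlite3. All table names, column names, and figures below are assumptions made for illustration. For brevity the OLTP-style write goes straight into the fact table, whereas in practice transactions would usually land in a separate normalized OLTP database and be loaded into the analytical store later.

```python
import sqlite3


def build_star_schema():
    """Create a minimal star schema: one fact table and two dimension tables."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE dim_product (
            product_id INTEGER PRIMARY KEY,
            category   TEXT NOT NULL
        );
        CREATE TABLE dim_date (
            date_id INTEGER PRIMARY KEY,
            year    INTEGER NOT NULL,
            quarter INTEGER NOT NULL
        );
        CREATE TABLE fact_sales (
            product_id INTEGER NOT NULL REFERENCES dim_product(product_id),
            date_id    INTEGER NOT NULL REFERENCES dim_date(date_id),
            amount     REAL NOT NULL
        );
        """
    )
    # Hypothetical sample data.
    conn.executemany("INSERT INTO dim_product VALUES (?, ?)",
                     [(1, "Books"), (2, "Toys")])
    conn.executemany("INSERT INTO dim_date VALUES (?, ?, ?)",
                     [(1, 2023, 1), (2, 2023, 2)])
    conn.executemany("INSERT INTO fact_sales VALUES (?, ?, ?)",
                     [(1, 1, 100.0), (1, 2, 150.0), (2, 1, 80.0), (2, 2, 20.0)])
    conn.commit()
    return conn


def record_sale(conn, product_id, date_id, amount):
    """OLTP-style operation: a short transaction that writes a single row."""
    with conn:  # commits on success, rolls back if an exception is raised
        conn.execute("INSERT INTO fact_sales VALUES (?, ?, ?)",
                     (product_id, date_id, amount))


def sales_by_category_and_quarter(conn):
    """OLAP-style operation: aggregate many rows along two dimensions."""
    query = """
        SELECT p.category, d.quarter, SUM(f.amount)
        FROM fact_sales AS f
        JOIN dim_product AS p ON p.product_id = f.product_id
        JOIN dim_date AS d ON d.date_id = f.date_id
        GROUP BY p.category, d.quarter
        ORDER BY p.category, d.quarter
    """
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    conn = build_star_schema()
    record_sale(conn, 2, 2, 30.0)
    for row in sales_by_category_and_quarter(conn):
        print(row)
```

The write touches one row and must complete quickly, while the analytical query scans the whole fact table and groups it by dimension attributes, which is the access pattern a star schema is meant to serve.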