TERM PAPER
ON
“DIFFERENT DATA MINING APPROACHES”
BY-:
Siddharth Jain
A2305216519
3CSE-8X
INTRODUCTION
Data mining is the process of examining data to find
similarities and patterns, which in turn make the
data more efficient to store and retrieve.
Its techniques have been known and practised for
many years, but they have developed considerably
since the emergence and establishment of big data.
KEY TECHNIQUES
A number of different techniques can be applied in data
mining. Each describes a different kind of mining operation
used to recover information from data.
The following techniques and examples illustrate how data
mining is built up:
ASSOCIATION
Association is a well-known, well-understood and probably the most
widely used data mining technique. A relation between different
items or different types of data is observed and identified to build a
pattern. For example, in a sports shop, a customer who buys a bat
is also likely to buy a ball. If the purchase data is studied, the bat
and the ball can be associated with each other to anticipate future
demand.
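The bat-and-ball example above can be made concrete with the two measures
commonly used for association rules, support and confidence. The sketch below
is only an illustration: the baskets are made-up data, and the helper names
support and confidence are assumptions introduced here, not part of any
particular library.

```python
# Hypothetical shopping baskets from a sports shop (made-up data).
baskets = [
    {"bat", "ball"},
    {"bat", "ball", "gloves"},
    {"bat"},
    {"ball", "helmet"},
    {"bat", "ball", "helmet"},
]

def support(itemset, baskets):
    """Fraction of baskets that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= basket for basket in baskets) / len(baskets)

def confidence(antecedent, consequent, baskets):
    """Estimate of how often the consequent appears in baskets that contain the antecedent."""
    both = support(set(antecedent) | set(consequent), baskets)
    return both / support(antecedent, baskets)

# How often bat and ball are bought together, and how often a bat buyer also buys a ball.
bat_ball_support = support({"bat", "ball"}, baskets)
bat_to_ball_confidence = confidence({"bat"}, {"ball"}, baskets)
print(bat_ball_support, bat_to_ball_confidence)
```

With this sample data, bat and ball appear together in three of the five
baskets, and about three quarters of the bat buyers also bought a ball.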
CLASSIFICATION
Classification builds a concept of the different types of items or
data by applying a set of constraints, so that each class groups
data that can be organised efficiently. For example, cars are
classified into types such as SUVs and sedans. A given car can
then be slotted into one of these categories by comparing it
against the constraints of each class.
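As a rough sketch of slotting a car into a class by comparing constraints,
the function below applies a few hand-written rules. The attributes
(seats, ground_clearance_mm, has_separate_boot) and the thresholds are
hypothetical and chosen only for illustration; in practice the constraints
would be derived from real data.

```python
def classify_car(seats, ground_clearance_mm, has_separate_boot):
    """Assign a car to a class by checking it against each class's constraints.

    The constraints here are illustrative assumptions, not real definitions.
    """
    # Hypothetical SUV constraint: high ground clearance and at least five seats.
    if ground_clearance_mm >= 180 and seats >= 5:
        return "SUV"
    # Hypothetical sedan constraint: a boot separate from the passenger cabin.
    if has_separate_boot:
        return "sedan"
    # Anything that matches no constraint falls into a catch-all class.
    return "other"

print(classify_car(seats=7, ground_clearance_mm=200, has_separate_boot=False))
print(classify_car(seats=5, ground_clearance_mm=160, has_separate_boot=True))
```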
CLUSTERING
By studying different constraints and classes, data can be grouped
together so that the groups can be identified and examined.
Commonly, clustering studies two or more attributes of the data to
observe the correlation between them. Clustering is important for
working through large amounts of data and examining the
similarities within it.
Fig 3 - The graph shows an example in which the purchasing power of customers is
compared with their age; clusters appear in different regions.
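One common way to find such groups is k-means clustering, which is used
here as an assumed example because the text does not name a specific
algorithm. The sketch below groups made-up (age, purchasing power) pairs
into two clusters. The data, the starting centroids and the function name
kmeans are all hypothetical.

```python
def kmeans(points, centroids, iterations=10):
    """Plain k-means on 2-D points: assign to nearest centroid, then recompute centroids."""
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            # Squared Euclidean distance from this point to each centroid.
            distances = [(p[0] - c[0]) ** 2 + (p[1] - c[1]) ** 2 for c in centroids]
            clusters[distances.index(min(distances))].append(p)
        # Move each centroid to the mean of its cluster; keep it if the cluster is empty.
        centroids = [
            (sum(p[0] for p in c) / len(c), sum(p[1] for p in c) / len(c)) if c else old
            for c, old in zip(clusters, centroids)
        ]
    return centroids, clusters

# Hypothetical customers as (age, purchasing power) pairs.
customers = [(22, 15), (25, 18), (24, 20), (45, 60), (48, 65), (50, 58)]
centroids, clusters = kmeans(customers, centroids=[customers[0], customers[3]])
print(clusters)
```

On this sample the younger, lower-spending customers end up in one cluster
and the older, higher-spending customers in the other. With real data the
two attributes would usually be scaled first so that neither dominates the
distance.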
DECISION TREES
Decision trees are related to many of the other tools and
techniques. They are used as part of the selection criteria, or to
select and use data from a structure. In a decision tree, each
question may have two answers. Each answer leads to a further
question, until the data can be slotted into a separate
category.
Fig 4 - A typical example of a
decision tree.
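The question-and-answer structure described above can be sketched as a
nested dictionary in which every question has a "yes" and a "no" branch.
The questions and categories below are hypothetical and do not reproduce
the tree in the figure.

```python
# Hypothetical tree: internal nodes ask a yes/no question, leaves are categories.
tree = {
    "question": "age_over_30",
    "yes": {
        "question": "high_income",
        "yes": "premium buyer",
        "no": "regular buyer",
    },
    "no": "budget buyer",
}

def decide(node, answers):
    """Follow the yes/no answers from the root until a leaf category is reached."""
    while isinstance(node, dict):
        branch = "yes" if answers[node["question"]] else "no"
        node = node[branch]
    return node

print(decide(tree, {"age_over_30": True, "high_income": False}))
```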
Data mining depends on a well-defined structure or model that can
retrieve the particular data the user wants. Data
implementation work exists to make data mining as effective and
efficient as possible.
The most important step is building and transforming the data,
more than retrieving it. This also involves the difficult process of
aggregating, expanding or detecting data so that the entered data
can be assembled.
CONCLUSION
Data mining is not only about running a command on the data
present in a database. The data must first be organised, whether
by structuring it or by building a model, using SQL or software
such as Hadoop. Getting the data into the format one needs
depends on the tools and techniques chosen. Once the data we
want to retrieve is available, any tool or technique can be
applied, regardless of the data type or structure.
THANK YOU