
AI Project Cycle Overview and Steps

The document outlines the AI project cycle, which includes stages such as problem scoping, data acquisition, data exploration, modeling, and evaluation. It details the processes involved in each stage, including methods for data collection, types of AI models, and evaluation techniques. Additionally, it explains concepts like supervised, unsupervised, and reinforcement learning, as well as the importance of neural networks in AI development.

Uploaded by

niti123p

Titiksha Public School

Unit-2(Part-B)
AI Project Cycle
(To be done in notebook)
What is a Project?
A project is a set of tasks that must be completed within a defined timeline to accomplish a
specific set of goals.

AI Project Cycle-
The AI project cycle is the sequence of steps involved in preparing an AI model or AI project. It
covers every step in creating a project, from the given problem until the project is built and
tested.

Stages/components of AI Project Cycle

1. Problem Scoping
2. Data Acquisition
3. Data Exploration
4. Data Modeling
5. Evaluation

Problem Scoping: It is the process of identifying the problem to be solved. The entire process of
framing a problem and finding a specific solution for it is known as problem scoping.

To find a problem and a specific solution you must start with the 4Ws.

4Ws problem canvas- The 4Ws of Problem Scoping are Who, What, Where, and Why. These
Ws help in identifying and understanding the problem in a better and more efficient manner.

1. Who: The "Who" part helps us understand and categorize everyone who is affected directly
or indirectly by the problem. These people are called the Stakeholders.

2. What: The "What" part helps us understand and identify the nature of the problem. Under
this block, you also gather evidence to prove that the problem you have selected exists.

3. Where: The "Where" part asks where the problem arises: the situation, context, and location.

4. Why: The "Why" part asks why the given problem is worth solving.

Data Acquisition: It is the process of collecting data from authentic and reliable sources for
effective decision making.

Types of data- Not all data is created equal. Some data is structured, but most of it is
unstructured.

Structured Data- Data organized in a fixed format, such as rows and columns. In these notes it is
also called labeled data.

Unstructured Data- Data with no fixed format, such as text, images, or audio. In these notes it is
also called unlabelled data.

Data features - Each feature, or column, represents a measurable piece of data that can be
used for analysis.

Ex: salary amount, increment percentage, increment period, bonus, etc.


Problem Statement Template-

Our               Stakeholder                                   Who
Have a problem    Issue/Problem                                 What
When/While        Context/Situation/Location                    Where
Ideal Solution    How the solution will help the stakeholders   Why

Data Acquisition-

The process of collecting accurate and reliable data to work with.

Data Set- A data set (or dataset) is a collection of data.

Methods of Data Acquisition

The most common methods of data acquisition are:

1. Surveys: Through Google Forms, MS Teams Forms or any other interface.
2. Web Scraping: Some tools are Scrapy, ScrapeHero Cloud, ParseHub, OutWit Hub and
Visual Web Ripper.
3. Sensors: Devices that convert physical parameters into electrical signals. These signals are
conditioned and then converted to digital values.
4. Cameras: To capture images.
5. Observations: A way of gathering data by watching behavior or events, or by noting physical
characteristics in their natural setting.
6. API (Application Programming Interface): An API is a messenger that takes a request, passes
it to the system, and returns the response. Ex: Twitter API, Google Search API.

Big Data

● A collection of data that is huge in volume and still growing exponentially with time.
● Its size and complexity are so large that traditional data management tools cannot store
or process it efficiently.

Examples of Big Data

● Stock Exchange
● Social Media Websites
● YouTube and web series platforms

Data Exploration

In this stage of the project cycle, we try to extract useful information from the data we have
acquired. For this purpose, we need to explore the data and arrange it uniformly for better
understanding. This stage deals with validating or verifying the collected data to check that:

● The data is according to the specifications decided.
● The data is free from errors.
● The data meets our needs.

This stage is divided into 2 sub-stages.
1) Data Cleaning
2) Data Visualization

Data Cleaning
Data cleaning helps in getting rid of commonly found errors and mistakes in a data set. These
are 3 commonly found errors in data.
1) Outliers: Data points lying outside the expected range.
2) Missing data: Data points missing at certain places.
3) Erroneous data: Incorrect data points.
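
The three kinds of errors above can be handled in a few lines of Python. This is only a minimal sketch: the marks list, the valid range of 0 to 100, and the choice to replace bad values with the median are assumptions made for illustration, not a method prescribed in these notes.

```python
from statistics import median

def clean(values, low, high):
    """Return a cleaned copy of `values`.

    None marks a missing data point. Values outside [low, high] are
    treated as outliers or erroneous entries. Both kinds are replaced
    by the median of the remaining valid values (an assumed strategy).
    """
    valid = [v for v in values if v is not None and low <= v <= high]
    fill = median(valid)
    return [v if (v is not None and low <= v <= high) else fill
            for v in values]

# Hypothetical exam marks out of 100: one missing, two out of range.
marks = [45, 50, None, 48, 500, -3, 52]
cleaned = clean(marks, 0, 100)
print(cleaned)
```

Other strategies, such as dropping the bad rows entirely, are equally valid; which one to use depends on the data and the problem.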

Data Visualization

Why do we need to explore data through visualization?


1) We want to quickly get a sense of the trends, relationships, and patterns contained within the
data.
2) It helps us define a strategy for which model to use at a later stage.
3) Visual representation is easier to understand and communicate to others.

Data Visualization tools

● Microsoft Excel
● Tableau
● QlikView
● Datawrapper
● Google Data Studio
● Python Matplotlib

Modelling: Modelling is the process in which different models based on the visualized data are
created and checked for their advantages and disadvantages.

● AI Modelling refers to developing algorithms, also called models, which can be trained to
give intelligent outputs. That is, writing code to make a machine artificially intelligent.

Types of AI models - AI models are broadly of two types: rule-based and learning-based.

Rule-Based model - In this approach, the rules are defined by the developer. The machine
follows the rules or instructions given by the developer and performs its task accordingly.
So, it is a static model, i.e. the machine, once trained, does not take into account any
changes made to the original training dataset.

Ex: You trained your model with 100 images of apples and bananas. Now if you test it by
showing an apple, it will figure out and tell whether it is an apple or not. Here, labeled images
of apples and bananas were fed, due to which the model could detect the fruit.
*Labeled Images: Images for which the model is told what the image shows.
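
A rule-based model can be sketched as a function whose rules are written by hand. The function name `classify_fruit` and the colour and shape rules below are hypothetical and chosen only to illustrate the idea that the developer, not the data, decides the rules.

```python
def classify_fruit(colour, shape):
    """Classify a fruit using fixed rules written by the developer.

    Nothing here is learned from data, so the behaviour never changes
    unless the developer edits the rules.
    """
    if colour == "red" and shape == "round":
        return "apple"
    if colour == "yellow" and shape == "long":
        return "banana"
    return "unknown"

print(classify_fruit("red", "round"))
print(classify_fruit("green", "round"))
```

A green apple is reported as "unknown" because no rule covers it. This is the static behaviour described above.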

Learning-Based model - It is a type of AI modelling where the machine learns by itself. Under the
learning-based approach, the AI model gets trained on the data fed to it and is then able to
design a model which adapts to changes in the data. That is, if the model is trained with X
type of data and the machine designs the algorithm around it, the model would modify itself
according to the changes which occur in the data, so that exceptions are also handled.

After training, the machine is fed with testing data. The testing data might not have
images similar to the ones on which the model has been trained. So, the model relies on the
features it has learned during training and predicts the output accordingly.
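
In contrast to hand-written rules, a learning-based model works out its own decision rule from the training data. The sketch below assumes a single made-up feature (fruit length in cm) and a very simple learner that remembers the average length for each label. The names `train` and `predict` are hypothetical.

```python
def train(samples):
    """Learn the average length for each label from (length, label) pairs."""
    totals = {}
    for length, label in samples:
        total, count = totals.get(label, (0.0, 0))
        totals[label] = (total + length, count + 1)
    return {label: total / count for label, (total, count) in totals.items()}

def predict(model, length):
    """Return the label whose learned average is closest to `length`."""
    return min(model, key=lambda label: abs(model[label] - length))

# Hypothetical training data: (length in cm, label).
training_data = [(7, "apple"), (8, "apple"), (18, "banana"), (20, "banana")]
model = train(training_data)

# Testing data: lengths that did not appear in the training data.
print(predict(model, 9))
print(predict(model, 16))
```

If the training data changes, calling `train` again produces a different model without anyone rewriting the rules.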

Types of learning:-

There are three types of learning:

1. Supervised
2. Unsupervised
3. Reinforcement

Supervised Learning

● The dataset which is fed to the machine is labeled.
● A label is some information which can be used as a tag for data.
For example, students get grades according to the marks they secure in examinations.
These grades are labels which categorize the students according to their marks.

Classification (discrete data)
In this model, data is classified according to the labels. For example, in the grading system,
students are classified on the basis of the grades they obtain with respect to their marks in the
examination. This model works on a discrete dataset, which means the data need not be
continuous.
Example: To predict whether a fruit is an apple or a banana.
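
The grading example can be sketched as a small classifier. This sketch uses the k-nearest-neighbours idea, which these notes do not name: a new student gets the grade held by most of the k students with the closest marks. The marks, grades, and the function name `knn_classify` are assumptions for illustration.

```python
from collections import Counter

def knn_classify(training_data, marks, k=3):
    """Predict a grade by majority vote among the k closest marks."""
    nearest = sorted(training_data, key=lambda item: abs(item[0] - marks))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Hypothetical labeled dataset: (marks, grade).
students = [
    (92, "A"), (88, "A"), (85, "A"),
    (74, "B"), (70, "B"), (65, "B"),
    (52, "C"), (48, "C"), (40, "C"),
]

print(knn_classify(students, 90))
print(knn_classify(students, 68))
print(knn_classify(students, 45))
```

The output is always one of a fixed set of labels (A, B or C), which is what makes this a classification problem.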
Regression (continuous data)
Regression is a type of supervised learning which is used to predict a continuous value.
Example: To predict your next salary, feed in the data of your previous salaries, any
increments, etc., and train the model.
Example: Weather prediction using past data.
Here, the data which has been fed to the machine is continuous.
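
The salary example can be sketched with a straight-line fit (simple linear regression by least squares). The salary figures are made up, and the function name `fit_line` is hypothetical. For a line $y = mx + c$, the slope and intercept are

$$m = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{\sum (x_i - \bar{x})^2}, \qquad c = \bar{y} - m\,\bar{x}$$

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    numerator = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    denominator = sum((x - mean_x) ** 2 for x in xs)
    slope = numerator / denominator
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical past data: salary in each of the first four years.
years = [1, 2, 3, 4]
salaries = [30000, 32000, 34000, 36000]

slope, intercept = fit_line(years, salaries)
predicted_year_5 = slope * 5 + intercept
print(predicted_year_5)
```

Here the predicted value can be any number on a continuous scale, not one of a fixed set of labels.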

Unsupervised Learning-

In machine learning, unsupervised learning is learning in which a system finds patterns in the
data on its own. Here, the training data is not labeled.

Important Points:

● An unsupervised learning model works on an unlabelled dataset.
● This means that the data which is fed to the machine is random, and there is a possibility
that the person who is training the model does not have any information regarding it.
● Unsupervised learning models are used to identify relationships, patterns and trends
in the data which is fed into them.
● It helps the user in understanding what the data is about and what major features the
machine has identified.
Example: Suppose a boy sees someone performing tricks with a ball, and he learns
the tricks by himself. This is what we call unsupervised learning.

Clustering
It is an unsupervised learning algorithm which can cluster unknown data according to the
patterns or trends identified in it. The patterns observed might be ones already known to the
developer, or the algorithm might even come up with some unique patterns of its own.
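
One common clustering algorithm is k-means, which these notes do not name. The sketch below is a simplified one-dimensional version: the points, the starting centres, and the function name `kmeans_1d` are all assumptions for illustration. Notice that no labels are given anywhere.

```python
def kmeans_1d(points, centres, rounds=10):
    """Group numbers into clusters around the given starting centres."""
    for _ in range(rounds):
        # Step 1: assign every point to its nearest centre.
        clusters = [[] for _ in centres]
        for p in points:
            nearest = min(range(len(centres)), key=lambda i: abs(p - centres[i]))
            clusters[nearest].append(p)
        # Step 2: move each centre to the average of its cluster.
        centres = [sum(c) / len(c) if c else centres[i]
                   for i, c in enumerate(clusters)]
    return centres, clusters

# Hypothetical unlabelled data with two obvious groups.
points = [1, 2, 3, 10, 11, 12]
centres, clusters = kmeans_1d(points, centres=[0, 5])
print(centres)
print(clusters)
```

The algorithm discovers the two groups by itself, even though the starting centres were poor guesses.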

Dimensionality reduction
We humans are able to visualize up to 3 dimensions only. If we have a ball in our hand, it is
a 3-dimensional object. But if we click its picture, the data transforms to 2-D.
Hence, to reduce the dimensions and still be able to make sense of the data, we use
Dimensionality Reduction.

For example, in Natural Language Processing, words are treated as N-dimensional
entities, which means that we cannot visualize them as they exist beyond our visualization
ability. Hence, to make sense of them, we need to reduce their dimensions. Here, a
dimensionality reduction algorithm is used.
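
A very simple way to reduce dimensions is to drop the features that carry the least information. The sketch below keeps the features with the largest variance. This is only one basic approach, assumed here for illustration; widely used algorithms such as PCA are more sophisticated. The function name `reduce_dimensions` and the data are hypothetical.

```python
from statistics import pvariance

def reduce_dimensions(rows, keep):
    """Keep only the `keep` columns (features) with the largest variance."""
    columns = list(zip(*rows))
    variances = [pvariance(col) for col in columns]
    ranked = sorted(range(len(columns)), key=lambda i: variances[i], reverse=True)
    chosen = sorted(ranked[:keep])  # preserve the original column order
    return [[row[i] for i in chosen] for row in rows]

# Hypothetical 3-D data: the third feature is the same for every point,
# so it tells us nothing and can be dropped.
points_3d = [(1, 10, 5), (2, 20, 5), (3, 30, 5), (4, 40, 5)]
points_2d = reduce_dimensions(points_3d, keep=2)
print(points_2d)
```

The 2-D result still distinguishes all four points, which is the goal: fewer dimensions, same useful information.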

Reinforcement Learning- Learning through feedback, or the trial and error method, is called
Reinforcement Learning.

The system works on a reward or penalty policy. An agent performs an action in the
environment, the environment changes its state in response, and the agent is given a reward
if the action was good or a penalty if it was bad.
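
The reward or penalty idea can be sketched with a very small agent. This is a simplified illustration under stated assumptions: the environment has only one state and two made-up actions, "left" (penalty of -1) and "right" (reward of +1), and the function name `train_agent` is hypothetical. Real reinforcement learning problems usually involve many states.

```python
import random

def train_agent(rewards, episodes=50, epsilon=0.2, learning_rate=0.5, seed=0):
    """Learn the value of each action by trial and error."""
    rng = random.Random(seed)
    values = {action: 0.0 for action in rewards}
    for _ in range(episodes):
        if rng.random() < epsilon:
            action = rng.choice(list(values))       # explore: try something
        else:
            action = max(values, key=values.get)    # exploit: use what is known
        reward = rewards[action]                    # feedback from the environment
        # Nudge the stored value of this action towards the reward received.
        values[action] += learning_rate * (reward - values[action])
    return values

values = train_agent({"left": -1, "right": +1})
best_action = max(values, key=values.get)
print(best_action)
```

Nobody tells the agent which action is correct. It finds out by acting and observing the reward or penalty that follows.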

Evaluation:
Evaluation is the process of understanding the reliability of an AI model by feeding test data
into the model and comparing its outputs with the actual answers. There can be
different evaluation techniques, depending on the type and purpose of the model.

In this stage of the AI project, the model is checked and evaluated on the basis of:

● Accuracy
● Precision
● Recall
● F1 Score
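
These four measures are computed from how the predictions compare with the actual answers. With TP, TN, FP and FN standing for true positives, true negatives, false positives and false negatives, the standard definitions are: Accuracy = (TP + TN) / total, Precision = TP / (TP + FP), Recall = TP / (TP + FN), and F1 Score = 2 × Precision × Recall / (Precision + Recall). The sketch below applies them to made-up test results; the function name `evaluate` is hypothetical.

```python
def evaluate(actual, predicted, positive=1):
    """Compute accuracy, precision, recall and F1 score for two label lists."""
    pairs = list(zip(actual, predicted))
    tp = sum(a == positive and p == positive for a, p in pairs)
    tn = sum(a != positive and p != positive for a, p in pairs)
    fp = sum(a != positive and p == positive for a, p in pairs)
    fn = sum(a == positive and p != positive for a, p in pairs)

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}

# Hypothetical test data: 1 = positive case, 0 = negative case.
actual    = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]

scores = evaluate(actual, predicted)
print(scores)
```

In this example TP = 3, TN = 5, FP = 1 and FN = 1, so the accuracy is 8 out of 10.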

Neural Network:
Neural networks form the base of deep learning, a subfield of machine learning where
algorithms are inspired by the structure of the brain. Neural networks take data in and train
themselves to recognize the patterns in the data and predict the output for a new set of
similar data.
Important features of Neural Network:
▪ The model of a neural network is based on the human neural network, i.e. the brain and
nervous system.
▪ They are designed in such a way that features can be extracted automatically,
without the intervention of the programmer.
▪ Every node of a neural network system is a machine learning algorithm.
▪ It is best suited for AI models dealing with large data.
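
To see how a single node can train itself, here is a minimal sketch of a perceptron, one of the simplest kinds of artificial neuron. It learns the logical AND of two inputs. The task, the function names and the settings are assumptions for illustration; a real neural network connects many such nodes in layers.

```python
def predict(weights, bias, inputs):
    """Fire (output 1) only if the weighted sum of inputs is above zero."""
    total = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1 if total > 0 else 0

def train_perceptron(samples, epochs=10, learning_rate=1):
    """Adjust weights and bias whenever the node makes a wrong prediction."""
    weights = [0, 0]
    bias = 0
    for _ in range(epochs):
        for inputs, target in samples:
            error = target - predict(weights, bias, inputs)
            weights = [w + learning_rate * error * x
                       for w, x in zip(weights, inputs)]
            bias += learning_rate * error
    return weights, bias

# Training data for logical AND: output is 1 only when both inputs are 1.
and_samples = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
weights, bias = train_perceptron(and_samples)

for inputs, _ in and_samples:
    print(inputs, predict(weights, bias, inputs))
```

The programmer never writes the rule for AND. The node arrives at suitable weights on its own by correcting its mistakes.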

Difference between Classification and Clustering-

Difference between Classification and Regression-
