Script Bot vs Smart Bot Explained

Uploaded by

gtcxgamer

Unit 6

NLP
Q:- 1 What are the applications of NLP?
A:- 1. Automatic Text Summarization:- It is the process of creating the most meaningful and
relevant summary of voluminous text from multiple sources.
2. Sentiment Analysis:- It is the process of identifying the sentiment expressed in text, for
example across several social media posts. Sentiment analysis reflects the overall positive,
negative or neutral opinion of a person.
3. Text Classification:- It is the process of classifying unstructured text into groups or
categories, e.g. a spam filter.
4. Virtual Assistant:- Virtual assistants like Alexa, Siri and Cortana use NLP to interpret human
voice, understand human intent and perform tasks accordingly, like playing your favourite
playlist or setting an alarm.
5. Chatbot:- A chatbot automates tasks like saying good morning when you wake up, telling
you the news on a daily basis, helping you choose a route to school with less traffic, or ordering
a coffee for you on your way back home. Examples: Mitsuku bot, Haptik, Ochatbot.
Q:- 2 What is the difference between a Script bot and a Smart bot?
Ans:-
Script Bot:
1. These are simple chatbots with limited functionalities.
2. They are scripted according to the task.
3. Script bots are easy to make.
4. They need less programming skill.
5. Script bots are best suited for straightforward interactions, like those at customer care
services.

Smart Bot:
1. These are flexible, powerful, AI-based models with wider functionalities.
2. They support machine learning algorithms that make a machine learn from experience. They
simulate human-like interactions with users.
3. Smart bots are difficult to make.
4. They need a lot of programming and work on a bigger database.
5. Virtual assistants like Google Assistant, Alexa and Siri are examples of smart bots.
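
To make the "scripted according to the task" idea concrete, here is a minimal sketch of a script bot in Python. The keywords, replies and the function name script_bot_reply are hypothetical and chosen only for illustration; a real customer care bot would have a much larger script.

```python
# Hypothetical script: keyword -> canned reply. These entries are illustrative
# assumptions, not taken from any real product.
SCRIPT = {
    "hours": "We are open from 9 am to 6 pm.",
    "refund": "Refunds are processed within 7 working days.",
    "hello": "Hello! How can I help you?",
}
FALLBACK = "Sorry, I did not understand. Please contact customer care."


def script_bot_reply(message):
    """Return the first scripted reply whose keyword appears in the message."""
    text = message.lower()
    for keyword, reply in SCRIPT.items():
        if keyword in text:
            return reply
    # Nothing in the script matched, so the bot cannot answer.
    return FALLBACK


print(script_bot_reply("Hello there"))
print(script_bot_reply("What are your opening hours?"))
print(script_bot_reply("Tell me a joke"))
```

The bot can only answer what its script anticipates and falls back to a fixed message otherwise. A smart bot would instead learn its responses from data.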

Q:- 3 Name the techniques of NLP.

A:- Techniques of NLP are:
1. Bag of words
2. Term Frequency
3. Inverse Document Frequency
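
The notes only name these techniques, so the following is a rough sketch under stated assumptions: term frequency is taken as the raw count of a word in a document, and inverse document frequency as log10(N / df), where N is the number of documents and df is the number of documents containing the word. Other conventions exist. The toy corpus and function names are hypothetical.

```python
import math
from collections import Counter

# Hypothetical toy corpus, assumed to be already normalised
# (one list of tokens per document).
documents = [
    ["cat", "sat", "mat"],
    ["cat", "ate", "fish"],
    ["cat", "cat", "sat"],
]


def term_frequency(doc):
    """Raw count of each word in one document."""
    return Counter(doc)


def inverse_document_frequency(docs):
    """Assumed convention: idf(word) = log10(N / number of docs containing word)."""
    n_docs = len(docs)
    doc_freq = Counter()
    for doc in docs:
        doc_freq.update(set(doc))  # count each word once per document
    return {word: math.log10(n_docs / df) for word, df in doc_freq.items()}


def tf_idf(docs):
    """Weight each word in each document by tf * idf."""
    idf = inverse_document_frequency(docs)
    return [
        {word: tf * idf[word] for word, tf in term_frequency(doc).items()}
        for doc in docs
    ]


for weights in tf_idf(documents):
    print(weights)
```

A word that occurs in every document ("cat" here) gets an IDF of 0, so it carries no weight, while rarer words score higher.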
Q:- 4 What are the steps for Text Normalisation? Explain.
Ans:- Text Normalisation:- It is the process of cleaning textual data by converting the text into a
standard form. Slang, short forms, misspelled words, abbreviations, etc. are converted
into canonical form. The steps are:
a. Sentence Segmentation:- It is the process of sentence boundary detection, which divides the
corpus into sentences.
b. Tokenization:- It is the process of dividing the sentences into tokens. A token is a word, number
or special character.
c. Removing stopwords, special characters and numbers:- The words that do not provide any
information regarding the corpus are removed in this step.
d. Converting text to a common case:- It is the process of converting the whole corpus into
lowercase, so that the same word in different cases is not treated as different words.
e. Stemming:- It is the process of removing the affixes from a word to get back its base word.
The result may not be a meaningful word.
f. Lemmatization:- It is the process of removing the affixes from a word to get a
meaningful base word.
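
Steps a to d above can be sketched in plain Python as follows. This is a simplified illustration, not a production pipeline: the stopword list is a tiny hypothetical sample, sentence boundaries are detected with a naive punctuation rule, and all function names are made up for this example. Steps e and f are left out because they need affix rules or a dictionary.

```python
import re

# Hypothetical, deliberately tiny stopword list; real lists are much longer.
STOPWORDS = {"a", "an", "and", "are", "is", "the", "to", "of", "in", "it"}


def segment_sentences(corpus):
    """a. Naive rule: a sentence ends at '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", corpus.strip())
    return [part for part in parts if part]


def tokenize(sentence):
    """b. A token is a word, a number, or a single special character."""
    return re.findall(r"[A-Za-z]+|\d+|[^\w\s]", sentence)


def remove_noise(tokens):
    """c. Drop stopwords, special characters and numbers."""
    return [t for t in tokens if t.isalpha() and t.lower() not in STOPWORDS]


def to_common_case(tokens):
    """d. Convert every token to lowercase."""
    return [t.lower() for t in tokens]


def normalise(corpus):
    """Apply steps a to d and return one list of tokens per sentence."""
    return [
        to_common_case(remove_noise(tokenize(sentence)))
        for sentence in segment_sentences(corpus)
    ]


print(normalise("The cat is in the garden. It ate 2 fish!"))
```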
Q:- 5 Give examples of stemming and lemmatization.
A:- Stemming:- Healed - heal, Studies - studi, Caring - car
Lemmatization:- Studies - study, Caring - care
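
The difference can be illustrated with a toy sketch. The suffix list and the lemma table below are assumptions made for this example only; real stemmers and lemmatizers use far richer rules and dictionaries.

```python
# Naive suffix rules, assumed for illustration only.
SUFFIXES = ("ing", "ed", "es", "s")

# Toy lemma table holding only the two examples from the answer above.
LEMMAS = {"studies": "study", "caring": "care"}


def naive_stem(word):
    """Chop off the first matching suffix; the result may not be a real word."""
    word = word.lower()
    for suffix in SUFFIXES:
        # Keep at least three characters so very short words are left alone.
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


def toy_lemmatize(word):
    """Look the word up in the table; unknown words are returned unchanged."""
    word = word.lower()
    return LEMMAS.get(word, word)


for w in ["Healed", "Studies", "Caring"]:
    print(w, "-> stem:", naive_stem(w), "| lemma:", toy_lemmatize(w))
```

Stemming only applies mechanical rules, which is why it is fast but can produce non-words such as "studi" and "car". Lemmatization has to know the vocabulary, which is slower but yields meaningful base words.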
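Q:- 6 What is Bag of Words?
Ans:- Bag of Words is an NLP model that represents text by the words it contains and how often
each word occurs, disregarding grammar and word order. It is applied after text normalisation,
when the corpus has become a normalised corpus, which is just a collection of meaningful words
with no sequence.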
Q:- 7 What are the steps involved in Bag of Words?

Ans:- The steps involved in the Bag of Words algorithm are:

• Text Normalisation: The collection of data is processed to get a normalised corpus.
• Create Dictionary: This step creates a list of all unique words available in the normalised corpus.
• Create Document Vectors: For each document in the corpus, create a list of the unique words with
their number of occurrences.
• Create Document Vectors for all the Documents: Repeat Step 3 for all documents in the corpus to
create a “Document Vector Table”.
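
The steps above can be sketched in Python as follows. Step 1 is assumed to have been done already, so the hypothetical corpus below is given as normalised token lists. The function names are illustrative, not part of any library.

```python
from collections import Counter

# Step 1 (assumed already done): a hypothetical normalised corpus,
# one list of tokens per document.
normalised_corpus = [
    ["cat", "sat", "mat"],
    ["cat", "ate", "fish"],
    ["fish", "sat", "cat", "cat"],
]


def create_dictionary(docs):
    """Step 2: list every unique word, in order of first appearance."""
    dictionary = []
    for doc in docs:
        for word in doc:
            if word not in dictionary:
                dictionary.append(word)
    return dictionary


def create_document_vector(doc, dictionary):
    """Step 3: count how often each dictionary word occurs in one document."""
    counts = Counter(doc)
    return [counts[word] for word in dictionary]


def create_document_vector_table(docs):
    """Step 4: repeat step 3 for every document in the corpus."""
    dictionary = create_dictionary(docs)
    table = [create_document_vector(doc, dictionary) for doc in docs]
    return dictionary, table


dictionary, table = create_document_vector_table(normalised_corpus)
print(dictionary)
for row in table:
    print(row)
```

Each row of the table is one document and each column is one dictionary word, so documents of different lengths become vectors of the same length that can be compared directly.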
Q:- 8 Create a step by step approach to implement a bag of words algorithm.

Common questions

Both processes transform raw data into a format suitable for analysis, but they operate at different stages. Text normalization involves cleaning the text by converting it into a standard format, removing noise like stopwords, and establishing a consistent form through processes like stemming and lemmatization. Creating a document vector table in the Bag of Words model occurs after normalization and involves representing documents as vectors based on the frequency of words from an established dictionary. While text normalization focuses on cleaning and standardizing text, creating document vector tables quantitatively represents the text’s content structure.

Stemming and lemmatization both aim to reduce words to their base forms, but they differ in their approach. Stemming is a rule-based process that removes affixes to return the root form, but this form may not be a valid word (e.g., 'studies' becomes 'studi'). Lemmatization, however, considers the morphological analysis of words, converting them into meaningful base words (e.g., 'studies' becomes 'study'). Lemmatization tends to be more accurate in representing the word's meaning, aiding more precise text analysis. Both processes reduce dimensionality in text data, simplifying further computational tasks.

Automatic text summarization is a natural language processing (NLP) technique that involves creating the most meaningful and relevant summary from a large volume of text gathered from multiple resources. The main applications include reducing the time required to understand information from extensive sources, aiding in fast decision-making processes, and improving the accessibility of information by providing concise summaries.

The 'Bag of Words' model represents text data by disregarding grammar and word order while counting the frequency of each word's occurrence in a document. After text normalization, it forms a dictionary of unique words from the corpus. Each document is then represented as a vector, detailing the occurrences of these words, facilitating various analyses such as text classification and clustering by enabling the comparison of different vectors across documents.

Sentiment analysis focuses on identifying the sentiment or emotion expressed in a text, categorizing it as positive, negative, or neutral, often used to analyze opinions on social media. On the other hand, text classification involves categorizing unstructured text into broader groups or categories, such as in spam filtering, which sorts emails based on content. Both techniques aim to derive structured information from text data but serve distinct purposes.

Virtual assistants, as applications of NLP, interpret human voice commands, understand intent, and perform tasks such as playing music or setting alarms, based on machine learning algorithms. They enhance user interaction by facilitating hands-free, natural language-based communication, improving user experience with accessibility and convenience. This interaction leverages speech recognition and natural language understanding to provide intelligent responses and actions, making technology more user-friendly and efficient.

Implementing a 'Bag of Words' model involves several key steps: 1) Text Normalization, which prepares a clean and standardized corpus by removing noise; 2) Creating a Dictionary, listing all unique words from the normalized corpus, providing a comprehensive lexicon; 3) Creating Document Vectors, where each document is represented by a vector quantifying the occurrence of these words; 4) Creating a Document Vector Table for the entire corpus, enabling comparison between documents. Each step is essential for transforming raw text into a structured format amenable to quantitative analysis and machine learning tasks.

Scripted bots are built with limited functionalities, primarily designed for straightforward interactions and require less programming skill. They strictly follow scripts to perform specific tasks. In contrast, smart bots are more flexible and powerful, utilizing AI and machine learning to simulate human-like interactions, making them more complex and demanding in terms of programming and database management. Smart bots like virtual assistants (e.g., Alexa, Siri) can learn from interactions and adapt over time, unlike scripted bots.

Removing stopwords is crucial because these words (e.g., 'is', 'the', 'at') do not carry significant meaning or insight and can clutter the text data. By eliminating them, the remaining text becomes more focused on the keywords that carry semantic value, improving the efficiency and accuracy of subsequent analysis steps, like sentiment analysis or topic modeling, by reducing noise.

Sentence segmentation and tokenization are crucial steps in text normalization, a process of cleaning textual data. Sentence segmentation, or sentence boundary detection, reduces the text corpus into distinct sentences, facilitating easier management and analysis. Tokenization breaks these sentences into tokens, which can be words, numbers, or special characters, allowing further processing like removing stopwords and stemming, thus standardizing the text for further analysis and application in NLP tasks.
