SMS Spam Detection with Python

The document outlines a process for text classification using a dataset of SMS messages labeled as spam or ham. It includes data preprocessing steps such as tokenization, stopword removal, and stemming, followed by feature extraction using TF-IDF. The document also demonstrates model training and evaluation using various classifiers, including Naive Bayes, Random Forest, and Logistic Regression, along with hyperparameter tuning using GridSearchCV.
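
To make the TF-IDF step concrete, the following is a minimal pure-Python sketch of the weighting that scikit-learn's TfidfVectorizer applies under its default settings (smoothed IDF followed by L2 normalization). The toy messages and the helper name `tfidf_rows` are illustrative assumptions and do not appear in the original notebook.

```python
import math
from collections import Counter

def tfidf_rows(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumes the smoothed IDF used by scikit-learn's defaults:
    idf(t) = ln((1 + n) / (1 + df(t))) + 1, followed by L2 normalization.
    """
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    idf = {t: math.log((1 + n) / (1 + d)) + 1 for t, d in df.items()}
    rows = []
    for doc in docs:
        tf = Counter(doc)
        raw = {t: count * idf[t] for t, count in tf.items()}
        norm = math.sqrt(sum(w * w for w in raw.values()))
        rows.append({t: w / norm for t, w in raw.items()} if norm else {})
    return rows

# Hypothetical, already-tokenized toy messages.
docs = [["free", "prize", "call"], ["call", "me", "later"]]
rows = tfidf_rows(docs)
print(rows[0])
```

In this toy corpus "call" occurs in both messages, so it receives a lower weight than "free" or "prize", which occur in only one.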

Uploaded by

Om and Jay Suryawanshi

import pandas as pd

df=pd.read_csv('[Link]',sep='\t',names=['label','text'])

[Link]

!pip install nltk

import nltk

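nltk.download('stopwords')
nltk.download('punkt')  # tokenizer models required by word_tokenize (newer NLTK releases use 'punkt_tab')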

sent = 'Hello friends! How are you?'

from nltk.tokenize import word_tokenize

word_tokenize(sent)

from nltk.corpus import stopwords

swords = stopwords.words('english')

swords

clean=[word for word in word_tokenize(sent) if word not in swords]

clean

from nltk.stem import PorterStemmer

ps = PorterStemmer()
clean = [ps.stem(word) for word in word_tokenize(sent)
         if word not in swords]

clean

sent='Hello friends! How are you? we will be learning python today.'

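def clean_text(sent):
    # Split the message into tokens
    tokens = word_tokenize(sent)
    # Keep purely alphabetic or purely numeric tokens (drops punctuation)
    clean = [word for word in tokens
             if word.isalpha() or word.isdigit()]
    # Drop stopwords, then stem what remains
    clean = [ps.stem(word) for word in clean
             if word not in swords]
    return clean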

clean_text(sent)

# pre-processing
from sklearn.feature_extraction.text import TfidfVectorizer

tfidf = TfidfVectorizer(analyzer=clean_text)

x = df['text']
y = df['label']

x_new=tfidf.fit_transform(x)
[Link]

x_new.shape

tfidf.get_feature_names_out()  # get_feature_names() was removed in scikit-learn 1.2

y.value_counts()

# train/test split (hold-out validation)
from sklearn.model_selection import train_test_split

x_train, x_test, y_train, y_test = train_test_split(
    x_new, y, random_state=0, test_size=0.25)

x_train.shape

x_test.shape
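
If the label counts turn out to be imbalanced (spam datasets often contain far more ham than spam), a plain random split can shift the class ratio between the training and test sets. The sketch below shows the `stratify` argument of `train_test_split`, which keeps the ratio the same in both parts. The toy labels and feature values are hypothetical, and stratification is a suggested variation rather than something the original notebook does.

```python
from collections import Counter
from sklearn.model_selection import train_test_split

# Hypothetical imbalanced labels: 80 ham, 20 spam.
labels = ["ham"] * 80 + ["spam"] * 20
features = [[i] for i in range(len(labels))]

x_tr, x_te, y_tr, y_te = train_test_split(
    features, labels, test_size=0.25, random_state=0, stratify=labels)

# Count the classes in each part to confirm the ratio is preserved.
train_counts = Counter(y_tr)
test_counts = Counter(y_te)
print(train_counts, test_counts)
```

In the notebook itself this would amount to adding `stratify=y` to the existing call.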

from sklearn.naive_bayes import GaussianNB

nb=GaussianNB()

nb.fit(x_train.toarray(), y_train)  # GaussianNB needs a dense array, hence toarray()

y_pred = nb.predict(x_test.toarray())

y_test.value_counts()

from sklearn.metrics import ConfusionMatrixDisplay

ConfusionMatrixDisplay.from_predictions(y_test,y_pred)

from sklearn.metrics import accuracy_score, classification_report

print(classification_report(y_test,y_pred))

accuracy_score(y_test,y_pred)
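
The GaussianNB cell above converts the TF-IDF matrix to a dense array, which can use a lot of memory when the vocabulary is large. As an alternative to the notebook's approach (not a description of it), MultinomialNB accepts the sparse matrix directly. The toy messages below are hypothetical stand-ins for the SMS dataset.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB

# Hypothetical toy corpus; the notebook itself uses the SMS dataset.
texts = ["win a free prize now", "free cash call now",
         "are we meeting today", "see you at lunch today"]
labels = ["spam", "spam", "ham", "ham"]

vec = TfidfVectorizer()
x_sparse = vec.fit_transform(texts)      # stays a SciPy sparse matrix

clf = MultinomialNB()
clf.fit(x_sparse, labels)                # no .toarray() needed

pred = clf.predict(vec.transform(["free prize today"]))
print(pred)
```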

from sklearn.ensemble import RandomForestClassifier

rf = RandomForestClassifier(random_state=0)

rf.fit(x_train, y_train)

y_pred = rf.predict(x_test)

ConfusionMatrixDisplay.from_predictions(y_test,y_pred)

print(classification_report(y_test,y_pred))

accuracy_score(y_test,y_pred)

from sklearn.linear_model import LogisticRegression

log = LogisticRegression()
log.fit(x_train, y_train)
y_pred = log.predict(x_test)
accuracy_score(y_test, y_pred)

# hyperparameter tuning

from sklearn.model_selection import GridSearchCV

params = {'criterion': ['gini', 'entropy'],
          'max_features': ['sqrt', 'log2'],
          'random_state': [0, 1, 2, 3, 4],
          'class_weight': ['balanced', 'balanced_subsample']
          }

grid = GridSearchCV(rf, param_grid=params, cv=5, scoring='accuracy')

grid.fit(x_train, y_train)

y_pred = grid.predict(x_test)

accuracy_score(y_test,y_pred)
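
The grid above has 2 x 2 x 5 x 2 = 40 parameter combinations, so with cv=5 the search fits 200 models before refitting the best one. Once fitted, a GridSearchCV object reports which combination won through `best_params_` and its mean cross-validated score through `best_score_`. The sketch below shows this on synthetic data with a smaller grid. The data from `make_classification`, the reduced grid, and `n_estimators=20` are assumptions chosen to keep the example quick, not values from the notebook.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

# Synthetic stand-in for the TF-IDF features (an assumption for illustration).
X, y = make_classification(n_samples=200, n_features=10, random_state=0)

param_grid = {"criterion": ["gini", "entropy"],
              "max_features": ["sqrt", "log2"]}

search = GridSearchCV(
    RandomForestClassifier(n_estimators=20, random_state=0),
    param_grid=param_grid, cv=5, scoring="accuracy")
search.fit(X, y)

print(search.best_params_)   # the winning parameter combination
print(search.best_score_)    # its mean cross-validated accuracy
```

In the notebook, the same attributes are available as `grid.best_params_` and `grid.best_score_` after `grid.fit` has run.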
