0% found this document useful (0 votes)

143 views12 pages

Fake News Detection Using ML Report

This project aims to develop a machine learning model to detect fake news. The objectives are to collect a dataset of genuine and fake news articles, preprocess the text, extract features, and train/evaluate models like Naive Bayes, SVMs, Random Forest. The best model will be selected based on metrics like accuracy, precision, recall. Potential improvements and future work involving deep learning techniques and multimodal features are discussed.

Uploaded by

Sparsh Dhama

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

143 views12 pages

Fake News Detection Using ML Report

Uploaded by

Sparsh Dhama

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Introduction
Literature Survey
Methodology
Results and Discussion
Conclusion and Future Work
References

Mini Project Report on

Fake News Detection using ML

Submitted in partial fulfillment of the requirement for the award of the

degree of

BACHELOR OF TECHNOLOGY
IN
COMPUTER SCIENCE & ENGINEERING

Submitted by:

Student Name: Sparsh Dhama University Roll No.: 2119275

Under the Mentorship of

Ms. Shruti Bhatla

Department of Computer Science and Engineering

Graphic Era Hill University
Dehradun, Uttarakhand
CANDIDATE’S DECLARATION

I hereby certify that the work which is being presented in the project report entitled “Fake
News Detection using ML” in partial fulfillment of the requirements for the award of the
Degree of Bachelor of Technology in Computer Science and Engineering of the Graphic Era
Hill University, Dehradun shall be carried out by myself under the mentorship of Ms. Shruti
Bhatla, Department of Computer Science and Engineering, Graphic Era Hill University,
Dehradun.

Name: University Roll no.:

Sparsh Dhama 2119275
Table of Contents

Chapter No. Description Page No.

Chapter 1 Introduction

Chapter 2 Literature Survey

Chapter 3 Methodology

Chapter 4 Result and Discussion

Chapter 5 Conclusion and Future Work

References
Chapter 1
Introduction

1.1 Introduction

The proliferation of fake news has become a significant concern in

today's digital age. Misinformation and deceptive content spread
rapidly through social media and online platforms, leading to
potential consequences such as public manipulation, erosion of trust,
and societal polarization. To combat this issue, there is a growing
interest in developing automated systems that can effectively detect
fake news articles. Machine learning algorithms provide a promising
approach to tackle this problem by leveraging patterns and features
within the data to identify deceptive information.

4
1.2 Problem Statement

The objective of this project is to develop a machine learning

model that can accurately classify news articles as either genuine
or fake. The model will analyze various textual features, such as
headline, content, source, and other metadata, to determine the
likelihood of an article being fake. The goal is to create a reliable
and efficient system that can assist users in distinguishing
between reliable news sources and potentially misleading
information.

5
1.3 Objectives of the Project

The main objectives of this project are as follows:

1. Collect a comprehensive dataset of labeled news articles, consisting
of both genuine and fake examples, to train and evaluate the machine
learning model.
2. Perform exploratory data analysis to gain insights into the
characteristics and patterns present in the dataset.
3. Preprocess the textual data by applying techniques such as
tokenization, stop-word removal, and stemming to transform the text
into a suitable format for machine learning algorithms.
4. Design and implement a machine learning pipeline that includes
feature extraction, model training, and evaluation stages.
5. Evaluate the performance of various machine learning algorithms,
such as Naive Bayes, Support Vector Machines, and Random Forest,
to identify the most effective model for fake news detection.
6. Fine-tune the selected model by optimizing hyperparameters and
evaluating its performance on validation data.
7. Assess the model's performance using appropriate evaluation
metrics, such as accuracy, precision, recall, and F1-score.
8. Provide recommendations for potential improvements and future
research directions in the field of fake news detection using machine
learning.

6
Chapter 2
Literature Survey

In this chapter, a comprehensive review of existing literature on

fake news detection using machine learning techniques will be
presented. The survey will cover various approaches,
methodologies, and performance metrics employed by researchers
in the field. It will also highlight the strengths and limitations of
different machine learning algorithms in detecting fake news.

System Architecture

7
Chapter 3
Methodology
3.1 Data Collection
A diverse dataset of labeled news articles will be collected from reliable
sources, including reputable news outlets and fact-checking organizations.
The dataset will consist of both genuine and fake news examples, ensuring
a balanced representation of different categories and topics.
3.2 Data Preprocessing
The collected dataset will undergo preprocessing steps to clean and
transform the textual data. Techniques such as tokenization, stop-word
removal, stemming, and vectorization will be applied to convert the text
into numerical features that can be used by machine learning algorithms.
3.3 Feature Extraction
Various features will be extracted from the preprocessed text, including
bag-of-words representations, TF-IDF scores, and word embeddings.
Additional metadata features, such as article source, publication date, and
author credibility, will also be considered to enhance the model's
performance.
3.4 Model Training and Evaluation
Several machine learning algorithms, such as Naive Bayes, Support
Vector Machines, Random Forest, and Neural Networks, will be trained
and evaluated using appropriate training and testing splits of the dataset.
The models will be assessed based on performance metrics such as
accuracy, precision, recall, and F1-score.
SYSTEM REQUIREMENTS

8
HARDWARE REQUIREMENTS:
 System - Pentium-IV
 Speed - 2.4GHZ
 Hard disk - 40GB
 Monitor - 15VGA color
 RAM - 512MB
SOFTWARE REQUIREMENTS:
 Operating System - Windows XP
 Coding language - PYTHON

9
Chapter 4
Results and Discussion

The results obtained from training and evaluating different

machine learning models will be presented in this chapter. The
performance metrics of each model will be compared to identify
the most effective algorithm for fake news detection. The
strengths and weaknesses of the selected model will be discussed,
along with potential reasons for its performance.

10
Chapter 5
Conclusion and Future Work

In this final chapter, the overall findings and conclusions of the project
will be summarized. The effectiveness of machine learning algorithms in
detecting fake news will be discussed, along with the implications and
potential applications of the developed model. Future research directions,
such as incorporating deep learning techniques and considering
multimodal features, will be suggested to improve the accuracy and
robustness of fake news detection systems.

11
References

[1][Link]

[2][Link]

learning/

[3][Link]

[4][Link]

[5] [Link]

Common questions

Primary preprocessing techniques included tokenization, stop-word removal, stemming, and vectorization. These steps were applied to clean and transform the textual data into a suitable format for machine learning algorithms by reducing noise and standardizing the text, which is crucial for accurate feature extraction and model training .

Feature extraction involved transforming the preprocessed text into numerical features usable by machine learning algorithms. Techniques like bag-of-words, TF-IDF scores, and word embeddings were used. Additional metadata features such as article source, publication date, and author credibility were also considered to enhance the model's performance. This was significant as it ensured that the model had a comprehensive set of features to accurately distinguish between genuine and fake news .

Evaluating models using precision, recall, and F1-score addresses specific aspects of model performance. Precision measures the accuracy of positive predictions, recall measures how well the model identifies actual positive instances, and F1-score balances precision and recall. Together, they ensure the model is not only accurate but also reliable in identifying fake news, aligning with the project's goal of developing an effective fake news detection system .

The project suggested future research could improve fake news detection systems by incorporating deep learning techniques and considering multimodal features. These approaches could enhance the accuracy and robustness of such systems by leveraging more complex models and integrating various types of data beyond text, such as images or videos .

The hardware requirements specified included a Pentium-IV system with 2.4GHz speed, 40GB hard disk, 15VGA color monitor, and 512MB RAM. Software requirements included Windows XP as the operating system and Python as the coding language. These requirements align with the project's computational needs by providing a baseline system capable of running Python-based machine learning algorithms for fake news detection .

Misinformation can lead to public manipulation, erosion of trust, and societal polarization. Machine learning proposes to mitigate these issues by developing automated systems that identify patterns and features indicative of fake news, allowing for efficient detection and classification of deceptive information .

The exploratory data analysis provided insights into the characteristics and patterns within the dataset, such as common textual features in fake vs. genuine articles. These findings informed the design of feature extraction methods and the choice of machine learning algorithms, ultimately impacting model selection and training strategies for more accurate fake news detection .

The project evaluated several machine learning algorithms including Naive Bayes, Support Vector Machines, Random Forest, and Neural Networks. The effectiveness of each model was determined using performance metrics such as accuracy, precision, recall, and F1-score. The model with the best performance across these metrics was considered the most effective for fake news detection .

The problem statement was to develop a machine learning model capable of accurately classifying news articles as genuine or fake. The specific objectives included collecting a comprehensive dataset, performing exploratory data analysis, preprocessing the data, designing a machine learning pipeline, evaluating and fine-tuning models, and finally, assessing models with metrics like accuracy, precision, recall, and F1-score .

The project collected a diverse dataset of labeled news articles from reliable sources, including reputable news outlets and fact-checking organizations. Ensuring a balanced representation of genuine and fake news was crucial for model reliability, as it allowed the model to effectively learn and generalize patterns associated with fake news across various categories and topics .

Mini Project Report on

Fake News Detection using ML

Submitted in partial fulfillment of the requirement for the award o

CANDIDATE’S DECLARATION

I hereby certify that the work which is being presented in the project report entitled “Fake

Table of Contents

Chapter No.
Description
Page No.
Chapter 1
Introduction

Chapter 2
Literature Survey
Chapt

4

Chapter 1
Introduction

1.1 Introduction

The proliferation of fake news has become a significant concern in
to

5

1.2 Problem Statement

The objective of this project is to develop a machine learning
model that can accurately cla

6

1.3 Objectives of the Project

The main objectives of this project are as follows:
1. Collect a comprehensive dataset

7

Chapter 2
Literature Survey

In this chapter, a comprehensive review of existing literature on
fake news detection u

8

Chapter 3
Methodology
3.1 Data Collection
A diverse dataset of labeled news articles will be collected from reliable

9

HARDWARE REQUIREMENTS:
 System - Pentium-IV
 Speed - 2.4GHZ
 Hard disk - 40GB
 Monitor - 15VGA color
 RAM - 51

10

Chapter 4
Results and Discussion

The results obtained from training and evaluating different
machine learning mode

Fake News Detection System Overview
No ratings yet
Fake News Detection System Overview
16 pages
Fake News Detection Using Java DP
No ratings yet
Fake News Detection Using Java DP
21 pages
Final Report
No ratings yet
Final Report
76 pages
Chatbot Assistant System Project Overview
No ratings yet
Chatbot Assistant System Project Overview
24 pages
B.Tech Thesis Report: ECE Project
No ratings yet
B.Tech Thesis Report: ECE Project
13 pages
FB Chatbot Project Report
No ratings yet
FB Chatbot Project Report
41 pages
DeepXDE: Physics-Informed Neural Networks
No ratings yet
DeepXDE: Physics-Informed Neural Networks
17 pages
Identifying Fake Profiles with ANN
No ratings yet
Identifying Fake Profiles with ANN
78 pages
Flood Prediction Using Machine Learning
No ratings yet
Flood Prediction Using Machine Learning
17 pages
Deepfake Video Detection Project Report
No ratings yet
Deepfake Video Detection Project Report
28 pages
Deep Fake Detection Techniques Review
No ratings yet
Deep Fake Detection Techniques Review
18 pages
Real-Time Hand Gesture Recognition System
No ratings yet
Real-Time Hand Gesture Recognition System
40 pages
Internship Report at Pie Infocomm Pvt. Ltd.
No ratings yet
Internship Report at Pie Infocomm Pvt. Ltd.
29 pages
Twitter Spam Detection Techniques
No ratings yet
Twitter Spam Detection Techniques
45 pages
Placement Prediction with ML Models
No ratings yet
Placement Prediction with ML Models
5 pages
Voice Assistant Project with NLP & Deep Learning
No ratings yet
Voice Assistant Project with NLP & Deep Learning
82 pages
Text-to-Speech Conversion Project Report
No ratings yet
Text-to-Speech Conversion Project Report
26 pages
Master's Program Finder in Sri Lanka
No ratings yet
Master's Program Finder in Sri Lanka
31 pages
Vandana Internship Report
No ratings yet
Vandana Internship Report
48 pages
Project Overview and Development Insights
No ratings yet
Project Overview and Development Insights
14 pages
Real Estate Price Prediction Report
No ratings yet
Real Estate Price Prediction Report
20 pages
AI Resume Screening System Project
100% (1)
AI Resume Screening System Project
32 pages
Chatbot Project Report 2018-19
No ratings yet
Chatbot Project Report 2018-19
7 pages
Web-Based Chatbot System Design
No ratings yet
Web-Based Chatbot System Design
8 pages
Blockchain in Healthcare Overview
No ratings yet
Blockchain in Healthcare Overview
34 pages
Fake News Detection System Overview
No ratings yet
Fake News Detection System Overview
18 pages
Machine Learning for Student Performance
No ratings yet
Machine Learning for Student Performance
38 pages
Youtube Transcript Summarizer Using Flask
No ratings yet
Youtube Transcript Summarizer Using Flask
9 pages
Automatic Car Speed Control RFID
No ratings yet
Automatic Car Speed Control RFID
47 pages
Smart Dustbin for Waste Segregation
No ratings yet
Smart Dustbin for Waste Segregation
7 pages
Smart Car Parking System Report
No ratings yet
Smart Car Parking System Report
17 pages
Multi-Perspective E-Commerce Fraud Detection
No ratings yet
Multi-Perspective E-Commerce Fraud Detection
6 pages
AI Multimedia Deepfake Detection Report
No ratings yet
AI Multimedia Deepfake Detection Report
24 pages
Online Agriculture Marketing Project
100% (1)
Online Agriculture Marketing Project
30 pages
Project Report: WCE Sangli CSE
No ratings yet
Project Report: WCE Sangli CSE
12 pages
Smart College Enquiry Chatbot Project
No ratings yet
Smart College Enquiry Chatbot Project
88 pages
Software Engineering Project Overview
100% (1)
Software Engineering Project Overview
31 pages
Music Course Management System Report
No ratings yet
Music Course Management System Report
51 pages
Red Wine Quality Prediction Report
No ratings yet
Red Wine Quality Prediction Report
31 pages
Cyberbullying Detection via ML Techniques
No ratings yet
Cyberbullying Detection via ML Techniques
67 pages
LSTM Ensemble Learning for Phishing Detection
No ratings yet
LSTM Ensemble Learning for Phishing Detection
17 pages
Alumni Management System Project Report
No ratings yet
Alumni Management System Project Report
43 pages
Fake Account Detection Using Random Forest
No ratings yet
Fake Account Detection Using Random Forest
95 pages
Arduino Robot for Elderly Assistance
No ratings yet
Arduino Robot for Elderly Assistance
60 pages
AI Chatbot for Customer Support Project
0% (1)
AI Chatbot for Customer Support Project
3 pages
Woldia University Society Chat Proposal
No ratings yet
Woldia University Society Chat Proposal
54 pages
Seminar Report on Prompt Engineering
No ratings yet
Seminar Report on Prompt Engineering
28 pages
Automatic Answer Evaluator Project Report
100% (1)
Automatic Answer Evaluator Project Report
65 pages
Robust Lane Detection in Adverse Conditions
No ratings yet
Robust Lane Detection in Adverse Conditions
15 pages
Semi-Supervised Fake Review Detection
No ratings yet
Semi-Supervised Fake Review Detection
4 pages
Data Science Internship Report
No ratings yet
Data Science Internship Report
42 pages
Handwriting Recognition Project Report
No ratings yet
Handwriting Recognition Project Report
34 pages
Naïve Bayes SMS/Email Spam Classifier
No ratings yet
Naïve Bayes SMS/Email Spam Classifier
9 pages
Currency Detection System Report
No ratings yet
Currency Detection System Report
17 pages
Web Application for Eye Clinic Project
No ratings yet
Web Application for Eye Clinic Project
48 pages
Campus Connect: College ERP System Report
No ratings yet
Campus Connect: College ERP System Report
40 pages
Fake News Classifier with NLP & React
No ratings yet
Fake News Classifier with NLP & React
5 pages
Tower of Hanoi Mini Project Report
No ratings yet
Tower of Hanoi Mini Project Report
23 pages
PCL Report
No ratings yet
PCL Report
10 pages
Fake News Detection with Machine Learning
No ratings yet
Fake News Detection with Machine Learning
23 pages
Introduction to Hive in Big Data Analytics
No ratings yet
Introduction to Hive in Big Data Analytics
28 pages
CS 412 Data Mining Course Syllabus
No ratings yet
CS 412 Data Mining Course Syllabus
7 pages
MS Access Query Types and Examples
No ratings yet
MS Access Query Types and Examples
12 pages
Extended E-R Features in Database Design
No ratings yet
Extended E-R Features in Database Design
12 pages
2D Takeoff Kreo
No ratings yet
2D Takeoff Kreo
2 pages
Understanding Data-Driven Decision Support Systems
No ratings yet
Understanding Data-Driven Decision Support Systems
7 pages
Solutions Manual Handbook of Business Analytics 2nd Edition Jaggia Textbook
100% (2)
Solutions Manual Handbook of Business Analytics 2nd Edition Jaggia Textbook
265 pages
Global Mental Health Awareness Guide
No ratings yet
Global Mental Health Awareness Guide
26 pages
Lesson 1 Cataloguing Principles and Practices
No ratings yet
Lesson 1 Cataloguing Principles and Practices
8 pages
Project Staffing and Management Overview
No ratings yet
Project Staffing and Management Overview
11 pages
Apache Spark vs Hive SQL in Big Data
100% (1)
Apache Spark vs Hive SQL in Big Data
4 pages
Smart Grocery List Generator
No ratings yet
Smart Grocery List Generator
4 pages
Understanding Data Warehousing Concepts
No ratings yet
Understanding Data Warehousing Concepts
60 pages
Final Year Project on System Design
No ratings yet
Final Year Project on System Design
3 pages
Tableau Data Visualization Guide
No ratings yet
Tableau Data Visualization Guide
7 pages
11 Tibco
No ratings yet
11 Tibco
13 pages
SQL Server User Defined Tables Overview
No ratings yet
SQL Server User Defined Tables Overview
6 pages
Midterm Exam: IT Application Tools
No ratings yet
Midterm Exam: IT Application Tools
4 pages
TEBM Case Study: Apollo Hospitals
No ratings yet
TEBM Case Study: Apollo Hospitals
5 pages
Youssef Azam Mahfouz - Data Analyst Profile
No ratings yet
Youssef Azam Mahfouz - Data Analyst Profile
2 pages
Data Mining and Business Intelligence Overview
No ratings yet
Data Mining and Business Intelligence Overview
225 pages
Introduction to Information Technology
No ratings yet
Introduction to Information Technology
32 pages
Relational Algebra Queries for Employees
No ratings yet
Relational Algebra Queries for Employees
1 page
Python Code for Sales Data Analysis
No ratings yet
Python Code for Sales Data Analysis
2 pages
Research Skills and Search Queries Guide
No ratings yet
Research Skills and Search Queries Guide
2 pages
Overview of Oracle Benefits Tables
No ratings yet
Overview of Oracle Benefits Tables
4 pages
Multi-Turn Multi-Modal Query Clarification
No ratings yet
Multi-Turn Multi-Modal Query Clarification
12 pages
IBM Address Standarization
No ratings yet
IBM Address Standarization
2 pages
Testbank Case Studies in Health Information Management 4th Edition Schnering
100% (2)
Testbank Case Studies in Health Information Management 4th Edition Schnering
279 pages
Netflix Data Model ER Diagram Creation
No ratings yet
Netflix Data Model ER Diagram Creation
5 pages

Fake News Detection Using ML Report

Uploaded by

Fake News Detection Using ML Report

Uploaded by

Mini Project Report on

Fake News Detection using ML

Submitted in partial fulfillment of the requirement for the award of the

Student Name: Sparsh Dhama University Roll No.: 2119275

Under the Mentorship of

Department of Computer Science and Engineering

Name: University Roll no.:

Chapter No. Description Page No.

Chapter 2 Literature Survey

Chapter 4 Result and Discussion

Chapter 5 Conclusion and Future Work

The proliferation of fake news has become a significant concern in

The objective of this project is to develop a machine learning

The main objectives of this project are as follows:

In this chapter, a comprehensive review of existing literature on

The results obtained from training and evaluating different

Common questions

What were the primary preprocessing techniques applied in the fake news detection project, and how do they contribute to the overall machine learning process?

How did the methodology incorporate feature extraction, and why was it significant for the fake news detection model's performance?

How does the evaluation of machine learning models using metrics like precision, recall, and F1-score contribute to the project's goals?

In what ways did the project suggest future research could improve fake news detection systems?

What were the hardware and software requirements specified for the fake news detection project, and how do they align with the project's computational needs?

What challenges does misinformation pose to society, and how does machine learning propose to mitigate these issues according to the fake news detection project?

What insights were gained from the exploratory data analysis in the fake news detection project, and how did they impact subsequent steps?

What machine learning algorithms were evaluated in the fake news detection project, and how did the project determine the most effective algorithm?

What was the problem statement for the fake news detection project, and what were the specific objectives outlined to tackle this problem?

Describe the data collection process for the fake news detection project and why it is crucial for model reliability.

You might also like