0% found this document useful (0 votes)

36 views4 pages

Week 5 Industrial Internship Report

Vikas Gupta completed his fifth week of internship at CureYa. He learned about data visualization using QlikSense, including the steps of visualization and important features. He studied PyTorch for machine learning and completed tutorials on tensors, backpropagation, and creating neural networks. He discussed research papers on AI techniques with his mentor and learned about types of papers, criteria for selection, and top publications. To compare machine learning algorithms, he used a breast cancer dataset and found logistic regression and KNN had highest accuracy while decision trees and naive Bayes had lowest.

Uploaded by

Vikas Gupta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

36 views4 pages

Week 5 Industrial Internship Report

Uploaded by

Vikas Gupta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

INDUSTRIAL INTERNSHIP

WEEKLY PERFORMANCE REPORT (WPR)

Student Name: Vikas Gupta

Supervisor Name: B. P. Mishra/ Shivani Mishra
Coordinator/Team Leader Name: Namira Rangrej
Mentor Name: Pranshu Sharma
Organization: CureYa
Hours Worked: Monday-2 hrs, Tuesday-2 hrs, Wednesday-1.5 hrs, Thursday-1.5 hrs, Friday- 3 hrs

Summarize your thoughts regarding your internship this week. Include duties you have performed,
facts, and procedures you have learned, skills you have mastered, and observations you have made.

Week-5 (24 May 28, 2021 to 28 May 2021)

Monday:
Data Visualization: It is the process of converting data into information in the form of a chart,
diagram, picture etc….to help the decision making in the mean time.
Data Visualization using QlikSense: QlikSense is the biggest game player in Business Intelligence
(BI) market operating from 1993 providing various BI services to around 1700 customers all around
the world every day.
There are 3 steps of visualization:
1. Extraction of data from various data sources,
2. Modeling: It’s the clean up part with outcome as a single table or a set of tables interlinked,
3. Visualization: it includes dimensions (columns) and metrics (operations).
What is the importance of Data Visualization? AND Why BI Visualization tools are required?
Answer:
1) Connect varieties of data sources, bring data into a single platform and perform
transformations.
2) Best Data Compression (40 MB storage can be compressed to 5MB storage)
3) In Memory (Improved Performance)
4) Data Associativity Feature (interact deeply with a particular filter feature)
5) Machine learning and deep learning integration (Mash-Up in QlikSense)
6) Easy to use
7) Accessibility over internet (host the application on server and share)
8) Embedded Analytics (bring visualization to websites)
9) Geo Analytics (dig data deep down geographically)
10) Integration of open source chart libraries like [Link], [Link] etc.
QlikSense consists of bigdata data source connectors, flat files connectors, SQL database connectors,
and API connectors.
Who are the End Users:
DAR Structure (Dashboard, Analysis, Reports)
Dashboard: - For Higher Level Management only (Mentioning critical points only)
Analysis: - For Middle Level Management (Aggregate view, deep view of data)
Reports: - Low Level Management (Record by record transaction for TL etc.)
Top BI Tools:-
1) QlikView/ QlikSense (ETL and customization features), 2) Tableau, 3) Power BI, 4)
SiSense, 5) Kibana, 6) Microstrategy, 7) Birst, 8) TibcoSpotfire, 9) Looker
Qlik Installation: - Go to [Link]/us→ Register on it→ Go to “Support”→ Go to
“Download”→ download the latest version (don’t forget to check ‘extras’)
QlikSense Hub: - Login→ Go to ‘desktop hub’→ click on “CreateNewApp” (it’ll be in qvf
format)
Now, open the app→ add data from files → load a csv file → add data→ go to “data load
editor” to check.
This is used to generate insights.
Go to ‘script editor’ to load data→ create new section by clicking Ꚛ→ Name the section→
create new connection→ select folder→ enter path→ name the connection→ select data
Edit connection→ insert data→ click on load data
App Overview→ Sheets, Bookmarks, Stories
Go to Sheets→ click on CreateSheet→ click on association
Data Model Viewer: interconnection of multiple tables
Creation of variables, barcharts, and add-ons. Change appearance such as title, footnote,
presentation, colors of bars.
Creating multiple charts: go to ‘Tables’→ go to ‘Data’→ click on “Add column”
Data Transformation Basics: -
Give appropriate table names (by default it’ll be the file name of ‘csv’.
Comment // to ignore a column.
Adding Filters: Make changes→ Save→ Load again→ Go to “Model Viewer”
Functions in scripts and expressions, refer [Link] for documentation.

Tuesday:
PyTorch Tutorial: -
PyTorch is developed by Facebook AI Research (FAIR) Lab. PyTorch has a C++ interface, thus,
it’s very fast. A number of features of deep learning software are built on top of PyTorch
including Tesla Autopilot, Uber’s Pyro, HuggingFace’s Tranformers, and PyTorch Lightening
& Catalyst.
From Research to Production: - An open source machine learning framework that
accelerates the path from research prototyping to production deployment. It is used for
Computer Vision and Natural Language Processing.
PyTorch provides 2 high-level features:
1) Tensor Computing (like NumPy) with strong acceleration via GPU
2) Deep Neural Networks built on a type-based automatic differentiation system
PyTorch Videos on:
 Installation of PyTorch for Deep Learning
 Understanding of Tensors: - A Tensor is a generalization of vectors and metrices,
and is easily understood as a multidimensional array. It is a term and set of
techniques known in machine learning in the training and operation of deep learning
models can be described in terms of tensors. In many cases, tensors are used as a
replacement for NumPy to use the power of GPUs.
Tensors are a type of data structure used in linear algebra, and like vectors and
metrices, you can calculate arithmetic operations with tensors.
 Back Propagation using PyTorch (Compute derivatives)
 Creating an ANN (Artificial Neural Network) using PyTorch
 Kaggle Advance House Price Prediction using PyTorch-Tabular Dataset
 How to use GPU to run PyTorch code

Wednesday:
 Revision of Python basics, Statistics, and machine learning algorithms.
 Installed Python package “covid” and done analysis of data by various methods and
functions.
 Hands-on experience with Pandas Profiling.
 Deep Learning Study (Neural Networks)

Thursday:
One-to-one discussion with Dr. Bajarang Mishra Sir on research paper: Discussed Artificial
Intelligence (AI) techniques: 1) Neural Network, 2) Fuzzy Logic, 3) Genetic Algorithms (GAs),
and 4) Hybrid method. I’ve done comparative study of all these techniques. I’ve also done
the comparative study of ANN (Artificial Neural Network), ANFIS (Adaptive neuro fuzzy
inference systems), CANFIS (Co-active neural fuzzy inference systems), and hybrid intelligent
systems.
Types of research papers: 1) Patent paper (topmost priority), 2) Transactions paper, 3)
Journal Paper, 4) International Conference paper.
Criteria for selection: 1) New and innovative ideas (priority), 2) Finding the best
(comparative study of technologies or algorithms), 3) Technical Review Paper (Crux of the
outcome of 20-25 research papers).
Top Publications: - 1) IEEE, US, 2) Elsevier, 3) Taylor & Francis
Most reputed Journal indexing services:
I. WOS (Web Of Science),
II. SCI (Science Citation Index)
a. ESCI (Emerging Source Citation Index)
b. SCIE(Science Citation Index Expanded)),
III. SCOPUS
Research Paper Study: searching of publishers, journals and topics on IEEE Xplore.

Friday:
Comparison of various machine learning algorithms:
I used Breast Cancer Winconsin (Diagnostic) Dataset for this task. The objective was to
predict whether the cancer is Benign or Malignant. I performed exploratory data analysis
(EDA) on this dataset and then compared the accuracy of various machine learning
algorithms. I found that Logistic Regression and KNN provided maximum accuracy while
Decision Tree and Naïve Bayes rendered lowest accuracy. The project was uploaded on my
GitHub profile and then it was posted on LinkedIn along with a video (screen recording of
the code) and a GitHub link (including tags of CureYa, Cureya Internship, all CureYa
individuals involved in this internship and related hashtags).
Student Signature: Vikas Gupta Date: 28/05/2021

Head Co-ordinator Signature: Date:

Instructions: After the completed report has been signed by both the student and Head-
coordinator, the head-coordinator shall scan the form to a pdf format and email it to the Director-
1 (bpmishra435@[Link]) of the company. Specific problems, concerns or suggestions from
either the student/head-coordinator should be emailed separately to the C. E. O. (info@[Link])
of the company.

Common questions

ANNs, ANFIS, CANFIS, and hybrid intelligent systems each offer unique approaches within AI. ANNs focus on learning from large datasets to detect patterns and make decisions . ANFIS combines neural networks with fuzzy logic for adaptive processing and learning . CANFIS integrates cooperative approaches into a fuzzy inference system, enabling enhanced adaptability and accuracy . Hybrid systems merge multiple techniques to leverage their respective strengths, often resulting in more robust performance across diverse applications .

Data visualization transforms complex data sets into user-friendly representations like charts and diagrams, aiding decision-making processes . QlikSense enhances this by connecting various data sources, offering data compression, in-memory performance improvement, and features like data associativity and machine learning integration. These capabilities provide users with a comprehensive yet accessible way to analyze and interpret data visually .

In the DAR structure, dashboards are used by higher-level management for critical data insights, analysis provides middle-level management with aggregated views and deeper insights, and reports offer low-level management detailed, transactional data records for operational purposes . This tiered approach ensures that information is delivered in a manner aligned with the decision-making needs of each management level .

The four main research paper types in AI are patent papers, transactions papers, journal papers, and international conference papers . When selecting a research article, important criteria include the novelty and innovation of ideas, the comparative analysis of technologies or algorithms, and technical reviews that synthesize outcomes from multiple studies .

PyTorch simplifies backpropagation with its dynamic computation graph, allowing for real-time adjustments and more natural error tracking during neural network training . This dynamic capability compares favorably with static graph frameworks by offering flexibility and ease of debugging, making PyTorch particularly effective for research and experimental setups in AI and ML compared to rigid frameworks .

PyTorch accelerates the shift from research to production through its open-source machine learning framework, supporting Computer Vision and Natural Language Processing tasks . Its key features include tensor computing with GPU acceleration and deep neural networks with automatic differentiation, enabling efficient experimentation and deployment in both academic and industrial settings .

PyTorch's tensor computing, which parallels NumPy operations but leverages GPU acceleration, significantly enhances deep learning model performance by enabling efficient and fast computation of large-scale tensors . GPU acceleration facilitates this by providing the computational power necessary to quickly process complex neural network operations, thereby accelerating model training and inference .

Embedding analytics into websites allows organizations to provide real-time data insights directly within their web platforms, enhancing decision-making and user engagement . QlikSense facilitates this by offering embedded analytics capabilities, supporting the integration of visualizations into web environments through its architecture, which includes open source chart libraries .

Vikas Gupta compared machine learning algorithms using the Breast Cancer Wisconsin (Diagnostic) Dataset by performing exploratory data analysis and assessing prediction accuracy. Logistic Regression and KNN yielded the highest accuracy for classifying cancer as either benign or malignant, whereas Decision Tree and Naïve Bayes rendered lower accuracy .

QlikSense provides connectors for big data sources, flat files, SQL databases, and APIs, allowing it to integrate and unify disparate data streams into a centralized platform . This diversity in connectivity supports its effectiveness as a Business Intelligence tool, enabling comprehensive data modeling and visualization from varied sources .

Data Science and Deep Learning Overview
No ratings yet
Data Science and Deep Learning Overview
36 pages
Python Data Analytics Essentials Guide
No ratings yet
Python Data Analytics Essentials Guide
5 pages
Data Analytics with Python Essentials
No ratings yet
Data Analytics with Python Essentials
10 pages
Data Science and AI Course Overview
100% (3)
Data Science and AI Course Overview
18 pages
Data Science Course Overview at GLA University
No ratings yet
Data Science Course Overview at GLA University
21 pages
Data Science Tools & Techniques Overview
No ratings yet
Data Science Tools & Techniques Overview
4 pages
Data Science Internship Overview
No ratings yet
Data Science Internship Overview
22 pages
Python for AI Workshop Guide
No ratings yet
Python for AI Workshop Guide
4 pages
3rd & Final Yr Engg Student Offering
No ratings yet
3rd & Final Yr Engg Student Offering
7 pages
Python's Impact on Data Science & AI
No ratings yet
Python's Impact on Data Science & AI
12 pages
ppt1 - Intro To Data Analytics and Visualization
No ratings yet
ppt1 - Intro To Data Analytics and Visualization
35 pages
Python's Role in Data Science & AI
No ratings yet
Python's Role in Data Science & AI
12 pages
Comprehensive Guide to Data Science
No ratings yet
Comprehensive Guide to Data Science
6 pages
Data Science Overview: Python & Visualization
No ratings yet
Data Science Overview: Python & Visualization
15 pages
Data Science Training Report: ML & AI
No ratings yet
Data Science Training Report: ML & AI
24 pages
Scikit-learn: Intro to ML Classifiers
No ratings yet
Scikit-learn: Intro to ML Classifiers
11 pages
Unit 1 DataScience
No ratings yet
Unit 1 DataScience
13 pages
Python Data Analysis Complete Notes
100% (1)
Python Data Analysis Complete Notes
3 pages
Efficient Data Analysis with Vaex
No ratings yet
Efficient Data Analysis with Vaex
41 pages
AI and DATA SCIENCE Full Stack With Gen AI & Agentic AI
No ratings yet
AI and DATA SCIENCE Full Stack With Gen AI & Agentic AI
12 pages
GenAI MBA Placement Notes
No ratings yet
GenAI MBA Placement Notes
13 pages
Data Analysis and Business Intelligence Insights
No ratings yet
Data Analysis and Business Intelligence Insights
20 pages
Python Data Exploration for Students
No ratings yet
Python Data Exploration for Students
28 pages
Essential Python Libraries for Data Science
No ratings yet
Essential Python Libraries for Data Science
12 pages
Data Science and Big Data Analytics
No ratings yet
Data Science and Big Data Analytics
110 pages
Deploying Python for Data Science
No ratings yet
Deploying Python for Data Science
7 pages
Python for Data Analysis Basics
100% (3)
Python for Data Analysis Basics
170 pages
Machine Learning and Data Science Course
No ratings yet
Machine Learning and Data Science Course
19 pages
Machine Learning Project Overview
No ratings yet
Machine Learning Project Overview
43 pages
Data Analytics and Reporting Overview
No ratings yet
Data Analytics and Reporting Overview
11 pages
DataAnalytics Units123 Notes
No ratings yet
DataAnalytics Units123 Notes
22 pages
NumPy and Pandas Learning Plan
No ratings yet
NumPy and Pandas Learning Plan
6 pages
Data Science & ML Course Overview
No ratings yet
Data Science & ML Course Overview
4 pages
AI, Data Science, and Python Overview
No ratings yet
AI, Data Science, and Python Overview
10 pages
Diya Robotics Course Review
No ratings yet
Diya Robotics Course Review
15 pages
Pandas & Scikit-Learn Lab Guide
No ratings yet
Pandas & Scikit-Learn Lab Guide
6 pages
Understanding Python Data Structures
No ratings yet
Understanding Python Data Structures
49 pages
Data Science Training at 3RI Technologies
100% (1)
Data Science Training at 3RI Technologies
33 pages
Data Science with Python Guide
No ratings yet
Data Science with Python Guide
149 pages
Unit21docx 2025 08 18 12 22 15
No ratings yet
Unit21docx 2025 08 18 12 22 15
18 pages
Data Science Internship Report Overview
No ratings yet
Data Science Internship Report Overview
25 pages
Data Science: Tools, Techniques, and Applications
No ratings yet
Data Science: Tools, Techniques, and Applications
4 pages
Comprehensive Data Analytics Course
No ratings yet
Comprehensive Data Analytics Course
13 pages
Instagram Reach Analysis with Python
No ratings yet
Instagram Reach Analysis with Python
66 pages
AIML Roadmap 6months
No ratings yet
AIML Roadmap 6months
10 pages
Data Analytics Internship at iPEC Solutions
No ratings yet
Data Analytics Internship at iPEC Solutions
42 pages
Deep Learning and Classification Course
No ratings yet
Deep Learning and Classification Course
34 pages
Comprehensive AI & Data Science Guide
No ratings yet
Comprehensive AI & Data Science Guide
5 pages
Machine Learning Workflow Guide
No ratings yet
Machine Learning Workflow Guide
7 pages
R vs Python: Key Differences Explained
No ratings yet
R vs Python: Key Differences Explained
29 pages
Data Science Training by 3RI Technologies
No ratings yet
Data Science Training by 3RI Technologies
33 pages
Power BI Data Preparation and Analysis Guide
No ratings yet
Power BI Data Preparation and Analysis Guide
15 pages
Python Library Functions Overview
No ratings yet
Python Library Functions Overview
12 pages
PGP in Data Science Curriculum Overview
No ratings yet
PGP in Data Science Curriculum Overview
17 pages
NISHTHA
No ratings yet
NISHTHA
10 pages
Summer Data Expert Intership Report
No ratings yet
Summer Data Expert Intership Report
44 pages
Python for Business Data Analysis
No ratings yet
Python for Business Data Analysis
11 pages
Data Extraction for Text Analysis Assignment
No ratings yet
Data Extraction for Text Analysis Assignment
4 pages
Python Assignment on Data Analysis
No ratings yet
Python Assignment on Data Analysis
3 pages
Machine Learning Internship Report 2016
No ratings yet
Machine Learning Internship Report 2016
7 pages
Data Science Internship Summary 2021
100% (1)
Data Science Internship Summary 2021
27 pages
AI, ML, Data Science Internship Report
No ratings yet
AI, ML, Data Science Internship Report
74 pages
Weekly Industrial Internship Report
No ratings yet
Weekly Industrial Internship Report
5 pages
Jovian.ml Internship Weekly Report
No ratings yet
Jovian.ml Internship Weekly Report
2 pages
Weekly Industrial Internship Report
No ratings yet
Weekly Industrial Internship Report
8 pages
Weekly Internship Report: Python Insights
No ratings yet
Weekly Internship Report: Python Insights
5 pages
Weekly Industrial Internship Report
No ratings yet
Weekly Industrial Internship Report
3 pages
7 Steps to Launch a Data Science Career
No ratings yet
7 Steps to Launch a Data Science Career
69 pages
Literature Study On Application of HEC H
No ratings yet
Literature Study On Application of HEC H
3 pages
A1SJ71QC24 (-R2) - User's Manual (Hardware) IB (NA) - 66686-B (08.98)
No ratings yet
A1SJ71QC24 (-R2) - User's Manual (Hardware) IB (NA) - 66686-B (08.98)
24 pages
WBS Codes in Microsoft Project 2019
100% (1)
WBS Codes in Microsoft Project 2019
4 pages
MCQs on Pandas DataFrame Basics
No ratings yet
MCQs on Pandas DataFrame Basics
11 pages
Ch. 13 Communications 4 - Parseval's Theorem 2025
No ratings yet
Ch. 13 Communications 4 - Parseval's Theorem 2025
14 pages
CSS Modules 1 To 9 50 Item Test With Answer Key
No ratings yet
CSS Modules 1 To 9 50 Item Test With Answer Key
3 pages
Data Analysis Techniques and Tools
No ratings yet
Data Analysis Techniques and Tools
30 pages
Excel Expert Guide: Macros & References
No ratings yet
Excel Expert Guide: Macros & References
7 pages
Software Engineering Course Overview
No ratings yet
Software Engineering Course Overview
206 pages
Using Formulas in Spreadsheets
No ratings yet
Using Formulas in Spreadsheets
7 pages
Automatic Switch Design for Arduino
No ratings yet
Automatic Switch Design for Arduino
8 pages
RESTful Web Services Tutorial
No ratings yet
RESTful Web Services Tutorial
13 pages
SAP Learning Pathways for Students
No ratings yet
SAP Learning Pathways for Students
23 pages
Tracing Pad For Kids
No ratings yet
Tracing Pad For Kids
32 pages
Sampling Distributions and CLT Overview
No ratings yet
Sampling Distributions and CLT Overview
13 pages
Baofeng UV-5RM Plus User Manual
100% (1)
Baofeng UV-5RM Plus User Manual
49 pages
Data Acquisition Methods in Archaeology
No ratings yet
Data Acquisition Methods in Archaeology
14 pages
Library Management System Project
No ratings yet
Library Management System Project
20 pages
Reducing Overdue IT Tickets by 50%
No ratings yet
Reducing Overdue IT Tickets by 50%
5 pages
2SB1481 PNP Transistor Datasheet
No ratings yet
2SB1481 PNP Transistor Datasheet
4 pages
Class XII IT Practical Exam Guide
No ratings yet
Class XII IT Practical Exam Guide
2 pages
KFC Purchase Process Overview
No ratings yet
KFC Purchase Process Overview
15 pages
Upload Documents to Access Content
No ratings yet
Upload Documents to Access Content
4 pages
Drowsy Driving Detection System Design
100% (1)
Drowsy Driving Detection System Design
4 pages
IIM-A Exclusive Apple Discounts & Cashbacks
No ratings yet
IIM-A Exclusive Apple Discounts & Cashbacks
11 pages
EIGRP Static Route IP SLA Issue Resolved
No ratings yet
EIGRP Static Route IP SLA Issue Resolved
11 pages
Present CANS
No ratings yet
Present CANS
71 pages
Requirement Analysis for Industrial Project
No ratings yet
Requirement Analysis for Industrial Project
20 pages
Advanced Progressive Scan: Operating Instructions
No ratings yet
Advanced Progressive Scan: Operating Instructions
40 pages
Class 12 AI Model Question Paper 2024-25
No ratings yet
Class 12 AI Model Question Paper 2024-25
6 pages

Week 5 Industrial Internship Report

Uploaded by

Week 5 Industrial Internship Report

Uploaded by

INDUSTRIAL INTERNSHIP

WEEKLY PERFORMANCE REPORT (WPR)

Student Name: Vikas Gupta

Week-5 (24 May 28, 2021 to 28 May 2021)

Head Co-ordinator Signature: Date:

Common questions

What makes Artificial Neural Networks (ANNs), ANFIS, CANFIS, and hybrid intelligent systems unique in AI research?

What is the significance of data visualization in business intelligence, and how does QlikSense facilitate this process?

Discuss the roles of different management levels in utilizing the DAR structure of dashboards, analysis, and reports in QlikSense.

What are the different types of research papers in AI, and what criteria are important for selecting a research article?

What is the benefit of using PyTorch for backpropagation and creating artificial neural networks, and how does it compare to other frameworks?

How does PyTorch facilitate both research and production in AI, and what are its key features?

How does PyTorch's tensor computing enhance the performance of deep learning models, and what role does GPU acceleration play?

How can embedding analytics into websites be beneficial for organizations, and what role does QlikSense play in this process?

In the context of machine learning, how did Vikas Gupta compare the effectiveness of different algorithms for cancer prediction?

What types of connectors does QlikSense provide for data integration, and how do these contribute to its effectiveness as a BI tool?

You might also like