Salary and Experience Data Analysis

Uploaded by

ngak1214

Perform the following operations using Python on the dataset Salary_Data.csv (Experience, Salary):

1. Print all data from Salary_Data.csv.
2. Find the empty cells in Salary_Data.csv.
3. Count the missing values in the column Experience.
4. List the descriptive statistics for the given dataset.
5. Reset the default index of the given dataset.
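
The five Salary_Data.csv operations can be sketched with pandas as follows. This is a minimal sketch rather than a reference solution: the four sample rows are made up so the block runs without the file, and resetting the index after dropping incomplete rows is only one assumed reading of task 5.

```python
import io

import numpy as np
import pandas as pd

# Hypothetical stand-in for Salary_Data.csv (the real file is not shown here).
csv_text = """Experience,Salary
1.1,39343
,46205
2.0,
3.2,54445
"""
df = pd.read_csv(io.StringIO(csv_text))

# 1. Print all data (to_string avoids truncation of long frames).
print(df.to_string())

# 2. Find the empty cells as (row label, column name) pairs.
mask = df.isna()
rows, cols = np.where(mask.to_numpy())
empty_cells = [(int(r), df.columns[c]) for r, c in zip(rows, cols)]

# 3. Count the missing values in the Experience column.
missing_experience = df["Experience"].isna().sum()

# 4. Descriptive statistics (count, mean, std, min, quartiles, max).
stats = df.describe()

# 5. Reset the index; shown after dropping incomplete rows (an assumption),
#    so that the effect of the reset is visible.
df_reset = df.dropna().reset_index(drop=True)
```

With the real file, the `io.StringIO(csv_text)` argument would be replaced by the path to Salary_Data.csv.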

Create a DataFrame Product (Product_Name, Price) with some missing values and perform the following operations on it:

1. Count the missing values in the column Product_Name.
2. Count the missing values in the entire DataFrame Product.
3. Count the missing values in each row.
4. Count the missing values in the row with index 7.
5. Remove the duplicates in the column Product_Name.
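
One possible way to approach the Product operations with pandas is sketched below. Only the column names come from the exercise; the nine sample rows are hypothetical and were chosen so that a row with index 7 exists.

```python
import numpy as np
import pandas as pd

# Hypothetical sample values; only the column names come from the exercise.
product = pd.DataFrame({
    "Product_Name": ["Pen", np.nan, "Book", "Pen", np.nan, "Bag", "Book", np.nan, "Ink"],
    "Price": [10.0, 20.0, np.nan, 10.0, 15.0, np.nan, 40.0, np.nan, 25.0],
})

# 1. Missing values in one column.
missing_in_name = product["Product_Name"].isna().sum()

# 2. Missing values in the whole DataFrame.
missing_total = product.isna().sum().sum()

# 3. Missing values in each row (axis=1 sums across the columns).
missing_per_row = product.isna().sum(axis=1)

# 4. Missing values in the row whose index label is 7.
missing_row_7 = product.loc[7].isna().sum()

# 5. Drop rows that repeat an earlier Product_Name (the first occurrence is kept).
deduplicated = product.drop_duplicates(subset="Product_Name")
```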

Create a pandas DataFrame with two columns (Value1 and Value2) containing some numeric and some categorical values, and perform the following operations on it:

1. Convert all values in the DataFrame to float format and print it.
2. Drop all rows with NaN values from the DataFrame.
3. Replace the NaN values with 0.
4. Transpose the given DataFrame.
5. Rename the default index to X, Y, Z and then transpose the DataFrame.
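
A minimal sketch of the Value1/Value2 operations, assuming three hypothetical rows and assuming that categorical entries that cannot be parsed as numbers should become NaN during the float conversion:

```python
import pandas as pd

# Hypothetical sample: three rows mixing numeric and categorical (string) entries.
df = pd.DataFrame({
    "Value1": ["1.5", "abc", "3"],
    "Value2": ["xyz", "4", "5.25"],
})

# 1. Convert to float; unparseable entries are coerced to NaN (an assumption).
as_float = df.apply(pd.to_numeric, errors="coerce").astype(float)
print(as_float)

# 2. Drop every row that contains at least one NaN.
dropped = as_float.dropna()

# 3. Replace NaN with 0.
filled = as_float.fillna(0)

# 4. Transpose: rows become columns and vice versa.
transposed = as_float.T

# 5. Rename the default index 0, 1, 2 to X, Y, Z, then transpose.
renamed_t = as_float.rename(index={0: "X", 1: "Y", 2: "Z"}).T
```
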
Perform the following operations on the Iris data set:
1. Apply the Standard Scaler and the MinMax Scaler to the Iris data set.
2. Scale the data to the range 5 to 10 using the MinMax Scaler on the Iris data set.
3. Write Python code for outlier detection using the Z-score.
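
The scaling and Z-score tasks could be sketched with scikit-learn as below. The sketch assumes the copy of Iris bundled with scikit-learn (`load_iris`) rather than a CSV file, and it assumes the common cutoff of |z| > 3 for flagging outliers; the exercise itself does not fix a threshold.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.preprocessing import MinMaxScaler, StandardScaler

X = load_iris().data  # 150 samples x 4 numeric features

# 1. Standard scaling (zero mean, unit variance) and min-max scaling to [0, 1].
standardized = StandardScaler().fit_transform(X)
unit_range = MinMaxScaler().fit_transform(X)

# 2. Min-max scaling to the range [5, 10].
range_5_10 = MinMaxScaler(feature_range=(5, 10)).fit_transform(X)

# 3. Z-score outlier detection: flag rows where any feature lies more than
#    THRESHOLD standard deviations from its column mean (threshold is assumed).
THRESHOLD = 3
z = (X - X.mean(axis=0)) / X.std(axis=0)
outlier_rows = np.where((np.abs(z) > THRESHOLD).any(axis=1))[0]
print("Rows flagged as outliers:", outlier_rows)
```

The manually computed `z` matches the output of `StandardScaler`, since both use the population standard deviation.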

1. Perform the following operations on the [Link] data set:
   1. Display all descriptive statistics of the mtcars data set.
   2. Get the mean, median and mode of each column of the mtcars data set.
   3. Get the mean of each row of the mtcars data set.
2. Write a Python program to display some basic statistical details, such as mean and standard deviation, of the species ‘Iris-setosa’, ‘Iris-versicolor’ and ‘Iris-virginica’ of the [Link] dataset.
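
A rough pandas sketch of both parts follows. The two small frames are illustrative stand-ins, not the real mtcars or Iris files: the column names `mpg`, `cyl`, `hp`, `species` and `petal_length` and all values are assumptions made so the block is self-contained.

```python
import pandas as pd

# Illustrative rows only; with the real file, load it via pd.read_csv instead.
cars = pd.DataFrame({
    "mpg": [21.0, 22.8, 21.0, 18.0],
    "cyl": [6, 4, 6, 6],
    "hp": [110, 93, 110, 105],
})

summary = cars.describe()        # descriptive statistics of every column
col_mean = cars.mean()           # mean of each column
col_median = cars.median()       # median of each column
col_mode = cars.mode().iloc[0]   # first mode of each column
row_mean = cars.mean(axis=1)     # mean of each row

# Hypothetical Iris excerpt with one numeric feature.
iris = pd.DataFrame({
    "species": ["Iris-setosa", "Iris-setosa", "Iris-versicolor",
                "Iris-versicolor", "Iris-virginica", "Iris-virginica"],
    "petal_length": [1.4, 1.6, 4.5, 4.7, 5.9, 6.1],
})

# Count, mean, standard deviation, min, quartiles and max per species.
per_species = iris.groupby("species")["petal_length"].describe()
print(per_species)
```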

Create a linear regression model using Python to predict home prices using the Boston Housing dataset.

1. Implement logistic regression using Python to perform classification on the Social_Network_Ads.csv dataset.
2. Compute the confusion matrix to find TP, FP, TN, FN, accuracy, error rate, precision and recall on the given dataset.
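
The confusion-matrix quantities can be illustrated independently of any particular classifier. The sketch below uses two short, hypothetical label lists (1 = positive, 0 = negative) in place of real model predictions on Social_Network_Ads.csv, and computes each metric from its definition.

```python
# Hypothetical true labels and predictions; not taken from any real dataset.
y_true = [1, 0, 1, 1, 0, 0, 1, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0, 1, 0]

pairs = list(zip(y_true, y_pred))
tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

total = tp + tn + fp + fn
accuracy = (tp + tn) / total    # share of correct predictions
error_rate = (fp + fn) / total  # share of wrong predictions
precision = tp / (tp + fp)      # predicted positives that are truly positive
recall = tp / (tp + fn)         # actual positives that were found

print(f"TP={tp} FP={fp} TN={tn} FN={fn}")
print(f"accuracy={accuracy} error_rate={error_rate} "
      f"precision={precision} recall={recall}")
```

For a fitted scikit-learn classifier, the same four counts can be obtained with `sklearn.metrics.confusion_matrix(y_true, y_pred).ravel()`, which returns them in the order TN, FP, FN, TP for binary labels.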

1. Implement the simple Naïve Bayes classification algorithm using Python on the [Link] dataset.
2. Compute the confusion matrix to find TP, FP, TN, FN, accuracy, error rate, precision and recall on the given dataset.

1. Extract a sample document and apply the following document preprocessing methods:
   • Tokenization
   • POS tagging
   • Stop-word removal
   • Stemming
   • Lemmatization
2. Create a representation of the document by calculating Term Frequency and Inverse Document Frequency.
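
For the Term Frequency and Inverse Document Frequency step, a small standard-library sketch is given below. It assumes a hypothetical three-sentence corpus, plain whitespace tokenization, and the unsmoothed definition idf(t) = ln(N / df(t)); the function names are illustrative.

```python
import math
from collections import Counter

# Hypothetical corpus; whitespace tokenization keeps the sketch short.
docs = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "cats and dogs",
]
tokenized = [doc.lower().split() for doc in docs]


def term_frequency(tokens):
    """Relative frequency of each term within one document."""
    counts = Counter(tokens)
    return {term: n / len(tokens) for term, n in counts.items()}


def inverse_document_frequency(corpus):
    """ln(N / df) for each term, where df counts documents containing it."""
    n_docs = len(corpus)
    doc_freq = Counter(term for tokens in corpus for term in set(tokens))
    return {term: math.log(n_docs / df) for term, df in doc_freq.items()}


idf = inverse_document_frequency(tokenized)
tfidf = [
    {term: tf * idf[term] for term, tf in term_frequency(tokens).items()}
    for tokens in tokenized
]

for doc, weights in zip(docs, tfidf):
    print(doc, "->", {t: round(w, 3) for t, w in weights.items()})
```

Libraries such as scikit-learn use a smoothed IDF variant by default, so their weights will differ numerically from this sketch.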

• Use the built-in dataset 'titanic', which contains information about the passengers who boarded the unfortunate Titanic ship.
• Write code to check how the ticket price (column name: 'fare') of each passenger is distributed by plotting a histogram with and without kernel density estimation.

Use the built-in dataset 'titanic' and plot a box plot of the distribution of age for each gender, along with the information about whether the passengers survived or not (column names: 'sex' and 'age').

Perform the following operations on the [Link] data set:

1. List the features and their types (e.g., numeric, nominal) available in the dataset.
2. Create a histogram for each feature in the dataset to illustrate the feature distributions.
3. Create a box plot for each feature in the dataset.
4. Compare the distributions and identify outliers.

Write code in Java for a simple Word Count application that counts the number of occurrences of each word in a given input set using the Hadoop MapReduce framework on a local standalone setup.

Design a distributed application using MapReduce which processes a log file of a system.

Locate a dataset (e.g., sample_weather.txt) for working with weather data, and write a program that reads the text input files and finds the average temperature, dew point and wind speed.
