Uploaded by

Somyajit Chakraborty

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

81 views

Project Amazon Sales Data Analysis

Uploaded by

Somyajit Chakraborty

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

Project Amazon Sales

Data Analysis
Objective
● Development of a predictive model for predicting sales.
● Perform ETL (Extract-Transform-Load) on dataset.
● Develop dashboard by using tableau.

Beneﬁts
● Better understand and optimise revenue generation in future
● Maximize forecasting accuracy
● Make current sales experience our top priority
Architecture
Data Preprocessing:
● Importing necessary libraries for data analysis such as : Pandas, Numpy, Matplotlib &
Seaborn etc.
● Using pd.read_csv() function stores the data in pandas dataframe named data.
● Using data.column showing columns present in dataframe.
● info() function show basic information of dataframe like null value count of each
column and their data type
● Changing the data type of different column for model training and analysis
● Using describe function on dataframe for getting basic stats of numerical dataset
● Adding extra column to dataframe which contain only month, year and month with
year
● Using isnull().sum() checking out total null value in all the column of dataframe
Exploratory data Analysis
Checking Outliers in the dataframe by using Box Plot

● Box Plot for Total Proﬁt : Here we detect outliers in the speciﬁed column using
the Z-score method and found 7 outliers.
● Box Plot of Total Cost : found 5 outliers in Total Cost column

● Box Plot of Total Revenue : Found 6 outliers in Total Revenue column

● Creating a bar chart for Total Revenue and Order Month : where it
showcases the number of order purchased in particular month.

● Calculating the total revenue for each group with respect to Item Type
and then sorting then in descending order.
● Calculating the total proﬁt for each group with respect to Item Type
and then sorting them in descending order.
● Calculating correlation of 'Total Revenue', 'Total Cost' and 'Total Proﬁt'
columns present in dataframe.
Predictive Analytics :
● Label Encoding of Item Type, Sales Channel and Order Priority for model
training.
● Dropping columns Region, Country, Order Date MonthYear, Order ID and
Ship Date.

Pycaret library :
● PyCaret is an open-source, low-code machine learning library in Python.
● Allows users to quickly and easily build, compare, and deploy machine
learning models on structured and tabular data.
● Reduce the amount of code needed to build a model.
● It provides preprocessing and feature engineering functions.
● Automatic model selection and hyperparameter tuning.
● Support for a wide range of machine learning algorithms
● Plotting residuals for Lasso Least Angle Regression based trained model

● Plotting prediction error plot for Lasso Least Angle Regression based trained
model
Implementation of Linear Regression
● Selecting the independent variables and target variable.
● Splitting the data into training and testing datasets.
● Standardizing the dataset.
● Performing fit transform on X_train dataframe.
● Performing fit transform on X_test dataframe.
● Applying Linear Regression on X_train and y_train.
● Calculating mean squared error.
● Creating kernel density estimate plot
● Plotting the predicted values against the actual values to visualize
how well the model is fitting the data.

Essential n8n Playbook
From Everand
Essential n8n Playbook
Leandro Calado
No ratings yet
Agricultural Experimentation
80% (5)
Agricultural Experimentation
354 pages
Supermarket Sales Analysis and Prediction
100% (1)
Supermarket Sales Analysis and Prediction
34 pages
Big Mart Sales Analysis
No ratings yet
Big Mart Sales Analysis
3 pages
Data Analysis On BigMart Sales
67% (3)
Data Analysis On BigMart Sales
17 pages
Analyzing Sales Data
No ratings yet
Analyzing Sales Data
11 pages
Data Analysis
No ratings yet
Data Analysis
4 pages
Supermart Grocery Sales - Retail Analytics Dataset - (Data Analyst)
No ratings yet
Supermart Grocery Sales - Retail Analytics Dataset - (Data Analyst)
17 pages
Analytical project using python BMBA-252
No ratings yet
Analytical project using python BMBA-252
4 pages
Sales Forecasting Project Detailed
No ratings yet
Sales Forecasting Project Detailed
12 pages
Cours 3 - TP
No ratings yet
Cours 3 - TP
3 pages
Applied Datascience - Phase3
No ratings yet
Applied Datascience - Phase3
8 pages
Case Study Reportf
No ratings yet
Case Study Reportf
6 pages
DS-Food
No ratings yet
DS-Food
23 pages
Deep Learning Assignments
No ratings yet
Deep Learning Assignments
13 pages
IIM PBA Assignment 2
No ratings yet
IIM PBA Assignment 2
3 pages
Case Study Reportf
No ratings yet
Case Study Reportf
6 pages
Business intelligent
No ratings yet
Business intelligent
20 pages
Python - Data Analysis
No ratings yet
Python - Data Analysis
11 pages
Implementation (Raw)
No ratings yet
Implementation (Raw)
12 pages
A Real World Scenario Solution using pandas
No ratings yet
A Real World Scenario Solution using pandas
3 pages
Ex 5.1 Customer Behaviour Prediction
No ratings yet
Ex 5.1 Customer Behaviour Prediction
8 pages
Solution
No ratings yet
Solution
4 pages
Supermarket Sales Analysis Project
No ratings yet
Supermarket Sales Analysis Project
8 pages
HET ka FML
No ratings yet
HET ka FML
13 pages
1july Presentation
No ratings yet
1july Presentation
18 pages
Coffee Sales - (Data Analyst)
No ratings yet
Coffee Sales - (Data Analyst)
31 pages
DOC-20250118-WA0002.
No ratings yet
DOC-20250118-WA0002.
4 pages
Revenue Predictor - Udit Ennam PDF
No ratings yet
Revenue Predictor - Udit Ennam PDF
30 pages
Lab08 ML
No ratings yet
Lab08 ML
6 pages
Bigmart Sales Solution Methodology
No ratings yet
Bigmart Sales Solution Methodology
5 pages
Identifying Columns with Missing Values
No ratings yet
Identifying Columns with Missing Values
4 pages
Supermarket Sales Data analysis
No ratings yet
Supermarket Sales Data analysis
6 pages
Ex4.1 Walmart Forecasting
No ratings yet
Ex4.1 Walmart Forecasting
7 pages
Excel To Pandas Advanced Data Techniques For BI Devs 1729266352
No ratings yet
Excel To Pandas Advanced Data Techniques For BI Devs 1729266352
9 pages
Advance Data Analytics ASSIGNMENT
No ratings yet
Advance Data Analytics ASSIGNMENT
10 pages
Big Mart Sales Analysis
No ratings yet
Big Mart Sales Analysis
3 pages
B M Sale Analysis
No ratings yet
B M Sale Analysis
3 pages
Price Opti Medium Code
No ratings yet
Price Opti Medium Code
15 pages
Ids Case Study
No ratings yet
Ids Case Study
15 pages
Final
No ratings yet
Final
2 pages
part2
No ratings yet
part2
21 pages
profitanalysis
No ratings yet
profitanalysis
18 pages
Sales Prediction Using Regression Analysis: Problem Statement
No ratings yet
Sales Prediction Using Regression Analysis: Problem Statement
3 pages
AML Assignment 1 1
No ratings yet
AML Assignment 1 1
4 pages
Each Stage of A Data Mining Project
No ratings yet
Each Stage of A Data Mining Project
5 pages
Linear Regression Report
No ratings yet
Linear Regression Report
3 pages
BS MINI PROJECT 2
No ratings yet
BS MINI PROJECT 2
5 pages
249 PRJ
No ratings yet
249 PRJ
31 pages
RITHIKA CONTENT
No ratings yet
RITHIKA CONTENT
25 pages
PRJ Sales Forecasting
No ratings yet
PRJ Sales Forecasting
22 pages
SalesMgmtSystem XII IP Projectreport 2022 23
No ratings yet
SalesMgmtSystem XII IP Projectreport 2022 23
18 pages
Main Phase 3 Dharani (1)
No ratings yet
Main Phase 3 Dharani (1)
19 pages
BIDA practical print
No ratings yet
BIDA practical print
56 pages
Sales Prediction For Big Mart 3.0.pptx MM
No ratings yet
Sales Prediction For Big Mart 3.0.pptx MM
25 pages
Phase 3 (2)
No ratings yet
Phase 3 (2)
19 pages
dk-phase2 (1)
No ratings yet
dk-phase2 (1)
5 pages
Articles Xgboost Classification With Smote-Enn Algorithm
No ratings yet
Articles Xgboost Classification With Smote-Enn Algorithm
11 pages
A project based on Python
No ratings yet
A project based on Python
17 pages
Oe Cae 3
No ratings yet
Oe Cae 3
7 pages
Manufacturing: Engineering, Management and Marketing
From Everand
Manufacturing: Engineering, Management and Marketing
S.O.T Ogaji
No ratings yet
IndBET Master FB
No ratings yet
IndBET Master FB
111 pages
Label 2
No ratings yet
Label 2
4 pages
Machine Learning 2M&10M Qpaper
No ratings yet
Machine Learning 2M&10M Qpaper
3 pages
Project 03: Data Fitting Applied Mathematics and Statistics For Information Technology
No ratings yet
Project 03: Data Fitting Applied Mathematics and Statistics For Information Technology
17 pages
Probability and Statistics- Book(Dr Hari Arora)
100% (3)
Probability and Statistics- Book(Dr Hari Arora)
473 pages
The LATE Theorem
No ratings yet
The LATE Theorem
14 pages
Csit (r22) 3-2 Machine Learning Digital Notes
No ratings yet
Csit (r22) 3-2 Machine Learning Digital Notes
120 pages
ashageri assignment
No ratings yet
ashageri assignment
13 pages
1) Download the binary classification dataset for... - Colab
No ratings yet
1) Download the binary classification dataset for... - Colab
6 pages
Machine Learning Based Advanced Crime Prediction and Analysis
No ratings yet
Machine Learning Based Advanced Crime Prediction and Analysis
7 pages
Defining Model 1 (Null Model) With PASW Menu Commands: Models: Specified Subjects and Repeated
No ratings yet
Defining Model 1 (Null Model) With PASW Menu Commands: Models: Specified Subjects and Repeated
18 pages
Objective 6: Define What A Positive, Negative, and Zero Correlation Is
No ratings yet
Objective 6: Define What A Positive, Negative, and Zero Correlation Is
2 pages
Data Analytics Unit III
No ratings yet
Data Analytics Unit III
15 pages
Manova
No ratings yet
Manova
15 pages
Chapter-15: Research Methodology
No ratings yet
Chapter-15: Research Methodology
25 pages
Chapter 3
No ratings yet
Chapter 3
23 pages
Test bank 6-10
No ratings yet
Test bank 6-10
64 pages
Regression 3: Medical Supplies Costs A + (B: Summary Output
No ratings yet
Regression 3: Medical Supplies Costs A + (B: Summary Output
2 pages
Temperature Price of Ice Cream Units Sold
No ratings yet
Temperature Price of Ice Cream Units Sold
14 pages
Regression Models for Categorical Dependent Variables Using Stata 3rd Edition J. Scott Long download
100% (1)
Regression Models for Categorical Dependent Variables Using Stata 3rd Edition J. Scott Long download
55 pages
Econo Mid-Term Exam
No ratings yet
Econo Mid-Term Exam
4 pages
Machine Learning Viva Questions
No ratings yet
Machine Learning Viva Questions
6 pages
unit-5-ad3491-fundamentals-of-data-science-unit-5-notes (1)
No ratings yet
unit-5-ad3491-fundamentals-of-data-science-unit-5-notes (1)
24 pages
Prediction of COVID-19 Possibilities Using KNeares
No ratings yet
Prediction of COVID-19 Possibilities Using KNeares
9 pages
Online Polynomial Regression: Regressiontools: The Program
No ratings yet
Online Polynomial Regression: Regressiontools: The Program
2 pages
Answer Sheet Group Number: - 6 - Names of Team Members: - Hongru Bi, Yangxi Gan, Tara Kelley, Jie Xu Data Set - Arlhomes1
No ratings yet
Answer Sheet Group Number: - 6 - Names of Team Members: - Hongru Bi, Yangxi Gan, Tara Kelley, Jie Xu Data Set - Arlhomes1
6 pages
Regression and Correlations: Rank Correlation
No ratings yet
Regression and Correlations: Rank Correlation
61 pages
Arima Slide Share
No ratings yet
Arima Slide Share
65 pages
Lesson 3 Logistic Regression Diagnostics
No ratings yet
Lesson 3 Logistic Regression Diagnostics
37 pages

Uploaded by

Uploaded by

Project Amazon Sales

● Box Plot of Total Revenue : Found 6 outliers in Total Revenue column

You might also like