0% found this document useful (0 votes)

49 views

Major Project

This project uses machine learning to predict flight prices based on parameters like stops, dates, airlines, and locations. Random forest regression is used to train a model on historical flight data, and hyperparameters are tuned. The model is then deployed using Flask to create a web app that allows users to input features and receive predicted prices.

Uploaded by

RISHABH GIRI

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

49 views

Major Project

Uploaded by

RISHABH GIRI

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 17

Project Name:

FLIGHT FARE PREDICTION

NAME: RISHABH GIRI (1906191)

NAME : RITIK SINGH (1906193)
NAME : SEJAL BARNWAL (1906207)
NAME : AKRITI SINHA (1906010)
ABOUT PROJECT

This model predicts the price of the flight based on some parameters like
total stops, journey Day, journey month, Air India, Indigo, source,
destination, etc. I have trained this model using the random forest
regressor and after training, fine-tune the model which is also known as
hyper parameter tuning. Then save a model and deploy this Flight Fare
Prediction model using the Flask application on the localhost.
OVERVIEWS
We have 2 datasets here — training set and test set.

The training set contains the features, along with the prices of the flights. It
contains 10683 records, 10 input features and 1 output column — ‘Price’.

The test set contains 2671 records and 10 input features. The output ‘Price’
column needs to be predicted in this set. We will use Regression techniques here,
since the predicted output will be a continuous value.

Following is the features available in the dataset – Airline, Date_of_Journey,

Source, Destination, Route, Dep_Time, Arrival_Time ,Duration, Total_Stops,
Additional_Info, Price.
01
PYTHON
Language use

02
Jupyter Notebook
Platform Used
Technology Used
In Project

03
Machine Learning
Algorithm

04
FLASK FRAMEWORK
PYTHON
Diffrent Type Of Process

1) Install Jupyter Notebook : Ide where we used to code

2) Install liberary : Install all the important python liberary.

Tools:-

Pandas- This library is used for data analysis.

NumPy-It is used for mathematical calculations.

Diffrent Type Of Process

Seaborn/Matplotlib- It is used for data visualization.

Scikit-learn- It isused to train validate and test our ML model.
XGBoost-used in supervised learning(regression and
classification problems).

3) Dataset : Download a dataset from kaggle website.

Diffrent Type Of Process

CLEAN DATASET: Delete unnecessary data from dataset

1. Missing Values in the dataset.

2. All the Numerical variables and Distribution of the numerical
variables
3. Categorical Variables
4. Outliers
5. Relationship between an independent and dependent feature(price)
Diffrent Type Of Process
5) Perform EDA

From description we can see that Date_of_Journey is a object

data type,
Therefore, we have to convert this datatype into timestamp so as
to use this column properly for prediction

For this we require pandas to_datetime to convert object data

type to datetime dtype.
.dt.day method will extract only day of that date
.dt.month method will extract only month of that date
AFTER CONVERTING
Diffrent Type Of Process

6) Feature Engineering : We add ,delete and combine the

dataset for better performance.
To prepare proper input data so that it is compatible with ML
algorithm.
List of Feature Engineering Techniques:-
Encoding
Grouping Operations
Feature Split
Diffrent Type Of Process
7) Feature Selection : Where we find the corelation value through heat map.
Diffrent Type Of Process
Fitting model using Random Forest

1. Split dataset into train and test set in order to

prediction w.r.t X_test
2. If needed do scaling of data
Scaling is not done in Random forest
3. Import model
4. Fit the data
5. Predict w.r.t X_test
6. In regression check RSME Score
7. Plot graph
Checking accuracy of the model:

Evaluating the model accuracy is an essential part of

the process of creating machine learning models to describe
how well the model is performing in its predictions. The MSE,
MAE, and RMSE metrics are mainly used to evaluate the prediction
error rates and model performance in regression analysis.

• MAE (Mean absolute error)

• MSE (Mean Squared Error)
• RMSE (Root Mean Squared Error)
Model Deployment
Model Deployment is one of the last stages of any machine learning project. Here, we will
design a user interface. we used a flask to make an HTML file for flight price prediction. this will
take the input value for each feature and calculate the price for a flight as shown in the image
below.
THANK YOU

Amity Online MCA QA-5
No ratings yet
Amity Online MCA QA-5
18 pages
cz4041 Project Final Report Nyc Taxi Fare Prediction
0% (1)
cz4041 Project Final Report Nyc Taxi Fare Prediction
18 pages
Flight Fare
No ratings yet
Flight Fare
15 pages
Data Analysis Syllabus
No ratings yet
Data Analysis Syllabus
6 pages
Meta
No ratings yet
Meta
21 pages
Flight Price Prediction Report
No ratings yet
Flight Price Prediction Report
18 pages
Ict Project Report (1)[1]
No ratings yet
Ict Project Report (1)[1]
14 pages
Flight Fare Prediction: Project Report
No ratings yet
Flight Fare Prediction: Project Report
38 pages
Easychair Preprint: Vinod Kimbhaune, Harshil Donga, Asutosh Trivedi, Sonam Mahajan and Viraj Mahajan
No ratings yet
Easychair Preprint: Vinod Kimbhaune, Harshil Donga, Asutosh Trivedi, Sonam Mahajan and Viraj Mahajan
5 pages
Fligh Price Paper
No ratings yet
Fligh Price Paper
6 pages
Airfare Synopsis
No ratings yet
Airfare Synopsis
6 pages
Presentation Learbnbay - Flight Fare Prediction
No ratings yet
Presentation Learbnbay - Flight Fare Prediction
15 pages
EE5253 2023 Paper Group35
No ratings yet
EE5253 2023 Paper Group35
5 pages
Flight Price Predection 2
No ratings yet
Flight Price Predection 2
6 pages
Airplane Final
No ratings yet
Airplane Final
23 pages
47.epra Journals 14763
No ratings yet
47.epra Journals 14763
6 pages
frmCourseSyllabusIPDownload (2)
No ratings yet
frmCourseSyllabusIPDownload (2)
3 pages
Report_1
No ratings yet
Report_1
11 pages
Flight Fare Predictor
No ratings yet
Flight Fare Predictor
21 pages
Thesis Defense
No ratings yet
Thesis Defense
25 pages
ML Ex 5
No ratings yet
ML Ex 5
6 pages
Prediction of Flight-Fare Using Machine Learning
No ratings yet
Prediction of Flight-Fare Using Machine Learning
6 pages
Case Study 219302405
No ratings yet
Case Study 219302405
14 pages
Winter Report
No ratings yet
Winter Report
82 pages
Cab Fare Prediction Report by Abhinav Jha
No ratings yet
Cab Fare Prediction Report by Abhinav Jha
41 pages
Car-price-prediction (1)
No ratings yet
Car-price-prediction (1)
42 pages
PA DA1
No ratings yet
PA DA1
17 pages
Paper 90
No ratings yet
Paper 90
7 pages
Untitled document
No ratings yet
Untitled document
5 pages
Prediction of Flight-Fare Using Machine Learning
No ratings yet
Prediction of Flight-Fare Using Machine Learning
6 pages
Session 4 Machine Learning Process (1)
No ratings yet
Session 4 Machine Learning Process (1)
28 pages
ML LAB
No ratings yet
ML LAB
23 pages
Project PPT 1
No ratings yet
Project PPT 1
16 pages
Uber Data Analysis
No ratings yet
Uber Data Analysis
22 pages
models
No ratings yet
models
5 pages
LAB MANUAL For Machine Learning
No ratings yet
LAB MANUAL For Machine Learning
15 pages
House Price Prediction Using Machine Learning Techniques
No ratings yet
House Price Prediction Using Machine Learning Techniques
5 pages
House Price Prediction Using Machine Learning Techniques
No ratings yet
House Price Prediction Using Machine Learning Techniques
5 pages
House Report
No ratings yet
House Report
26 pages
Presentation On Flight Price Prediction
No ratings yet
Presentation On Flight Price Prediction
30 pages
Predicting The Price of Airline Tickets
No ratings yet
Predicting The Price of Airline Tickets
30 pages
Machine learning lab manual
No ratings yet
Machine learning lab manual
22 pages
SML
No ratings yet
SML
8 pages
Flight Price Prediction Using Machine Learning Report
No ratings yet
Flight Price Prediction Using Machine Learning Report
58 pages
Lecture02. ML Pipeline (Chapter 2)
No ratings yet
Lecture02. ML Pipeline (Chapter 2)
50 pages
Flight Booking
No ratings yet
Flight Booking
25 pages
1-Flight Booking
No ratings yet
1-Flight Booking
25 pages
33358_Report
No ratings yet
33358_Report
31 pages
mlfile
No ratings yet
mlfile
33 pages
Dse4 Stug082
No ratings yet
Dse4 Stug082
43 pages
Batch 7 F
No ratings yet
Batch 7 F
15 pages
Unit 5
No ratings yet
Unit 5
18 pages
Flight Ticket Price Predicting With The
No ratings yet
Flight Ticket Price Predicting With The
4 pages
ML MANUAL
No ratings yet
ML MANUAL
24 pages
Python
No ratings yet
Python
4 pages
Predictive_Analysis_of_Taxi_Fare_using_M
No ratings yet
Predictive_Analysis_of_Taxi_Fare_using_M
6 pages
Final 1
No ratings yet
Final 1
6 pages
BA Project - Team17
No ratings yet
BA Project - Team17
13 pages
Orange3 Data Mining Library Using Python
50% (2)
Orange3 Data Mining Library Using Python
102 pages
Kaggle Course Notes
No ratings yet
Kaggle Course Notes
87 pages
Presentation On Flight Price Prediction 2
No ratings yet
Presentation On Flight Price Prediction 2
30 pages
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
Edmonson et al. 1989. A Body Condition Scoring Chart for Holstein Dairy Cows. JDS
No ratings yet
Edmonson et al. 1989. A Body Condition Scoring Chart for Holstein Dairy Cows. JDS
12 pages
Journal of Asian Economics: Komain Jiranyakul, Timothy P. Opiela
No ratings yet
Journal of Asian Economics: Komain Jiranyakul, Timothy P. Opiela
8 pages
Chapter 08
No ratings yet
Chapter 08
86 pages
Errors and Evaluating of Analytical Data
No ratings yet
Errors and Evaluating of Analytical Data
23 pages
Wilcoxon Rank Sum Table
No ratings yet
Wilcoxon Rank Sum Table
4 pages
Statistical Power Analysis for the Behavioral Sciences 2nd Edition eBook Full Text
100% (9)
Statistical Power Analysis for the Behavioral Sciences 2nd Edition eBook Full Text
14 pages
Analysis of Blind Product Test: Market Research Term Project
No ratings yet
Analysis of Blind Product Test: Market Research Term Project
20 pages
Mail Spam
No ratings yet
Mail Spam
4 pages
STAT721 Test1 2022 Solutions
No ratings yet
STAT721 Test1 2022 Solutions
5 pages
Songs
No ratings yet
Songs
3 pages
Revision
No ratings yet
Revision
12 pages
Scatter Diagrams: Earnings Given Age
No ratings yet
Scatter Diagrams: Earnings Given Age
3 pages
To Demonstrate A Correlated Uniqueness Model, We Use The Following Summary Statistics Data
No ratings yet
To Demonstrate A Correlated Uniqueness Model, We Use The Following Summary Statistics Data
7 pages
(eBook PDF) Introduction to Probability and Statistics 3rd by William Mendenhall - The ebook is ready for download, no waiting required
100% (2)
(eBook PDF) Introduction to Probability and Statistics 3rd by William Mendenhall - The ebook is ready for download, no waiting required
45 pages
Correlation Lecture
No ratings yet
Correlation Lecture
20 pages
3-Estimation With Heteroscedasticity
No ratings yet
3-Estimation With Heteroscedasticity
12 pages
Chapter 7 - Factor Analysis
No ratings yet
Chapter 7 - Factor Analysis
43 pages
Immediate download (Ebook) Adventures in Financial Data Science: The Empirical Properties of Financial and Economic Data, 2nd Edition by Graham L. Giller ISBN 9789811250644, 9811250642 ebooks 2024
100% (2)
Immediate download (Ebook) Adventures in Financial Data Science: The Empirical Properties of Financial and Economic Data, 2nd Edition by Graham L. Giller ISBN 9789811250644, 9811250642 ebooks 2024
71 pages
Lecture Chi Square Non Parametric Test
No ratings yet
Lecture Chi Square Non Parametric Test
41 pages
Performance Task For Data Analysis and Interpretation
No ratings yet
Performance Task For Data Analysis and Interpretation
2 pages
FDT and MCT
No ratings yet
FDT and MCT
19 pages
Cochran C Test For Outliers
No ratings yet
Cochran C Test For Outliers
3 pages
Bayesian Inference 4 LMS PDF
No ratings yet
Bayesian Inference 4 LMS PDF
91 pages
Anova and Pca
No ratings yet
Anova and Pca
10 pages
Lnme Lnme Lndi Lncpi Lnexr Lnpop Lnto Lnme: Stage 1: Testing ADF and PP Unit Root Test Level I
No ratings yet
Lnme Lnme Lndi Lncpi Lnexr Lnpop Lnto Lnme: Stage 1: Testing ADF and PP Unit Root Test Level I
2 pages
Final 2019 2020 Winter Model Answer
No ratings yet
Final 2019 2020 Winter Model Answer
8 pages
About Log Linear Validation
No ratings yet
About Log Linear Validation
10 pages
Endogeneity
No ratings yet
Endogeneity
10 pages

Uploaded by

Uploaded by

Project Name:

FLIGHT FARE PREDICTION

NAME: RISHABH GIRI (1906191)

Following is the features available in the dataset – Airline, Date_of_Journey,

1) Install Jupyter Notebook : Ide where we used to code

2) Install liberary : Install all the important python liberary.

Pandas- This library is used for data analysis.

Seaborn/Matplotlib- It is used for data visualization.

3) Dataset : Download a dataset from kaggle website.

CLEAN DATASET: Delete unnecessary data from dataset

1. Missing Values in the dataset.

From description we can see that Date_of_Journey is a object

For this we require pandas to_datetime to convert object data

6) Feature Engineering : We add ,delete and combine the

1. Split dataset into train and test set in order to

Evaluating the model accuracy is an essential part of

• MAE (Mean absolute error)

You might also like