divyakkm/Data-Mining-Project

Predicting Airline Delays - Fly from SFO or OAK?

Team
Divya M
Eunkwang J
Ryan J
Julia K

Problem Statement
Simplified version: "Given a destination and a date range, which is the better airport to fly out of, SFO or OAK?" We applied machine learning techniques to build a predictive model that helps a flyer decide which airport to choose. The model was built using data for all US domestic flights from 2001 to 2008. It works for any airport, but we were particularly interested in SFO and OAK. A popular urban myth says that flying from OAK avoids delays; we find that this is not always true.

About the Data
We use the airline data for individual years available at http://stat-computing.org/dataexpo/2009/the-data.html.
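As a rough sketch of how one yearly file might be reduced to the SFO/OAK question, the snippet below parses a few rows in the Data Expo column layout and labels each flight as delayed or not. The sample rows are made up for illustration, the function name `load_bay_area_flights` is hypothetical, and both the 15-minute cutoff on `DepDelay` and the handling of `NA` as a missing value are assumptions rather than the project's confirmed preprocessing.

```python
import csv
import io

# Made-up sample rows using a subset of the Data Expo 2009 columns.
SAMPLE_CSV = """Year,Month,DayofMonth,DayOfWeek,UniqueCarrier,DepDelay,Origin,Dest
2008,1,3,4,WN,25,OAK,LAX
2008,1,3,4,UA,5,SFO,LAX
2008,1,4,5,WN,NA,OAK,SEA
2008,2,10,7,UA,40,SFO,SEA
2008,2,11,1,AA,-3,SFO,JFK
2008,2,12,2,WN,0,SJC,LAX
"""

# Assumed cutoff: a departure counts as delayed at 15 minutes or more.
DELAY_THRESHOLD_MIN = 15


def load_bay_area_flights(text, origins=("SFO", "OAK")):
    """Keep flights from the given origins and attach a boolean delay label."""
    flights = []
    for row in csv.DictReader(io.StringIO(text)):
        if row["Origin"] not in origins:
            continue  # only the airports being compared
        if row["DepDelay"] == "NA":
            continue  # assumed marker for a missing delay value
        flights.append({
            "origin": row["Origin"],
            "dest": row["Dest"],
            "month": int(row["Month"]),
            "day_of_week": int(row["DayOfWeek"]),
            "delayed": int(row["DepDelay"]) >= DELAY_THRESHOLD_MIN,
        })
    return flights


flights = load_bay_area_flights(SAMPLE_CSV)
for flight in flights:
    print(flight)
```

For the real files, the same filtering and labeling could be done with Pandas, which the project lists among its libraries.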

Techniques
Naive Bayes
Logistic Regression
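To illustrate how a Naive Bayes model can answer the "SFO or OAK?" question, here is a minimal sketch of a categorical Naive Bayes classifier with Laplace smoothing. It is written in plain Python so it runs without dependencies and is not the project's implementation, which presumably relied on the libraries listed in this README. The training rows are made up, the choice of `origin` and `dest` as the only features is an assumption, and the function names are hypothetical.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(rows, labels):
    """Count class and per-feature value frequencies from categorical rows."""
    class_counts = Counter(labels)
    # feature_counts[label][feature_name][value] -> number of occurrences
    feature_counts = defaultdict(lambda: defaultdict(Counter))
    values = defaultdict(set)  # distinct values seen for each feature
    for row, label in zip(rows, labels):
        for name, value in row.items():
            feature_counts[label][name][value] += 1
            values[name].add(value)
    return class_counts, feature_counts, values


def predict_delay_probability(model, row):
    """Return P(delayed | row) under the naive independence assumption."""
    class_counts, feature_counts, values = model
    total = sum(class_counts.values())
    log_scores = {}
    for label, count in class_counts.items():
        score = math.log(count / total)  # log prior
        for name, value in row.items():
            seen = feature_counts[label][name][value]
            # Laplace (add-one) smoothing avoids zero probabilities.
            score += math.log((seen + 1) / (count + len(values[name])))
        log_scores[label] = score
    # Normalise in a numerically stable way.
    peak = max(log_scores.values())
    weights = {label: math.exp(s - peak) for label, s in log_scores.items()}
    return weights.get(True, 0.0) / sum(weights.values())


def better_airport(model, dest, origins=("SFO", "OAK")):
    """Pick the origin with the lowest predicted delay probability."""
    return min(
        origins,
        key=lambda o: predict_delay_probability(model, {"origin": o, "dest": dest}),
    )


# Made-up training data: (features, delayed?)
rows = [
    {"origin": "SFO", "dest": "LAX"},
    {"origin": "SFO", "dest": "LAX"},
    {"origin": "SFO", "dest": "SEA"},
    {"origin": "OAK", "dest": "LAX"},
    {"origin": "OAK", "dest": "LAX"},
    {"origin": "OAK", "dest": "SEA"},
]
labels = [True, True, False, False, False, True]

model = train_naive_bayes(rows, labels)
p_sfo = predict_delay_probability(model, {"origin": "SFO", "dest": "LAX"})
p_oak = predict_delay_probability(model, {"origin": "OAK", "dest": "LAX"})
choice = better_airport(model, "LAX")
print(round(p_sfo, 3), round(p_oak, 3), choice)
```

On this toy data the model favors OAK for LAX, but the outcome depends entirely on the invented rows; a date-range feature such as month or day of week could be added to each row in the same way.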

Python Libraries
Pandas, scikit-learn, Matplotlib, Seaborn
