Predicting H-1B visa petitions

H-1B visas take a long time to process, in some cases 1-2 years. According to research by paysa.com, employees stay at big tech companies for less than 2 years. Does it really make sense, then, for big tech companies to go through the visa process for an employee who may or may not be a fit for the position? To address this problem, a model that takes the occupation name, job title, position type and salary could estimate the chance of acceptance before a petition is even filed. That would save big companies a great deal of time and money.

Dataset

The dataset for this project is from Kaggle. The raw data can also be obtained from the Office of Foreign Labor Certification (OFLC) website. The dataset contains five years' worth of H-1B petition data (2011-2016), approximately 3 million records.

Initial notebook

An nbviewer version of the notebook is here.

Project Design

This project will consist of six stages. Each stage is explained below.

Data Collection: Data can take many forms, such as image, audio or text data. The dataset used in this project comes from Kaggle.

Data Wrangling: In this stage, the dataset will be organized and cleaned to make it more usable in the later stages.
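As a rough illustration of what the wrangling stage might involve, the sketch below cleans a tiny hand-made table with pandas. The column names (`CASE_STATUS`, `JOB_TITLE`, `FULL_TIME_POSITION`, `PREVAILING_WAGE`), the sample rows, and the `wrangle` helper are assumptions made for this example, as is the decision to keep only certified and denied petitions. The notebook may take different steps.

```python
import pandas as pd

# Hypothetical sample rows; column names and values are assumptions for illustration.
raw = pd.DataFrame({
    "CASE_STATUS": ["CERTIFIED", "DENIED", "certified ", None, "WITHDRAWN", "CERTIFIED"],
    "JOB_TITLE": ["Software Engineer", "Auditor", " software engineer", "Chemist", "Auditor", "Analyst"],
    "FULL_TIME_POSITION": ["Y", "N", "Y", "Y", "N", "Y"],
    "PREVAILING_WAGE": [95000.0, 52000.0, 88000.0, 61000.0, 48000.0, None],
})


def wrangle(df):
    """Return a cleaned copy with a binary target and a binary full-time flag."""
    # Drop rows that are missing the target or the wage.
    out = df.dropna(subset=["CASE_STATUS", "PREVAILING_WAGE"]).copy()
    # Normalize free-text fields so that equal values compare equal.
    out["CASE_STATUS"] = out["CASE_STATUS"].str.strip().str.upper()
    out["JOB_TITLE"] = out["JOB_TITLE"].str.strip().str.upper()
    # Keep only the two outcomes used as classes in this sketch.
    out = out[out["CASE_STATUS"].isin(["CERTIFIED", "DENIED"])].copy()
    # Encode the target and the position type as 0/1 integers.
    out["certified"] = (out["CASE_STATUS"] == "CERTIFIED").astype(int)
    out["full_time"] = (out["FULL_TIME_POSITION"] == "Y").astype(int)
    return out.reset_index(drop=True)


clean = wrangle(raw)
print(clean[["JOB_TITLE", "PREVAILING_WAGE", "full_time", "certified"]])
```

Working on a copy keeps the raw table intact, so the cleaning steps can be rerun or adjusted without reloading the data.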

Data Exploring: In the data exploration stage, visualizations will be provided to help understand the data. Statistical methods will also be used to gather information from the data.
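One possible form of the statistical summaries mentioned above is sketched here with pandas: descriptive statistics for the wage column and the share of certified petitions per occupation. The column names (`soc_name`, `prevailing_wage`, `certified`) and the sample values are hypothetical and chosen only for illustration.

```python
import pandas as pd

# Hypothetical cleaned data; names and values are assumptions for illustration.
petitions = pd.DataFrame({
    "soc_name": ["ANALYSTS", "ANALYSTS", "PROGRAMMERS", "PROGRAMMERS", "PROGRAMMERS"],
    "prevailing_wage": [60000.0, 70000.0, 90000.0, 100000.0, 110000.0],
    "certified": [1, 0, 1, 1, 0],
})

# Count, mean, standard deviation, min, quartiles and max of the wage column.
wage_summary = petitions["prevailing_wage"].describe()

# Mean of a 0/1 column is the share of certified petitions in each group.
rate_by_occupation = (
    petitions.groupby("soc_name")["certified"].mean().sort_values(ascending=False)
)

print(wage_summary)
print(rate_by_occupation)
```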

Data Transforming: Data usually has to be scaled before it is passed to a model. Dimensionality reduction techniques will also be used to reduce the number of features so that the dataset can be visualized.
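The scaling and dimensionality reduction steps could look roughly like the sketch below, which uses scikit-learn's `StandardScaler` and `PCA` on synthetic numbers. The choice of these two classes, the two-component projection, and the random data are assumptions for this example. The README does not name a specific scaler or reduction technique.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for numeric features on very different scales (an assumption).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5)) * np.array([1.0, 10.0, 100.0, 1000.0, 50000.0])

# Standardize each feature to zero mean and unit variance.
X_scaled = StandardScaler().fit_transform(X)

# Project the scaled features onto two principal components for plotting.
pca = PCA(n_components=2)
X_2d = pca.fit_transform(X_scaled)

print(X_2d.shape)
print(pca.explained_variance_ratio_)
```

Scaling before PCA matters here because PCA is driven by variance, so an unscaled large-valued feature such as salary would otherwise dominate the components.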

Data Modeling: Three machine learning algorithms will be used to train a model: Logistic Regression, Random Forest and Support Vector Machines.
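A minimal sketch of training the three algorithms named above with scikit-learn follows. The synthetic data from `make_classification`, the hyperparameters, the train/test split, and the use of a scaling pipeline for the linear and SVM models are all assumptions for illustration, not the notebook's actual configuration.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic binary classification data standing in for encoded petitions (an assumption).
X, y = make_classification(
    n_samples=400, n_features=8, n_informative=4, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Logistic regression and the SVM are sensitive to feature scale, so they get a scaler.
models = {
    "logistic_regression": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)
    ),
    "random_forest": RandomForestClassifier(n_estimators=100, random_state=0),
    "svm": make_pipeline(StandardScaler(), SVC(kernel="rbf")),
}

test_accuracy = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    test_accuracy[name] = model.score(X_test, y_test)

# Estimated class probabilities for one held-out row, i.e. a "chance of acceptance".
proba = models["logistic_regression"].predict_proba(X_test[:1])

print(test_accuracy)
print(proba)
```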

Model Evaluation: This is where the model will be scored.
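The README does not say which score will be used, so the sketch below is only one option: accuracy, precision, recall and F1 computed from a confusion matrix in plain Python. The `binary_scores` helper and the sample labels are hypothetical. Precision and recall are included because accuracy alone can be misleading if one outcome is much more common than the other.

```python
def binary_scores(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall and F1 for binary labels."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and the same length")
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Hypothetical labels: 1 = certified, 0 = denied.
y_true = [1, 1, 1, 1, 0, 0, 1, 0]
y_pred = [1, 1, 1, 0, 0, 1, 1, 0]
scores = binary_scores(y_true, y_pred)
print(scores)
```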

License

The contents of this repository are covered under the MIT License.
