Employee Attrition Prediction using Gradient Boosting Models

Problem Statement

Employee attrition poses a significant challenge for organizations, impacting productivity, team morale, and overall operational costs. Understanding the factors leading to employee turnover and predicting attrition in advance can help HR departments take proactive measures to retain talent.

This project aims to build a machine learning pipeline that can predict whether an employee is likely to leave the organization, based on a variety of personal and professional attributes.

Project Objective (Target)

The goal is to predict the likelihood of employee attrition (Yes/No) using historical HR data. The task is framed as a binary classification problem where the target variable is Attrition.

Dataset

Source: IBM HR Analytics Employee Attrition & Performance dataset (Kaggle Link) Total Records: 1,470 Target Column: Attrition (Yes or No) Feature Types: Mix of categorical, ordinal, and numerical features (e.g., Age, MonthlyIncome, JobRole, YearsAtCompany, WorkLifeBalance, etc.)

Approach

Data Cleaning and Preprocessing: a. Handled missing values (none present in the dataset) b. Encoded categorical features using one-hot encoding and label encoding c. Scaled numerical features using standardization
Model Selection and Training: Evaluated and compared the performance of three advanced gradient boosting algorithms: a. XGBoost b. CatBoost c. LightGBM d. Hyperparameter Tuning: Used Hyperopt library for this.

Evaluation Metrics

To ensure robust performance analysis, the following metrics were used: Accuracy, Precision, Recall, F1-Score, ROC-AUC Score, Confusion Matrix

Best Model: XGBoost

Accuracy: 83%, F1-Score: 64%, ROC-AUC: 0.94 (test)

Outperformed CatBoost and LightGBM on both precision and recall, indicating better balance and generalization across classes.

Tech Stack

Python, Jupyter Notebook, Scikit-learn, XGBoost, LightGBM, CatBoost, Matplotlib & Seaborn for visualizations

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
Employee_Attrition_Prediction.ipynb		Employee_Attrition_Prediction.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Employee Attrition Prediction using Gradient Boosting Models

Problem Statement

Project Objective (Target)

Dataset

Approach

Evaluation Metrics

Best Model: XGBoost

Tech Stack

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Employee Attrition Prediction using Gradient Boosting Models

Problem Statement

Project Objective (Target)

Dataset

Approach

Evaluation Metrics

Best Model: XGBoost

Tech Stack

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages