Fraudulent job postings

Please note that comments are available in the notebook itself:
EE0005_Mini_Project_Fraudulent_Job_Postings_(FINAL).ipynb

Because the notebook imports a large number of modules, we have made a duplicate Google Colab notebook here to make it easier to run.

This project was submitted as part of the requirements of EE0005 Introduction to Data Science and Artificial Intelligence.

Name (Alphabetical Order)	Contributions
Goh Lee Hua	Text pre-processing, Random forest classifier and GridSearchCV, Markdown comments
Hansel Tay	Lemmatization, Metrics, Oversampling and undersampling techniques
Philip Lee Hann Yung (Team Leader)	Feature extraction, TF-IDF vectorization, Modelling and Hyperparameter tuning, Organization of project pipeline, Markdown comments
Tan Keng Soon	Visualisation, Exploratory data analysis (EDA)

