Skip to content

leephilipx/Fraudulent-Job-Postings

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Fraudulent job postings

Please note that comments are available in the notebook itself:
EE0005_Mini_Project_Fraudulent_Job_Postings_(FINAL).ipynb

Due to the large amount of modules imported, we have made a duplicate Google Colab notebook here for ease of running this notebook.

Our objectives and the questions we intend to answer:

  • Identify fake job postings by sentiment analysis.
  • Which model is the best, in terms of metric scores, memory used and speed?
  • Identify countries which are associated with fraud postings.
  • Find associated features with a fraudulent and non-fraudulent posting.

This project was submitted as part of the requirements of EE0005 Introduction to Data Science and Artificial Intelligence.


The list of contributors for this project is tabled below.

Name (Alphabetical Order) Contributions
Goh Lee Hua Text pre-processing, Random forest classifier and GridSearchCV, Markdown comments
Hansel Tay Lemmatization, Metrics, Oversampling and undersampling techniques
Philip Lee Hann Yung (Team Leader) Feature extraction, TF-IDF vectorization, Modelling and Hyperparameter tuning, Organization of project pipeline, Markdown comments
Tan Keng Soon Visualisation, Exploratory data analysis (EDA)

About

Sentiment analysis is used to predict fraudulent job postings.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors