Skip to content

Latest commit

 

History

History
40 lines (29 loc) · 1.97 KB

File metadata and controls

40 lines (29 loc) · 1.97 KB

Major project :

(This can done in groups of 2 - 3 people)

Problem Statements -

1

2

3

4

Datasets

Twitter Dataset: https://www.kaggle.com/datasets/kazanova/sentiment140
Disease Burden Dataset: https://www.kaggle.com/datasets/shivkumarganesh/disease-burden-by-cause
Gender Development Dataset: https://www.kaggle.com/datasets/elmartini/gender-development-index-2019
Campus Placement Dataset: https://www.kaggle.com/datasets/benroshan/factors-affecting-campus-placement
Employee Attrition Dataset: https://www.kaggle.com/datasets/patelprashant/employee-attrition

You can go through all the problem statements above and pick one to solve. After that follow these steps -

  1. Create a jupyter notebook or a colab notebook.
  2. Show all the calculations required and print them.
  3. The final calculation for the answer must be in a separate cell.
  4. Create a text cell to give explanations wherever you think is necessary.
  5. The visualization questions must be accompanied with an explanation as well as conclusions.
  6. Write all the answers in point form. Answers must be to the point.

Submission procedure:

Follow this video for the submission.
https://drive.google.com/file/d/1Brs-mx4Q9jlNVrevFZ2pVUNX5G235Tjc/view?usp=sharing

Pull request naming format:

Name_Dataset name

For example: Manvi Kaur_Loan Default Prediction

For any doubts or queries, put them on our discord server (doubts will be entertained on the ask-oc channel only) and we'll get back to you. ALL THE BEST!!

SUBMIT BY 7th JULY 12PM SHARP