Major project :

(This can done in groups of 2 - 3 people)

Problem Statements -

Datasets

Twitter Dataset: https://www.kaggle.com/datasets/kazanova/sentiment140
Disease Burden Dataset: https://www.kaggle.com/datasets/shivkumarganesh/disease-burden-by-cause
Gender Development Dataset: https://www.kaggle.com/datasets/elmartini/gender-development-index-2019
Campus Placement Dataset: https://www.kaggle.com/datasets/benroshan/factors-affecting-campus-placement
Employee Attrition Dataset: https://www.kaggle.com/datasets/patelprashant/employee-attrition

You can go through all the problem statements above and pick one to solve. After that follow these steps -

Create a jupyter notebook or a colab notebook.
Show all the calculations required and print them.
The final calculation for the answer must be in a separate cell.
Create a text cell to give explanations wherever you think is necessary.
The visualization questions must be accompanied with an explanation as well as conclusions.
Write all the answers in point form. Answers must be to the point.

Submission procedure:

Follow this video for the submission.
https://drive.google.com/file/d/1Brs-mx4Q9jlNVrevFZ2pVUNX5G235Tjc/view?usp=sharing

Pull request naming format:

Name_Dataset name

For example: Manvi Kaur_Loan Default Prediction

For any doubts or queries, put them on our discord server (doubts will be entertained on the ask-oc channel only) and we'll get back to you. ALL THE BEST!!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Major project :

(This can done in groups of 2 - 3 people)

Problem Statements -

Datasets

Submission procedure:

Pull request naming format:

Name_Dataset name

SUBMIT BY 7th JULY 12PM SHARP

Files

README.md

Latest commit

History

README.md

File metadata and controls

Major project :

(This can done in groups of 2 - 3 people)

Problem Statements -

Datasets

Submission procedure:

Pull request naming format:

Name_Dataset name

SUBMIT BY 7th JULY 12PM SHARP