This repository contains the .ipynb files and the report of our project for the course Pattern Recognition and Machine Learning, taught in the Fall Semester in Aristotle University of Thessaloniki.
Course Professor: Panagiotis Petrantonakis
This assignment is part of the Pattern Recognition & Machine Learning course and consists of four main Parts, each focusing on different Classification Algorithms.
This part involves the development of a Maximum Likelihood (ML) classifier to recognize stress in users of a video game, based on data derived from button pressure patterns. The goal is to evaluate the reliability of the variable x. This evaluation is based on data from 12 users, 7 of whom did not feel stressed and 5 felt stressed.
This part comes as an extension of part A. We implement a Bayesian Classifier and estimate the unknown parameter theta.
Using the Iris Dataset from the sklearn library we implement a Decision Tree Classifier and a Random Forest Classifier and we compare, among other characteristics of each classifier, the maximum depth of each one and decide which classifier is more optimal.
- Training Set: datasetTV.csv: 8743 samples with 224 features per sample, followed by a label (1,...,5) in the last column.
- Test Set: datasetTest.csv: 6955 samples without provided labels.
The goal of this Part is to try out a variety of classifiers, train them using the training set and validate them using the Test Set to determine which one works better with our data. In the end we produce a .npy file with the labels of the Test Set that the Classifier provided us with.
- Team11-AC.ipynb : The solutions of the first three parts
- Team11-D.ipynb : The solution of the last part
- PR_Assignment_2024.pdf : The Instructions of the Assignment
- Team11.pdf : The report in which we analyse our Implementations and present our conclusions
- labels11.npy : the labels of the Test Set that the Classifier provided us with in part D
- datasetTV.csv : Training Set used in part D for implementing the classifiers
- datasetTest.csv : Test Set used in part D for testing the classifiers' efiiciency
Panagiotis Mpelantakis