TeslaTweets is a project where we try to predict whether Tesla's stock price will rise or fall based on Elon Musk's twitter data by using Machine Learning techniques and Natural Language Processing.
- The tweets dataset contains all of Elon Musk's tweets from November 16, 2012 to September 29, 2017. This dataset was taken from Kaggle.
- The stock price was extracted using pandas_datareader. Refer to the code for more information. This contains Date, Open, Close, Low, High prices and the Volume for Tesla's stock from November 16, 2012 to September 29, 2017.
To build and run the project on your device, do the following:
- Install Anaconda by following the instructions on their website and make sure you have Jupyter Notebooks. Alternatively you can also use Google Colab.
- Clone this repo to your local machine using
https://github.com/nowmozillaclub/TeslaTweets.git
. - Start Jupyter Notebook in the project directory and you are free to explore the codebase. For Google Colab, you will have to upload the file on your Google Drive and then you can start exploring.
Please read the CONTRIBUTING.md file for more details on how to contribute.
This project is developed by a bunch of engineering students at NMIMS MPSTME, Mumbai. Take a look at the entire list of contributors to see who all have helped with the project. The image on top is a photo by Team Freepik on Freepik.
This project is licensed under the MIT License - see the LICENSE file for details.
Here, XGBoostClassifier is the best model as it has an accuracy of ~61%. To read more more about how we got here and the steps we followed in this project, read FINAL_REPORT.md.