Repository for the Human Data Analytics Project 2020/2021.
/dataset (to download)
/data
- data.tar
- data.tar.gz
/demo
/models
/Base_NN
/ChromeR
/Deep_Chrome
/separated (contains all the models trained individually)
/Ultra_Chrome
/notebooks
- attentive-chrome.ipynb
- converter.ipynb
- models.ipynb
- parser.ipynb
- visualize.ipynb
/papers
/report
/assets
- latex-project.zip
- Report.pdf
/results
The full dataset used in this project can be found here.
The demo was develop as an Angular webapp and is hosted by Firebase Hosting.
Live demo here.
In the report folder you can find the final Report.pdf along with the latex project I developed on overleaf, an online latex environment.
All the code written can be found in the notebooks folder in jupyter notebook format.
In order to reproduce the results reported in the Report:
-
Download the dataset into the
datasetfolder -
Run
parsernotebook: loads raw data and organizes into numpy array for future processing -
Run
modelsnotebook: loads the parsed dataset, trains and saves the models, computes predictions and saves results -
Run
attentive-chromenotebook: download the pre-trained models of AttentiveChrome and computes the predictions for every cell type of the dataset -
Run
converternotebook: converts thetensorflowsaved models into javascript compatible files ready to be used inside the demo webapp -
Run
visualizenotebook: loads results and compares them with different plots
• DeepChrome- deep-learning for predicting gene expression from histone modifications