📊 Data Analysis Projects

Welcome! This repository contains a collection of data science projects built using Python (and a little R). These projects cover a range of topics from statistical modeling and visualization to machine learning and real-world datasets.

Whether you're just starting out in data science or looking to explore practical examples, this repo has something for you!


🧠 What's Inside?

Here are some of the types of projects and topics you'll find:

  • Machine Learning: Logistic regression, random forests, prediction intervals, and more.

  • Statistics & Probability: Bayesian inference, Poisson distributions, ECDFs, and statistical significance.

  • Visualization: Projects using Plotly, Datashader, and pair plots for insightful visual storytelling.

  • Time Series & Features: Working with cyclical features, time features, and stock market data.

  • Data Engineering & Automation: Web automation scripts, Slack interactions, and organizing large datasets.

  • Geographic & Traffic Data: Analysis of NYC traffic data and geospatial visualization.

  • Miscellaneous Explorations: Economics, learning habits, weight loss tracking, and more.

Each folder in the repo contains code, Jupyter notebooks, and notes that explain the thinking behind the project.
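
To give a flavor of one topic from the list above (cyclical features), here is a minimal sketch of sine/cosine encoding for a periodic value such as hour of day. It is not taken from the repo's notebooks; the function name `encode_cyclical` and the hour-of-day example are assumptions made for illustration, and it uses only the Python standard library.

```python
import math


def encode_cyclical(value, period):
    """Map a periodic value (e.g. hour of day) to a (sin, cos) point on the unit circle.

    Hypothetical helper for illustration; not part of this repository.
    """
    angle = 2 * math.pi * (value % period) / period
    return math.sin(angle), math.cos(angle)


# Encode three hours of the day on a 24-hour cycle
hour_23 = encode_cyclical(23, 24)
hour_0 = encode_cyclical(0, 24)
hour_12 = encode_cyclical(12, 24)

# As raw numbers, 23 and 0 look far apart. After encoding,
# 23:00 sits next to 00:00, while 12:00 is on the opposite side of the circle.
near = math.dist(hour_23, hour_0)
far = math.dist(hour_0, hour_12)
print(near < far)
```

The point of the encoding is that a model sees the end of one cycle as adjacent to the start of the next, which a plain integer hour column does not convey.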


📚 Many Projects are Also Articles

I've written up many of these projects as articles on Towards Data Science, where I explain the concepts in more detail for a broader audience.


📂 Project History

These projects span several years, from early experiments 7–8 years ago to more recent work. Some folders include notes like:

  • "Working on plotting"
  • "Finished prediction intervals notebook"
  • "Formatted with Black"
  • "Added table of contents"

This variety reflects an evolving learning journey through data science.


🚀 Getting Started

To run these notebooks:

  1. Clone the repository and move into it

    git clone https://github.com/YOUR_USERNAME/data-analysis.git
    cd data-analysis

  2. Install dependencies

    pip install -r requirements.txt

  3. Launch Jupyter Notebook or JupyterLab

    jupyter notebook    # or: jupyter lab

📬 Get in Touch

If you have questions or want to connect, feel free to reach out on Twitter @koehrsen_will.


📄 License

This project is licensed under the MIT License. See the LICENSE file for details.