Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
91 changes: 86 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,8 +1,89 @@
# Data-Analysis
Data Science Projects Using Python and a little R.
# 📊 Data Analysis Projects

This repository is a place to share my code and notebooks for numerous data science projects.
Welcome! This repository contains a collection of **data science projects** built using **Python** (and a little **R**). These projects cover a range of topics from statistical modeling and visualization to machine learning and real-world datasets.

Many of these projects are written up on my [Towards Data Science Medium page](https://medium.com/@williamkoehrsen).
Whether you're just starting out in data science or looking to explore practical examples, this repo has something for you!

Reach me on Twitter [@koehrsen_will](https://twitter.com/@koehrsen_will)
---

## 🧠 What's Inside?

Here are some of the types of projects and topics you'll find:

* **Machine Learning**
Logistic regression, random forests, prediction intervals, and more.

* **Statistics & Probability**
Bayesian inference, Poisson distributions, ECDFs, and statistical significance.

* **Visualization**
Projects using Plotly, Datashader, and pair plots for insightful visual storytelling.

* **Time Series & Features**
Working with cyclical features, time features, and stock market data.

* **Data Engineering & Automation**
Web automation scripts, Slack interactions, and organizing large datasets.

* **Geographic & Traffic Data**
Analysis of NYC traffic data and geospatial visualization.

* **Miscellaneous Explorations**
Economics, learning habits, weight loss tracking, and more.

Each folder in the repo contains code, Jupyter notebooks, and notes that explain the thinking behind the project.

---

## 📚 Many Projects are Also Articles

I’ve written up many of these projects as articles on [Towards Data Science](https://towardsdatascience.com/) — where I explain the concepts in more detail for a broader audience.

---

## 📂 Project History

These projects span several years — from early experiments 7–8 years ago to more recent work. Some folders include notes like:

* "Working on plotting"
* "Finished prediction intervals notebook"
* "Formatted with Black"
* "Added table of contents"

This variety reflects an evolving learning journey through data science.

---

## 🚀 Getting Started

To run these notebooks:

1. Clone the repository

```bash
git clone https://github.com/YOUR_USERNAME/data-analysis.git
```

2. Install dependencies

```bash
pip install -r requirements.txt
```

3. Launch Jupyter Notebook or JupyterLab

```bash
jupyter notebook
```

---

## 📬 Get in Touch

If you have questions or want to connect, feel free to reach out on [Twitter @koehrsen\_will](https://twitter.com/koehrsen_will).

---

## 📄 License

This project is licensed under the MIT License. See the [LICENSE](./LICENSE) file for details.