This repository contains Jupyter notebooks for an interactive tutorial on hash-based probabilistic data structures. Here's how to run it:
Use binder. (We don't recommend this if you'll be running the tutorial over conference wifi, but it requires almost no setup and can run from a computer that only has a browser.)
- Make sure you have Python 3.7 installed, installing it if necessary
- If you have a favorite package manager, use that
- if not, python.org has binaries for many platforms
- Make sure you have
git
installed, installing it if necessary- If you have a favorite package manager, use that
- if not, git-scm.com has binaries for many platforms (you won't need a GUI)
- Install pipenv
- on a Mac, the easiest way is probably
brew install pipenv
- on a Fedora Linux machine, the easiest way is probably
dnf install pipenv
- on Windows, if you have Python installed already, the easiest way is probably to use
pip
- on a Mac, the easiest way is probably
- Clone this repository:
git clone https://github.com/willb/data-sketching-and-other-magic-tricks/
- tip: if you don't have
git
installed, you can also download an archive of this repository
- tip: if you don't have
- Change to this repository's directory:
cd data-sketching-and-other-magic tricks
- Install the dependencies:
pipenv install
- Run the notebooks:
pipenv run jupyter notebook