This section contains the steps that you need to follow in order to make sure that your setup is up to date
First and foremost, in order to work in good conditions, make sure that :
- you are working with a high speed internet connection
- your computer has enough memory (8GB), so that you can run your code efficiently
- your computer has enough disk space (30GB), so that you can work on big datasets
Verify that git works
git --version
👉 You should expect an output similar to this one yielding the git version :
git version 2.33.0
Verify that you have access to the Le Wagon public GitHub repositories
cd ~/code/<YOUR_GITHUB_NICKNAME>/
git clone [email protected]:lewagon/data-setup data-setup
👉 The repo should clone correctly :
Cloning into 'data-setup'...
remote: Enumerating objects: 21, done.
remote: Counting objects: 100% (21/21), done.
remote: Compressing objects: 100% (14/14), done.
Receiving objects: 100% (21/21), done.
Resolving deltas: 100% (6/6), done.
remote: Total 21 (delta 6), reused 16 (delta 1), pack-reused 0
👉 You can delete the cloned repo
rm -Rf data-setup
Verify that you have a ~/.zprofile
:
cat ~/.zprofile
👉 You should see the following lines :
# Setup the PATH for pyenv binaries and shims
export PYENV_ROOT="$HOME/.pyenv"
export PATH="$PYENV_ROOT/bin:$PATH"
type -a pyenv > /dev/null && eval "$(pyenv init --path)"
If the command does not output anything, create the ~/.zprofile
file :
cd
touch .zprofile
Add the following lines to your ~/.zprofile
:
# Setup the PATH for pyenv binaries and shims
export PYENV_ROOT="$HOME/.pyenv"
export PATH="$PYENV_ROOT/bin:$PATH"
type -a pyenv > /dev/null && eval "$(pyenv init --path)"
Update pyenv :
brew update && brew upgrade pyenv
Install the current python version :
pyenv install 3.10.6
👉 Make sure that the command completes correctly and restart your terminal
Let's remove the existing current virtual environment :
pyenv virtualenv-delete lewagon_current
Create a new virtual environment :
pyenv virtualenv 3.10.6 lewagon_current
Set the new virtual environment as default :
pyenv global lewagon_current
You should now be able to see the new virtual environment as active :
pyenv versions
👉 Here is a sample output :
system
3.10.6
3.10.6/envs/lewagon_current
3.7.6
3.7.6/envs/lewagon
* lewagon_current
lewagon
pip install -U pip
If your computer uses Apple Silicon, expand the paragraph below and go through it. Otherwise ignore it.
👉 Setup for Apple Silicon 👈
pip install -r https://raw.githubusercontent.com/lewagon/data-setup/master/specs/releases/apple_silicon.txt
If your computer uses Apple Intel, expand the paragraph below and go through it. Otherwise ignore it.
👉 Setup for Apple Intel 👈
pip install -r https://raw.githubusercontent.com/lewagon/data-setup/master/specs/releases/apple_intel.txt
Make sure that the gcloud
command is linked to the email address of your Google Cloud Platform account :
gcloud auth list
👉 This lists the email address of your GCP account :
Credentialed Accounts
ACTIVE ACCOUNT
* [email protected]
To set the active account, run:
$ gcloud config set account `ACCOUNT`
Verify the name of your gcp project :
gcloud config list
👉 This lists both the email address of your GCP account and your GCP project :
[core]
account = [email protected]
disable_usage_reporting = True
project = your-gcp-project-id
Your active configuration is: [default]
Verify the email created for the service account allowing your code to identify to GCP :
gcloud iam service-accounts list
👉 This lists the GCP email address of the service account that allows your code to identify to GCP
DISPLAY NAME EMAIL DISABLED
your-gcp-project-id your-service-account@your-service-account.iam.gserviceaccount.com False
Go to GCP IAM & Admin / Service Accounts:
- Select your project
- Click on the service account email
- Go to
PERMISSIONS
- Make sure that the service account email has a
Role
set toOwner
Verify that you have configured your machine to allow your code to identify to GCP. The service account credentials json key file should be linked to the correct service account email :
cat $GOOGLE_APPLICATION_CREDENTIALS
👉 This lists the content of your service account credentials json key :
{
"type": "service_account",
"project_id": "your-gcp-project-id",
"private_key_id": "a2d4a2d4a2d4a2d4a2d4a2d4a2d4a2d4a2d4a2d4",
"private_key": "-----BEGIN PRIVATE KEY-----\nMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInMIInM=\n-----END PRIVATE KEY-----\n",
"client_email": "your-service-account@your-service-account.iam.gserviceaccount.com",
"client_id": "105410541054105410541",
"auth_uri": "https://accounts.google.com/o/oauth2/auth",
"token_uri": "https://oauth2.googleapis.com/token",
"auth_provider_x509_cert_url": "https://www.googleapis.com/oauth2/v1/certs",
"client_x509_cert_url": "https://www.googleapis.com/robot/v1/metadata/x509/your-service-account%40your-service-account.iam.gserviceaccount.com"
}
Make sure that the file contains the correct :
- project id : your-gcp-project-id
- service account email address : your-service-account@your-service-account.iam.gserviceaccount.com
👉 If this does not display anything or if the email inside of the file is not the one of your service account, go back to the setup
Make sure that Docker recognizes the GCP ressources :
gcloud auth configure-docker
👉 This lists the image name prefixes recognized by Docker as targetted to GCP
{
"credHelpers": {
"us.gcr.io": "gcloud",
"eu.gcr.io": "gcloud",
"asia.gcr.io": "gcloud",
"staging-k8s.gcr.io": "gcloud",
"marketplace.gcr.io": "gcloud",
"gcr.io": "gcloud"
}
}
Start the Docker app
Verify that Docker can run the hello-world image :
docker run hello-world
👉 Make sure that this command completes correctly
Stop the Docker app
Check your Python version with the following commands:
zsh -c "$(curl -fsSL https://raw.githubusercontent.com/lewagon/data-setup/master/checks/python_checker.sh)" 3.10.6
Run the following command to check if you successfully installed the required packages:
zsh -c "$(curl -fsSL https://raw.githubusercontent.com/lewagon/data-setup/master/checks/pip_check.sh)"
Now run the following command to check if you can load these packages:
python -c "$(curl -fsSL https://raw.githubusercontent.com/lewagon/data-setup/master/checks/pip_check.py)"
Make sure you can run Jupyter:
jupyter notebook
And open a Python 3
notebook.
Make sure that you are running the correct python version in the notebook. Open a cell and run :
import sys; sys.version
Here you have it! A complete python virtual env with all the third-party packages you'll need for the whole bootcamp.