Create workflow_overview.md #83

Open · wants to merge 1 commit into `master`
57 changes: 57 additions & 0 deletions workflow_overview.md

# 📄 General Overview of Project Workflow
**Review comment:**

I appreciate that you noticed and kept the emojis here! There's a small glitch with Sphinx: all the top-level pages need to have their emojis wrapped like this to get them to align properly.

Suggested change:
```diff
- # 📄 General Overview of Project Workflow
+ # <span>📄</span> General Overview of Project Workflow
```


Once your [onboarding](https://intranet.neuro.polymtl.ca/onboarding/README.html) is complete, you will be ready to tackle your project!
**Review comment from @kousu (Dec 8, 2022):**

I try to use relative links everywhere, because the domain this is on is not promised to last forever. So

Suggested change:
```diff
- Once your [onboarding](https://intranet.neuro.polymtl.ca/onboarding/README.html) is complete, you will be ready to tackle your project!
+ Once your [onboarding](onboarding) is complete, you will be ready to tackle your project!
```

or maybe that doesn't work, tbh I forget how sphinx handles folder links. Maybe better to be explicit:

Suggested change:
```diff
- Once your [onboarding](https://intranet.neuro.polymtl.ca/onboarding/README.html) is complete, you will be ready to tackle your project!
+ Once your [onboarding](onboarding/README.md) is complete, you will be ready to tackle your project!
```

Note that the build process -- sphinx -- is smart enough to map .md to .html in the final build, but by keeping the link as a .md, if you've done the relative links right, GitHub's markdown renderer will render the whole site as a fully functional backup. Take a gander at https://github.com/neuropoly/intranet.neuro.polymtl.ca/blob/4f322012f35de2f170111e325110431bda94a7d6/workflow_overview.md: you will see that in this version this link is broken. Even once this is published to the live site, I'd still count that link as broken in spirit 😲 because it'd go to an external site, and so the docs wouldn't be self-contained.

You can double-check how your work will render in the end, with our theme and all the links fixed up properly. It's a bit more work, but you can just check out this branch on your computer and follow these instructions to build the docs:

```
pip install .[sphinx]
make html
```

They will end up in `_build/html/`.

(These are the same commands that GitHub uses to publish the live copy of the site.)


## 🖥️ Setting up 🖥️

**Step 1.**
* Make sure that your VPN connection is established or that you are connected to the Polytechnique wifi.

**Step 2.**
* Log in to one of the available [Neuropoly compute nodes](https://intranet.neuro.polymtl.ca/computing-resources/neuropoly/README.html):
```
ssh <POLYGRAMES_USERNAME>@<STATION>.neuro.polymtl.ca
```
**Review comment on lines +8 to +15:**

This duplicates the instructions at:

> ### SSH (command line)
> Once the VPN connection is established, connect via ssh using the `STATION` you want:
> ```bash
> ssh <POLYGRAMES_USERNAME>@<STATION>.neuro.polymtl.ca
> ```

I think those instructions could be clearer -- it would be nice if that page were broken up -- but in any case it's best not to duplicate the work.


**Step 3.**
* Create your project working directory:
```
cd data_nvme_<POLYGRAMES_USERNAME>
```

**Review comment:**

Most systems don't have `/mnt/nvme` or `~/data_nvme_$USER`. That's really just a hack we added for joplin at some point, and it's already documented here:

> | **Hostname** | `joplin.neuro.polymtl.ca` |
> For fast I/O, use the NVMe hard drive, which is automatically available: `~/data_nvme_$USER`

I would say there's no need to specify this. Anywhere someone has access that has enough space is fine, and if you just don't mention it then most people will by default end up working in their home directories, which should work perfectly well on most systems, at least to start out. The big gotcha is that combining duke and git is a bad idea, but I think documenting that is probably out of scope of this.

```
mkdir <PROJECT_NAME>
cd <PROJECT_NAME>
```
**Review comment on lines +21 to +22:**

We can help people out by being explicit here:

Suggested change:
```diff
  mkdir <PROJECT_NAME>
  cd <PROJECT_NAME>
+ git init
```


**Step 4. Developing version-controlled software**
* Ideally, you are working on code in a GitHub repository (either a branch of an existing repo, or a new one that you created).
* After adding your NeuroPoly workstation [SSH key to your GitHub account](https://docs.github.com/en/authentication/connecting-to-github-with-ssh/adding-a-new-ssh-key-to-your-github-account?platform=linux), you are ready to make a local clone of that remote repository:
```
cd data_nvme_<POLYGRAMES_USERNAME>/<PROJECT_NAME>
git clone -b "<YOUR_WORKING_BRANCH>" [email protected]:<REPOSITORY>.git
```

**Review comment:**

I think we want everything we do to be tracked by git by default. Things that should not be under git, like private data or test files, can either go in `/tmp` or can be added to `.gitignore` explicitly. That is to say, just have `<PROJECT_NAME>/`, not `<PROJECT_NAME>/<REPOSITORY>/`.

Then, we can either `git init` locally and `git push`, or do the reverse and click the New Repo button on GitHub (which runs `git init` on their side) and then `git clone`. Either way works, and GitHub gives helpful pointers to guide people through either workflow; once you've done either two or three times they become obvious, and most developers don't bother with the pointers. (A sketch of the init-and-push route follows below.)

We already have a page that's supposed to cover this, and while it needs a lot of love at the moment, it'd be better to work these tips into it than repeat them here. Can we arrange it so this section is just a link to that page? Maybe that page needs to get some edits in this PR as well, to bring it up to speed with our current practices and/or to slim out the advice we don't use.
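For illustration, a minimal sketch of the init-locally-then-push workflow mentioned above (the GitHub username and repository name are placeholders):

```bash
# inside the project directory created in Step 3
git init
git add .
git commit -m "Initial commit"
# after creating an empty repo on GitHub:
git remote add origin [email protected]:<YOUR_GITHUB_USERNAME>/<REPOSITORY>.git
git push -u origin master
```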


**Step 5. The data**
* It is critical to make sure that you know what data you are working with.
* Ideally, it should be in [BIDS](https://bids-specification.readthedocs.io/en/stable/) format on the [`data.neuro`](https://intranet.neuro.polymtl.ca/data/git-datasets.html) storage node: `data.neuro:datasets/<PROJECT_DATASET>`.
**Review comment from @kousu (Dec 8, 2022):**

Ditto with the relative links. Also I'd shorten the BIDS link to be safer against link rot -- their domain will change less frequently than their subfolders, hopefully.

Suggested change:
```diff
- * Ideally, it should be in [BIDS](https://bids-specification.readthedocs.io/en/stable/) format on the [`data.neuro`](https://intranet.neuro.polymtl.ca/data/git-datasets.html) storage node: `data.neuro:datasets/<PROJECT_DATASET>`.
+ Ideally, it should be in [BIDS](https://bids-specification.readthedocs.io) format. We have many of these on the private [`data`](data/git-datasets.md) server.
```

**Review comment from @kousu (Dec 9, 2022):**

We should also think about the workflow for combining datasets. In principle it's easy to include multiple datasets -- each of them is just a different folder, and we can do that with git submodules or almost any other tool. We should think about how to leave the door open to this and avoid tricking people into thinking that there's a 1-to-1 mapping between analyses and datasets, while remembering that the 1-to-1 situation is the common case. A sketch of what that might look like follows.
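For instance, a minimal sketch of combining two datasets as submodules (the dataset names are placeholders):

```bash
# each dataset becomes its own folder inside the project repo
git submodule add [email protected]:datasets/<DATASET_ONE>
git submodule add [email protected]:datasets/<DATASET_TWO>
git commit -m "Add project datasets as submodules"
```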

* Thanks to `git annex`, the following command will copy the directory structure and some small files of your dataset on `data.neuro`:
**Review comment from @kousu (Dec 8, 2022):**

This line doesn't use git annex, it's just git. You're right that this will copy the structure and small files, and that's because we store small files in git, because that's what it's better at.

Of course, git annex is important too. You mention it below, but I think it makes more sense to have the third line here: `cd <PROJECT_DATASET> && git annex get`.

There is also the subtlety to consider that you can also request specific subsets of the images, like `git annex get sub-001 sub-002 derivatives/sub-001`. But for an overview I would stick with the most basic command (sketched below).

So can this become a link to the relevant existing pages instead?
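As a sketch, the basic pattern described above (the dataset path is a placeholder):

```bash
cd <PROJECT_DATASET>
git annex get .                                    # download everything
git annex get sub-001 sub-002 derivatives/sub-001  # or just a subset
```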

```
cd data_nvme_<POLYGRAMES_USERNAME>/<PROJECT_NAME>
git clone [email protected]:datasets/<PROJECT_DATASET>
```

**Review comment:**

This is the most interesting part to me. This is the part where we might address neuropoly/data-management#136.

One option here would be:

Suggested change:
```diff
- git clone [email protected]:datasets/<PROJECT_DATASET>
+ git submodule add [email protected]:datasets/<PROJECT_DATASET>
```

This would be the Datalad YODA recommendation. They don't use the word "submodule" on that page but it is what they have in mind.

We can also do:

Suggested change:
```diff
- git clone [email protected]:datasets/<PROJECT_DATASET>
+ git submodule add -b v1.0.3 [email protected]:datasets/<PROJECT_DATASET>
```

to pick out a specific older version (in this case `v1.0.3`).
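And if the pin needs to move later, a sketch of bumping the submodule to a newer dataset tag (the tag name is hypothetical):

```bash
cd <PROJECT_DATASET>
git fetch --tags
git checkout v1.0.4   # hypothetical newer tag
cd ..
git add <PROJECT_DATASET>
git commit -m "Update dataset to v1.0.4"
```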

Then when you `git push` the code to GitHub, it looks like this:

*[Screenshot (2022-12-08): the kousu/proj1 repository page on GitHub]*

The dataset folder is unclickable because it's tagged as a submodule reference, and moreover it's a submodule that is on the private, intentionally-inaccessible, storage node:

*[Screenshot (2022-12-08): the submodule entry on GitHub, shown as an unclickable reference]*

but if I'm on one of our internal processing nodes with permission to the datasets, I can reproduce my entire project with `git clone --recurse-submodules`:

```
p115628@bireli:~/src$ git clone --recurse-submodules https://github.com/kousu/proj1
Cloning into 'proj1'...
remote: Enumerating objects: 6, done.
remote: Counting objects: 100% (6/6), done.
remote: Compressing objects: 100% (4/4), done.
remote: Total 6 (delta 0), reused 6 (delta 0), pack-reused 0
Receiving objects: 100% (6/6), done.
Submodule 'canproco' ([email protected]:datasets/canproco) registered for path 'canproco'
Cloning into '/home/GRAMES.POLYMTL.CA/p115628/src/proj1/canproco'...
remote: Enumerating objects: 51098, done.
remote: Counting objects: 100% (51098/51098), done.
remote: Compressing objects: 100% (27474/27474), done.
remote: Total 51098 (delta 22242), reused 39149 (delta 14297), pack-reused 0
Receiving objects: 100% (51098/51098), 4.44 MiB | 11.12 MiB/s, done.
Resolving deltas: 100% (22242/22242), done.
Submodule path 'canproco': checked out '42d424b0b56b9269fe0c5058130f5e3bc7c9e941'
p115628@bireli:~/src$ cd proj1/
p115628@bireli:~/src/proj1$ ls canproco/
dataset_description.json  sub-cal135  sub-cal192  sub-edm086  sub-edm183  sub-mon109  sub-mon174  sub-tor031  sub-tor103  sub-van111  sub-van176
derivatives               sub-cal136  sub-cal194  sub-edm087  sub-edm184  sub-mon111  sub-mon175  sub-tor032  sub-tor106  sub-van112  sub-van177
participants.json         sub-cal137  sub-cal195  sub-edm088  sub-mon001  sub-mon113  sub-mon176  sub-tor033  sub-tor107  sub-van116  sub-van178
participants.tsv          sub-cal138  sub-cal197  sub-edm089  sub-mon002  sub-mon118  sub-mon180  sub-tor035  sub-tor109  sub-van123  sub-van180
sub-cal056                sub-cal140  sub-cal198  sub-edm094  sub-mon003  sub-mon119  sub-mon181  sub-tor036  sub-tor110  sub-van124  sub-van181
sub-cal072                sub-cal142  sub-cal199  sub-edm095  sub-mon004  sub-mon121  sub-mon183  sub-tor037  sub-tor112  sub-van125  sub-van182
sub-cal073                sub-cal143  sub-cal200  sub-edm098  sub-mon005  sub-mon124  sub-mon185  sub-tor038  sub-tor114  sub-van129  sub-van183
sub-cal078                sub-cal144  sub-cal201  sub-edm105  sub-mon006  sub-mon125  sub-mon186  sub-tor039  sub-tor115  sub-van131  sub-van184
sub-cal080                sub-cal145  sub-cal202  sub-edm107  sub-mon007  sub-mon126  sub-mon187  sub-tor040  sub-tor118  sub-van133  sub-van185
sub-cal083                sub-cal146  sub-cal206  sub-edm113  sub-mon009  sub-mon129  sub-mon189  sub-tor041  sub-tor121  sub-van134  sub-van186
sub-cal084                sub-cal149  sub-cal207  sub-edm114  sub-mon010  sub-mon131  sub-mon190  sub-tor043  sub-tor123  sub-van135  sub-van189
sub-cal085                sub-cal150  sub-cal209  sub-edm118  sub-mon011  sub-mon132  sub-mon191  sub-tor044  sub-tor124  sub-van136  sub-van191
sub-cal088                sub-cal151  sub-cal210  sub-edm123  sub-mon013  sub-mon133  sub-mon192  sub-tor049  sub-tor125  sub-van137  sub-van192
```

which is pretty cool! And a lot more reliable than most other reproduction methods, which say "go dig up this DOI and try to find the matching dataset on Zenodo, and then the paper, and some code that went with the paper on some obscure university FTP site" (and don't say, but should, "and make sure you're running such and such a version of such and such an OS and using such and such a version of python and using such and such a version of nvidia's GPU hardware"...)

**Review comment:**

@sandrinebedard gave me a tour of how she handled https://github.com/sct-pipeline/ukbiobank-spinalcord-csa last year.

In short, she wrote a detailed process guide in:

https://github.com/sct-pipeline/ukbiobank-spinalcord-csa/blob/2f2cec3b91294153a635a8f725c0fbc749d172ca/README.md?plain=1#L42-L43

https://github.com/sct-pipeline/ukbiobank-spinalcord-csa/blob/2f2cec3b91294153a635a8f725c0fbc749d172ca/README.md?plain=1#L94-L100

https://github.com/sct-pipeline/ukbiobank-spinalcord-csa/blob/2f2cec3b91294153a635a8f725c0fbc749d172ca/README.md?plain=1#L111-L120

While explaining, she realized some parts were left undocumented: she set up a conda environment to avoid surprise breakage (which may cause different results, since `requirements.txt` doesn't specify versions), and there is a git tag named `r20210928` attached to the precise dataset she analysed:

```
p115628@joplin:~/datasets/uk-biobank-processed$ git remote -v
origin  [email protected]:datasets/uk-biobank-processed (fetch)
origin  [email protected]:datasets/uk-biobank-processed (push)
p115628@joplin:~/datasets/uk-biobank-processed$ git log HEAD~..
commit 7ed28cd0aacaab0ce1570bccdba3bff495b5f496 (HEAD -> master, tag: show, tag: r20210928, origin/master, origin/HEAD)
Author: Sandrine Bedard <[email protected]>
Date:   Sun Sep 5 17:49:04 2021 -0400

    change suffix in derivatives from labels-manual to labels-disc-manual
```

but she hasn't documented that that's the tag to go with the project. Otherwise her procedure fits option 3: write down all the steps and expect that people will read all your docs and follow all your instructions.

Maybe you can already tell, but I'm leaning against option 3. I think we need something more automated, because if there's one thing I know about human-computer interaction it's that people don't read instructions. I don't know if `git submodule` (option 1) is the right answer, but I think we need something automated. And we will know we have it when we can run a weekly cronjob on each of our papers that reproduces the final paper, including all figures and statistical analyses, from just the source code and access to the data server. Something like the sketch below.
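Purely as an illustration of that cronjob idea, a hypothetical sketch (every name here -- the repo, the dataset folder, the `make` target -- is made up):

```bash
#!/bin/sh
# hypothetical weekly job: rebuild a paper from scratch on an internal node
set -e
git clone --recurse-submodules [email protected]:<LAB>/<PAPER_REPO>.git
cd <PAPER_REPO>/<PROJECT_DATASET>
git annex get .        # fetch the imaging data from the internal server
cd ..
make paper             # hypothetical target that regenerates figures and stats
```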

**Review comment:**

Oh, perfect. I just poked around a bit and noticed that you were trying to reproduce this project: https://github.com/NadiaBlostein/Chemical_Structure_Reconstruction

The very first thing it says, pretty much, is:

> 1. Download the Mordred compound sets (1 through 3) and place them in a `data/` folder

I have no idea where to get the Mordred compound sets (1 through 3) or what version (if any) of them I'm supposed to get, and I don't know if they assume the `data/` folder is supposed to be in the same folder as the code, or if they mean the current working directory (reading it in the Unix convention, I would take it as the current working directory!), which isn't necessarily the same.

And:

> 1. Run the main function in `src/thresholding.py`.

There is no main function in that file. As a programmer I can read the code and interpret that they must mean "run `python src/thresholding.py`", but when working with a large series of projects it would be impossible to track down and correct every small gap like that.

Their repo also depends on pandas and some other third-party Python packages, but doesn't declare a `setup.py` or `requirements.txt` or a conda environment or anything. It doesn't even say what version of Python it was written against. All of that is key to write down for reproducibility.

Those are some prime examples of the mistakes I want us to be able to avoid.
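For what it's worth, a quick sketch of capturing those versions (run inside the project's own environment):

```bash
python --version                 # record this in the README
pip freeze > requirements.txt    # pin the exact package versions in use
```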


## 🌊 Workflow 🌊

### ⌨️ Code
Any changes you make to the code should be added in small commits and pushed to your GitHub branch.
**Review comment:**

Good advice! But again, better kept in the existing documentation.


### 💿 Data
* If you need to access your data files directly, you can use `git annex` to download the larger files to the [Neuropoly computer](https://intranet.neuro.polymtl.ca/computing-resources/neuropoly/README.html) you are working from:
```
cd data_nvme_<POLYGRAMES_USERNAME>/<PROJECT_NAME>/<PROJECT_DATASET>
```

**Review comment:** Ditto on `data_nvme_$USER`.

```

git annex get .
```
* However, in order to save space, make sure to "undownload" those big files once you are done working with them with:
```
git annex drop .
```
**Review comment on lines +53 to +56:**

This is a good tip, but I think it would fit better as a new subsection over in https://intranet.neuro.polymtl.ca/data/git-datasets.html#drop.

* Any data derivatives that you output should be added to `data.neuro:datasets/<PROJECT_DATASET>` according to the [BIDS](https://bids-specification.readthedocs.io/en/stable/) data standard! More documentation on how to version control your data on `data.neuro` can be found [here](https://intranet.neuro.polymtl.ca/data/git-datasets.html#update).
**Review comment:**

Ditto on the absolute link: better to make it relative.