@@ -6,88 +6,49 @@ Ollama is an open-source software tool that simplifies running large language mo
 - [Ollama](https://ollama.com)
 - [Ollama GitHub](https://github.com/ollama/ollama)
 
-
 ## Prerequisites
-
 Before starting, ensure you have [access](https://nag-devops.github.io/speed-hpc/#requesting-access) to the HPC (Speed) cluster.
 
 ## Instructions
+* Clone the Speed GitHub repository
+```shell
+git clone --depth=1 https://github.com/NAG-DevOps/speed-hpc.git
+```
 
-This requires having 2 sessions open
-
-### Session A - get a GPU node & start the server
-
-* SSH to speed and start an interactive session with salloc
-```shell
-ssh <ENCSusername>@speed.encs.concordia.ca
-salloc --mem=50G --gpus=1
-```
+* Navigate to the ollama directory in `src/llm-examples`
 
-* Create a working directory and navigate to it
-```shell
-mkdir /speed-scratch/$USER/ollama
-cd /speed-scratch/$USER/ollama
-```
+* Run `start_ollama.sh`
+```shell
+sbatch start_ollama.sh
+```
 
-* Download the Ollama tarball and extract it (creates the ollama binary here)
-```shell
-curl -LO https://ollama.com/download/ollama-linux-amd64.tgz
-tar -xzf ollama-linux-amd64.tgz
-```
+The script will:
+- Request required resources
+- Download the Ollama tarball and extract it
+- Add Ollama to the user's PATH and set up environment variables
 
-* Add ollama to your PATH for this session
-```shell
-setenv PATH /speed-scratch/$USER/ollama/bin:$PATH
-```
+```shell
+setenv PATH /speed-scratch/$USER/ollama/bin:$PATH
+```
 
-* Set Ollama to store its models in `/speed-scratch` to avoid quota limits
-```shell
-setenv OLLAMA_MODELS /speed-scratch/$USER/ollama/models
-mkdir -p $OLLAMA_MODELS
-```
+- Start the Ollama server with `ollama serve`
+- Print the ssh command to connect to the server.
 
-* Start the ollama server
-```shell
-ollama serve
-```
-
-* Leave this session open
-
-### Session B - hop to the same node & run/test
-* Open a new terminal window, ssh to speed, then ssh to the node the server is running on
-```shell
-ssh <ENCSusername>@speed.encs.concordia.ca
-ssh speed-XX
-cd /speed-scratch/$USER/ollama
-```
-
-* Sanity check
-```shell
-setenv PATH /speed-scratch/$USER/ollama/bin:$PATH
-ollama -v
-```
-
-* Pull a specific model and run it (optional)
-```shell
-ollama pull llama3.1
-echo "What is today" | ollama run llama3.1
-```
-
-* Create a Python environment to run the example
-```shell
-setenv ENV_DIR /speed-scratch/$USER/envs/python-env
-mkdir -p $ENV_DIR/{tmp,pkgs,cache}
+Note: The server is set to run for 3 hours (adjust if needed).
 
-setenv TMP $ENV_DIR/tmp
-setenv TMPDIR $ENV_DIR/tmp
-setenv PIP_CACHE_DIR $ENV_DIR/cache
+* Open a new terminal window and paste the ssh command to connect to the speed node the server is running on. The command will look like:
+```shell
+ssh -L XXXXX:localhost:XXXXX <ENCSusername>@speed.encs.concordia.ca -t ssh speed-XX
+```
 
-python3 -m venv $ENV_DIR
-source $ENV_DIR/bin/activate.csh
-pip install -U pip ollama
-```
+* Navigate to the ollama directory and do a sanity check
+```shell
+setenv PATH /speed-scratch/$USER/ollama/bin:$PATH
+ollama -v
+```
 
-* Copy the python file and execute it
+* Pull a specific model and run it interactively (optional).
 ```shell
-python ollama_test.py
+ollama pull llama3.2
+echo "What is today" | ollama run llama3.2
 ```
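
Besides the `ollama` CLI shown above, a running server can be queried over Ollama's HTTP API (the default port is 11434) once the ssh tunnel is in place. The sketch below is a hypothetical helper, not part of this commit: it only builds the POST request for the `/api/generate` endpoint, and the actual network call is left commented out so the snippet runs without a live server. The model name `llama3.2` matches the pull step above.

```python
import json
from urllib import request

# Default Ollama endpoint, reachable through the ssh -L tunnel shown above.
# (Hypothetical constant; adjust the port if your tunnel maps a different one.)
OLLAMA_URL = "http://localhost:11434/api/generate"

def build_generate_request(model: str, prompt: str) -> request.Request:
    """Build a POST request for Ollama's /api/generate endpoint."""
    payload = {
        "model": model,
        "prompt": prompt,
        "stream": False,  # request one JSON object instead of a token stream
    }
    return request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_generate_request("llama3.2", "What is today")
print(req.full_url)                    # http://localhost:11434/api/generate
print(json.loads(req.data)["model"])   # llama3.2

# With the server running and the tunnel open, the call would be:
# with request.urlopen(req) as resp:
#     print(json.loads(resp.read())["response"])
```

The `"stream": False` flag keeps the example simple; with streaming enabled the server returns one JSON object per generated token.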