llama.zero

This is a fork of llama.cpp that compiles on the Pi Zero, the Pi 1, or any other ARM1176JZF-based device. It does this by modifying the CMake build files so that ARMv6 is not treated as an architecture with NEON support. The original repo forces the build to use instructions these CPUs do not support, which inevitably fails.

This repo also includes instructions for turning the Pi into an AI-assist USB device.

How to set up Llama.zero on Pi Zero as USB device

USB Module Setup

Reference can be found here

Add dwc2 device tree overlay

# This gets appended to /boot/config.txt
echo "dtoverlay=dwc2" | sudo tee -a /boot/config.txt

In /boot/cmdline.txt, insert modules-load=dwc2,g_multi after rootwait.

sudo sed -i 's/rootwait/rootwait modules-load=dwc2,g_multi/' /boot/cmdline.txt
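Running the two commands above more than once adds duplicate entries. As a rough sketch, the same edits can be made idempotent. The helper names append_once and add_gadget_modules are made up for this example and are not part of the repo. On some newer Raspberry Pi OS releases these files may live under /boot/firmware instead, so adjust the paths if needed.

```shell
#!/bin/bash

# Hypothetical helper: append a line to a file only if it is not already there.
append_once() {
    local line=$1 file=$2
    grep -qxF -- "$line" "$file" 2>/dev/null || echo "$line" >> "$file"
}

# Hypothetical helper: insert the modules-load parameter after rootwait,
# unless it is already present.
add_gadget_modules() {
    local file=$1
    grep -q 'modules-load=dwc2,g_multi' "$file" ||
        sed -i 's/rootwait/rootwait modules-load=dwc2,g_multi/' "$file"
}

# Usage on the Pi (run as root):
#   append_once "dtoverlay=dwc2" /boot/config.txt
#   add_gadget_modules /boot/cmdline.txt
```

Both helpers leave the file untouched when the entry already exists, so they are safe to rerun.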

Create image file for USB mass storage

sudo dd if=/dev/zero of=/llamazero.img bs=1M count=64
sudo mkfs.vfat /llamazero.img

Create mount directory

sudo mkdir /mnt/llamazero
sudo mount /llamazero.img /mnt/llamazero

Enable USB kernel module

sudo modprobe g_multi file=/llamazero.img cdrom=0 ro=0

Add the modprobe command to rc.local so the USB gadget is loaded automatically when the Pi starts.

sudo sed -i '/^exit 0/i modprobe g_multi file=/llamazero.img cdrom=0 ro=0' /etc/rc.local

Useful commands

# Unmount fs
sudo umount /mnt/llamazero
# Mount fs
sudo mount /llamazero.img /mnt/llamazero
# Attach the USB gadget
sudo modprobe g_multi file=/llamazero.img cdrom=0 ro=0
# Detach the USB gadget
sudo modprobe -r g_multi

Behavior

To see new files created on the computer, remount the image on the Pi:

sudo umount /mnt/llamazero
sudo mount /llamazero.img /mnt/llamazero
sudo ls /mnt/llamazero

To make edits made on the Pi visible on the computer, unload and reload the USB module:

# While the file system folder is mounted
sudo modprobe -r g_multi
sudo modprobe g_multi file=/llamazero.img cdrom=0 ro=0
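The two refresh directions above can be wrapped in small functions. This is only a sketch: the function names and the DRY_RUN switch are made up for this example, and the extra sync before reloading the gadget is an assumption on my part to make sure pending writes reach the image file first.

```shell
#!/bin/bash

IMG_FILE=/llamazero.img
MOUNT_POINT=/mnt/llamazero

# With DRY_RUN=1 the commands are printed instead of executed.
run() {
    if [ "${DRY_RUN:-0}" = 1 ]; then echo "$*"; else sudo "$@"; fi
}

# Make files written by the computer visible on the Pi.
refresh_pi_view() {
    run umount "$MOUNT_POINT"
    run mount "$IMG_FILE" "$MOUNT_POINT"
}

# Make files written by the Pi visible on the computer.
refresh_host_view() {
    run sync
    run modprobe -r g_multi
    run modprobe g_multi "file=$IMG_FILE" cdrom=0 ro=0
}
```

Set DRY_RUN=1 before calling either function to check what would be run without touching the gadget.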

Auto-updater script

#!/bin/bash

MOUNT_POINT=/mnt/llamazero
IMG_FILE=/llamazero.img

while true; do
    # Unmount and remount to pick up new files written by the computer
    sudo umount "$MOUNT_POINT"
    sudo mount "$IMG_FILE" "$MOUNT_POINT"

    # Find empty files and write each file's own path into it
    # (NUL-separated so that file names with spaces survive)
    mapfile -d '' -t files < <(sudo find "$MOUNT_POINT" -type f -empty -print0)
    for file in "${files[@]}"; do
        echo "Processing file: $(basename "$file")"
        echo "$file" | sudo tee -a "$file" > /dev/null
        sync  # Flush the write to the image file
        sleep 1
        # Reload the USB gadget so the computer sees the change
        echo "Unmount USB"
        sudo modprobe -r g_multi
        sleep 1
        echo "Mount USB"
        sudo modprobe g_multi file="$IMG_FILE" cdrom=0 ro=0
    done

    sleep 2  # Adjust sleep time as needed
done

Memory set-up for llama.cpp

Create a 4 GB swap file so the llama.cpp compilation has enough memory

sudo fallocate -l 4G /swapfile
sudo chmod 600 /swapfile
sudo mkswap /swapfile
sudo swapon /swapfile
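Before starting a build that can run for hours, it may be worth checking that the swap file is really active. A minimal sketch, assuming the usual /proc/swaps layout (one header line, then one line per swap area); swap_active is a made-up helper name:

```shell
#!/bin/bash

# Hypothetical helper: succeed if the given swap file is listed as active.
# The second argument defaults to /proc/swaps and exists mainly for testing.
swap_active() {
    local swapfile=$1 table=${2:-/proc/swaps}
    awk -v f="$swapfile" 'NR > 1 && $1 == f { found = 1 } END { exit !found }' "$table"
}

# Usage on the Pi:
#   swap_active /swapfile && echo "swap is on" || echo "swap is off"
```

Note that swapon on its own does not survive a reboot, so rerun it if the Pi restarts in the middle of the build.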

Compile llama.cpp

Install dependencies

sudo apt install -y git cmake ccache libpthread-stubs0-dev build-essential

Clone this repo

git clone https://github.com/pham-tuan-binh/llama.zero.git
cd llama.zero

Build

This will take a very long time. My first successful compilation took 24 hours, no cap, though part of that was because I ran into some errors. Hopefully yours won't.

cmake -B build
cmake --build build --config Release

Add to path

echo 'export PATH=$PATH:~/llama.zero/build/bin/' >> ~/.bashrc

Test run

You can test any model you want, as long as it is smaller than 512 MB and in GGUF format.

llama-cli -m model.gguf -p "The meaning of life is" -n 16 2> /dev/null
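The two model requirements above can be checked before a run. This is a sketch with made-up helper names (is_gguf, model_fits). It relies on GGUF files starting with the four ASCII bytes GGUF and on GNU stat, which is what Raspberry Pi OS ships.

```shell
#!/bin/bash

# Hypothetical helper: succeed if the file starts with the GGUF magic bytes.
is_gguf() {
    [ "$(head -c 4 "$1" 2>/dev/null)" = "GGUF" ]
}

# Hypothetical helper: succeed if the file is smaller than the limit
# in MB (default 512). Returns 2 if the file cannot be read.
model_fits() {
    local model=$1 limit_mb=${2:-512}
    local bytes
    bytes=$(stat -c %s "$model") || return 2
    [ "$bytes" -lt $((limit_mb * 1024 * 1024)) ]
}

# Usage:
#   is_gguf model.gguf && model_fits model.gguf && echo "model looks usable"
```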

Auto fill in USB file script

Make this script into a systemd service so that it runs whenever you plug in your Pi.

#!/bin/bash

MOUNT_POINT=/mnt/llamazero
IMG_FILE=/llamazero.img
MODEL=~/tiny-15M-Q4KM.gguf

while true; do
    # Unmount and remount to pick up new files written by the computer
    sudo umount "$MOUNT_POINT"
    sudo mount "$IMG_FILE" "$MOUNT_POINT"

    # Use the name of each empty file as the prompt and write the completion into it
    # (NUL-separated so that prompts with spaces survive)
    mapfile -d '' -t files < <(sudo find "$MOUNT_POINT" -type f -empty -print0)
    for file in "${files[@]}"; do
        para=$(basename "$file")
        echo "Processing file: $file"
        llama-cli -n 16 -p "$para" -m "$MODEL" 2> /dev/null | sudo tee "$file"

        sync  # Flush the write to the image file
        sleep 1
        # Reload the USB gadget so the computer sees the change
        echo "Unmount USB"
        sudo modprobe -r g_multi
        sleep 1
        echo "Mount USB"
        sudo modprobe g_multi file="$IMG_FILE" cdrom=0 ro=0
    done

    sleep 2  # Adjust sleep time as needed
done
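
One possible way to turn the script above into a service is sketched below. The unit name llamazero.service and the script path /home/pi/autofill.sh are placeholders I picked for the example, not names used by this repo.

```shell
#!/bin/bash

# Hypothetical helper: write a minimal systemd unit that runs the given script.
write_unit() {
    local script=$1 unit=$2
    cat > "$unit" <<EOF
[Unit]
Description=llama.zero USB auto-fill

[Service]
ExecStart=/bin/bash $script
Restart=always
RestartSec=5

[Install]
WantedBy=multi-user.target
EOF
}

# Usage on the Pi (run as root; both paths are placeholders):
#   write_unit /home/pi/autofill.sh /etc/systemd/system/llamazero.service
#   systemctl daemon-reload
#   systemctl enable --now llamazero.service
```

A service started this way runs as root and does not read ~/.bashrc, so ~ points at root's home and llama-cli may not be on the PATH. Using absolute paths for llama-cli and the model inside the script is probably the safest option.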
