Local GPT-OSS 20B Server

Run OpenAI's GPT-OSS 20B model locally on your Mac with OpenAI SDK compatibility!

Quick Start

# 1. Start the server
./run_gpt_ollama.sh

# 2. Check server status
./status_gpt_ollama.sh

# 3. Test it works
cd test-client
npm install
npm run test:openai  # Test with OpenAI SDK
npm run test:direct  # Test with direct Ollama API

# 4. Stop the server when done
./stop_gpt_ollama.sh

What's GPT-OSS?

OpenAI's open-source 20B parameter model with:

Full chain-of-thought reasoning - You can see the model's thinking process!
OpenAI API compatibility - Use with any OpenAI SDK
Runs locally on your Mac - Private, offline AI

🧠 Unique Feature: Reasoning Tokens

GPT-OSS exposes its internal thinking process! You can watch it think in real-time:

cd test-client

# Three ways to test:
npm run test:simple  # Direct API call (no OpenAI SDK)
npm run test:openai  # OpenAI SDK (non-streaming)
npm run test:live    # OpenAI SDK with live thinking + streaming!

The model's reasoning tokens show HOW it thinks before answering - something usually hidden in other models!

Usage with OpenAI SDK

import OpenAI from 'openai';

const openai = new OpenAI({
  baseURL: 'http://localhost:11434/v1',  // Your local server
  apiKey: 'ollama',  // Required but ignored
});

// Use exactly like ChatGPT!
const response = await openai.chat.completions.create({
  model: 'gpt-oss:20b',
  messages: [{ role: 'user', content: 'Hello!' }]
});

iPhone Access

When the server is running, you can access it from your iPhone:

Check the LAN IP printed by the script
Use http://[YOUR_LAN_IP]:11434/v1 as the base URL

Why Ollama?

Works perfectly on macOS ARM64 (Apple Silicon)
Provides OpenAI API compatibility at /v1 endpoints
Easy one-command setup

Project Structure

/
├── run_gpt_ollama.sh      # Start the Ollama server
├── status_gpt_ollama.sh   # Check server status
├── stop_gpt_ollama.sh     # Stop the Ollama server
└── test-client/           # Test client examples
    ├── package.json
    ├── test-openai.js     # OpenAI SDK example
    └── test-direct.js     # Direct Ollama API example

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.claude		.claude
test-client		test-client
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.md		README.md
run_gpt_ollama.sh		run_gpt_ollama.sh
status_gpt_ollama.sh		status_gpt_ollama.sh
stop_gpt_ollama.sh		stop_gpt_ollama.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Local GPT-OSS 20B Server

Quick Start

What's GPT-OSS?

🧠 Unique Feature: Reasoning Tokens

Usage with OpenAI SDK

iPhone Access

Why Ollama?

Project Structure

About

Uh oh!

Releases

Packages

Languages

marckraw/run_local_llms

Folders and files

Latest commit

History

Repository files navigation

Local GPT-OSS 20B Server

Quick Start

What's GPT-OSS?

🧠 Unique Feature: Reasoning Tokens

Usage with OpenAI SDK

iPhone Access

Why Ollama?

Project Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages