Text Worker

Run LLMs on The Grid. Earn USDC + AIPG for powering chat and text generation.

No bond required to run a worker today. You can connect a worker and start serving with just an API key. A planned on-chain bonding mechanism (the WorkerRegistry Diamond facet, ~1,000 AIPG, slashable for bad results) is in development and not yet required. See Run a Node for the bigger picture.

What You’re Running

The Grid Inference Worker connects your local LLM backend to The Grid:

You run a backend (Ollama, vLLM, etc.) with your models
The worker connects to The Grid
Users chat at aipg.chat or via API
Your hardware generates responses
You earn USDC + AIPG

No Python required for pre-built binaries. Just download, run, and follow the setup wizard.

Requirements

Component	Minimum	Recommended
GPU VRAM	4 GB	12 GB+
RAM	8 GB	16 GB+
Storage	20 GB	50 GB+ SSD

CPU-only is possible for small models but significantly slower.

VRAM by Model Size

Model	VRAM	Examples
3B	4 GB	Llama 3.2 3B, Phi-3 Mini
7-8B	8 GB	Llama 3 8B, Mistral 7B
13B	16 GB	Llama 2 13B
70B	48 GB+	Llama 2 70B

Quick Start

1. Install Ollama

curl -fsSL https://ollama.com/install.sh | sh
ollama pull llama3.2:3b

2. Get an API Key

3. Download the Worker

Grab the binary for your platform from Releases:

Platform	File
Windows	`grid-inference-worker-windows-x64.exe`
macOS	`grid-inference-worker-macos-arm64.zip`
Linux	`grid-inference-worker-linux-x64`

Windows — Double-click the exe.

macOS — Unzip, open the app.

Linux — chmod +x grid-inference-worker-linux-x64 && ./grid-inference-worker-linux-x64

4. Configure via Web Wizard

The worker opens a setup wizard at http://localhost:7861:

Enter your API key
Select your backend (Ollama auto-detected)
Choose a model
Click Start

That’s it. Jobs come in automatically.

Alternative: Run from Source

git clone https://github.com/AIPowerGrid/grid-inference-worker
cd grid-inference-worker
pip install -e .
grid-inference-worker

Alternative: Docker

git clone https://github.com/AIPowerGrid/grid-inference-worker
cd grid-inference-worker
cp .env.example .env
# Edit .env with your API key
docker compose up -d

Configuration

Via Web Wizard (Recommended)

Everything configurable at http://localhost:7861.

Via CLI

grid-inference-worker \
  --api-key YOUR_KEY \
  --model llama3.2:3b \
  --backend-url http://localhost:11434

Via Environment

# .env
GRID_API_KEY=your-key
MODEL_NAME=llama3.2:3b
BACKEND_TYPE=ollama
OLLAMA_URL=http://127.0.0.1:11434
GRID_WORKER_NAME=my-worker

Install as Service

Auto-start on boot:

grid-inference-worker --install-service

Supported Backends

Backend	Type	Notes
Ollama	`ollama`	Easiest setup
LM Studio	`ollama`	GUI-based
vLLM	`openai`	High performance
SGLang	`openai`	Fast inference
KoboldCpp	`openai`	CPU optimized

Anything with an OpenAI-compatible API works. Set BACKEND_TYPE=openai and point OPENAI_URL at it.

Earning

How It Works

User sends a message at aipg.chat or via API
The Grid routes it to your worker
Your model generates the response
You earn USDC + AIPG

What Earns Most

Popular models — gpt-oss-120b, qwen3-27b get the most traffic
Fast responses — quicker = more jobs/hour
24/7 uptime — consistent availability
Streaming — enable for real-time token delivery

Troubleshooting

Worker won’t start?

Check Python 3.9+ if running from source
Verify Ollama is running: ollama list

No jobs?

Normal during low traffic
Check worker shows online in dashboard
Make sure you’re running a popular model

Out of memory?

Use smaller model (3B vs 7B)
Try quantized versions (q4_0)
Close other GPU apps

Links

Resource	URL
Repository	github.com/AIPowerGrid/grid-inference-worker
Releases	github.com/AIPowerGrid/grid-inference-worker/releases
Get API Key	console.aipowergrid.io
Ollama	ollama.com
Discord	discord.gg/W9D8j6HCtC

Overview Image Worker