Text Worker
Run LLMs on The Grid. Earn AIPG for powering chat and text generation.
Worker bond: 1,000 AIPG minimum. Workers stake AIPG to register on-chain
via the WorkerRegistry Diamond facet. The stake is slashable if you serve
bad results — that’s what gives users confidence that your responses are real.
See Run a Node for the bigger picture.
What You’re Running
The Grid Inference Worker connects your local LLM backend to The Grid:
- You run a backend (Ollama, vLLM, etc.) with your models
- The worker connects to The Grid
- Users chat at aipg.chat or via API
- Your hardware generates responses
- You earn AIPG
No Python required for pre-built binaries. Just download, run, and follow the setup wizard.
Requirements
| Component | Minimum | Recommended |
|---|---|---|
| GPU VRAM | 4 GB | 12 GB+ |
| RAM | 8 GB | 16 GB+ |
| Storage | 20 GB | 50 GB+ SSD |
CPU-only is possible for small models but significantly slower.
VRAM by Model Size
| Model | VRAM | Examples |
|---|---|---|
| 3B | 4 GB | Llama 3.2 3B, Phi-3 Mini |
| 7-8B | 8 GB | Llama 3 8B, Mistral 7B |
| 13B | 16 GB | Llama 2 13B |
| 70B | 48 GB+ | Llama 2 70B |
Quick Start
1. Install Ollama
curl -fsSL https://ollama.com/install.sh | sh
ollama pull llama3.2:3b2. Get an API Key
Register at dashboard.aipowergrid.io
3. Download the Worker
Grab the binary for your platform from Releases:
| Platform | File |
|---|---|
| Windows | grid-inference-worker-windows-x64.exe |
| macOS | grid-inference-worker-macos-arm64.zip |
| Linux | grid-inference-worker-linux-x64 |
Windows — Double-click the exe.
macOS — Unzip, open the app.
Linux — chmod +x grid-inference-worker-linux-x64 && ./grid-inference-worker-linux-x64
4. Configure via Web Wizard
The worker opens a setup wizard at http://localhost:7861:
- Enter your API key
- Select your backend (Ollama auto-detected)
- Choose a model
- Click Start
That’s it. Jobs come in automatically.
Alternative: Run from Source
git clone https://github.com/AIPowerGrid/grid-inference-worker
cd grid-inference-worker
pip install -e .
grid-inference-workerAlternative: Docker
git clone https://github.com/AIPowerGrid/grid-inference-worker
cd grid-inference-worker
cp .env.example .env
# Edit .env with your API key
docker compose up -dConfiguration
Via Web Wizard (Recommended)
Everything configurable at http://localhost:7861.
Via CLI
grid-inference-worker \
--api-key YOUR_KEY \
--model llama3.2:3b \
--backend-url http://localhost:11434Via Environment
# .env
GRID_API_KEY=your-key
MODEL_NAME=llama3.2:3b
BACKEND_TYPE=ollama
OLLAMA_URL=http://127.0.0.1:11434
GRID_WORKER_NAME=my-workerInstall as Service
Auto-start on boot:
grid-inference-worker --install-serviceSupported Backends
| Backend | Type | Notes |
|---|---|---|
| Ollama | ollama | Easiest setup |
| LM Studio | ollama | GUI-based |
| vLLM | openai | High performance |
| SGLang | openai | Fast inference |
| KoboldCpp | openai | CPU optimized |
Anything with an OpenAI-compatible API works. Set BACKEND_TYPE=openai and point OPENAI_URL at it.
Earning
How It Works
- User sends a message at aipg.chat or via API
- The Grid routes it to your worker
- Your model generates the response
- You earn AIPG
What Earns Most
- Popular models — Llama 3, Mistral get the most traffic
- Fast responses — quicker = more jobs/hour
- 24/7 uptime — consistent availability
- Streaming — enable for real-time token delivery
Troubleshooting
Worker won’t start?
- Check Python 3.9+ if running from source
- Verify Ollama is running:
ollama list
No jobs?
- Normal during low traffic
- Check worker shows online in dashboard
- Make sure you’re running a popular model
Out of memory?
- Use smaller model (3B vs 7B)
- Try quantized versions (q4_0)
- Close other GPU apps
Links
| Resource | URL |
|---|---|
| Repository | github.com/AIPowerGrid/grid-inference-worker |
| Releases | github.com/AIPowerGrid/grid-inference-worker/releases |
| Get API Key | dashboard.aipowergrid.io |
| Ollama | ollama.com |
| Discord | discord.gg/W9D8j6HCtC |