Create your key
Sign up, generate a consumer key (ohk_…) and top up your wallet. No GPUs to provision, no infrastructure to run.
AI demand is growing faster than the world can build for it — yet powerful GPUs sit idle everywhere. OpenHosst Grid puts them to work, serving open models through one OpenAI-compatible API. More capacity for AI, from hardware that already exists — and a lighter footprint for the planet.
Drop-in OpenAI API • Automatic failover • Pay per token • No subscriptions
Llama 3.2, Qwen 2.5, Mistral, Phi and more are being served right now by providers around the world. You see live availability and the cheapest node — platform fee included — before a single token is spent.
Sign up, generate a consumer key (ohk_…) and top up your wallet. No GPUs to provision, no infrastructure to run.
Swap your OpenAI base URL for the grid endpoint. Every OpenAI-compatible client, SDK and framework works unchanged.
We route to the best available GPU and fail over automatically if one drops. You're billed only for completed requests — nothing else.
from openai import OpenAI
client = OpenAI(
base_url="https://grid.openhosst.com/v1",
api_key="ohk_your_key_here",
)
resp = client.chat.completions.create(
model="llama3.2",
messages=[{"role": "user", "content": "Hello, grid!"}],
stream=True,
)
for chunk in resp:
print(chunk.choices[0].delta.content or "", end="")
Run the open-source daemon beside your local Ollama and the grid feeds it paid jobs. Set your own prices, stay private behind an outbound-only connection, and cash out with Stripe.
# 1. Install Ollama + pull a model
ollama pull llama3.2
# 2. Run the OpenHosst provider daemon
export OPENHOSST_PROVIDER_KEY=ohp_your_key
npx @openhosst/provider