Quickstart
Your first request in under five minutes. Every ingress format works with every model — pick whichever your existing code already speaks.
Get an API key
Sign in at platform.hyperinfer.ai, open API keys,
and create a key. The key (prefix hi-…) is shown once at creation — store it as an
environment variable:
export HYPERINFER_API_KEY="hi-..."
Pick a model
Model slugs are identical to OpenRouter’s. List everything (no auth required):
curl https://api.hyperinfer.ai/api/v1/models
Make your first call
Chat Completions
The OpenAI Chat Completions format at POST /api/v1/chat/completions.
curl
curl https://api.hyperinfer.ai/api/v1/chat/completions \
  -H "Authorization: Bearer $HYPERINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "anthropic/claude-sonnet-4.5",
    "messages": [
      { "role": "user", "content": "Say hello in five words." }
    ]
  }'
Stream it
Add "stream": true to any request to get server-sent events in the ingress format’s
native streaming protocol — see Streaming.
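For callers working in Python, a streaming Chat Completions request might be handled roughly as follows. This is a minimal sketch using only the standard library, not an official client. It assumes the stream follows the OpenAI Chat Completions chunk shape (lines of the form data: {...} carrying choices[].delta.content, ending with data: [DONE]); check the Streaming page for the authoritative format. The helper names build_stream_request, iter_text_deltas, and stream_chat are illustrative and do not come from HyperInfer.

```python
import json
import os
import urllib.request
from typing import Iterable, Iterator

API_URL = "https://api.hyperinfer.ai/api/v1/chat/completions"


def build_stream_request(api_key: str, model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a Chat Completions request with "stream": true."""
    body = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": True,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def iter_text_deltas(lines: Iterable[bytes]) -> Iterator[str]:
    """Yield text fragments from server-sent event lines.

    Assumption: OpenAI-style chunks, i.e. each "data:" line holds a JSON
    object with choices[].delta.content, and "data: [DONE]" ends the stream.
    """
    for raw in lines:
        line = raw.decode("utf-8").strip()
        if not line.startswith("data:"):
            continue  # skip blank separators, comments, and other SSE fields
        data = line[len("data:"):].strip()
        if data == "[DONE]":
            break
        chunk = json.loads(data)
        for choice in chunk.get("choices", []):
            text = (choice.get("delta") or {}).get("content")
            if text:
                yield text


def stream_chat(model: str, prompt: str) -> None:
    """Send the request and print fragments as they arrive (network I/O)."""
    request = build_stream_request(os.environ["HYPERINFER_API_KEY"], model, prompt)
    with urllib.request.urlopen(request) as response:
        # An HTTPResponse iterates line by line, yielding bytes.
        for text in iter_text_deltas(response):
            print(text, end="", flush=True)
    print()
```

With HYPERINFER_API_KEY exported, calling stream_chat("anthropic/claude-sonnet-4.5", "Say hello in five words.") would send the same request as the curl example, plus "stream": true. The parser is kept separate from the network call so it can be exercised against recorded lines without contacting the API.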
Notice the cross-format examples above: a Chat Completions request calling an Anthropic-slugged model and a Messages request calling an OpenAI-slugged one. That is the point — see Models & Routing.
Next steps
- Authentication — key management, spend limits, model pinning
- API Reference — every endpoint with a live playground
- Pricing — per-model rates and the credits system