Skip to Content
Quickstart

Quickstart

Your first request in under five minutes. Every ingress format works with every model — pick whichever your existing code already speaks.

Get an API key

Sign in at platform.hyperinfer.ai , open API keys, and create a key. The key (prefix hi-…) is shown once at creation — store it as an environment variable:

export HYPERINFER_API_KEY="hi-..."

Pick a model

Model slugs are identical to OpenRouter’s. List everything (no auth required):

curl https://api.hyperinfer.ai/api/v1/models

Make your first call

The OpenAI Chat Completions format at POST /api/v1/chat/completions.

curl https://api.hyperinfer.ai/api/v1/chat/completions \ -H "Authorization: Bearer $HYPERINFER_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "model": "anthropic/claude-sonnet-4.5", "messages": [ { "role": "user", "content": "Say hello in five words." } ] }'

Stream it

Add "stream": true to any request to get server-sent events in the ingress format’s native streaming protocol — see Streaming.

Notice the cross-format examples above: a Chat Completions request calling an Anthropic-slugged model and a Messages request calling an OpenAI-slugged one. That is the point — see Models & Routing.

Next steps

  • Authentication — key management, spend limits, model pinning
  • API Reference — every endpoint with a live playground
  • Pricing — per-model rates and the credits system