replicate
chat
image
audio
open-source
Test your Replicate API key.
Validate a Replicate API token and benchmark the round trip against api.replicate.com.
Stateless proxy — keys never logged, stored, or persisted. What happens to your key →
Detected
Replicate
What this key does
Replicate runs open-source models behind a uniform HTTP API. Auth is Bearer with an r8_... token. Models are referenced as owner/name; predictions are async by default with a callback / polling pattern.
How to get a Replicate API key
- Sign in at replicate.com.
- Open Account → API Tokens.
- Generate a token starting with r8_.
- Paste it here.
Common errors and fixes
- 401 Unauthorized: Key is invalid, revoked, or pasted with extra whitespace. Generate a new key from the provider console and try again.
- 403 Forbidden: Key is valid but lacks permission for this resource. Check project / org / workspace scope, or that billing is set up for this key.
- 429 Too Many Requests: You hit the per-minute or per-day rate limit. Wait a moment and retry, or upgrade your tier.
- 404 Not Found: The endpoint or model id changed. Check the provider docs for the current path and model identifier.
- 5xx: The provider is having issues. Check their status page before assuming the bug is yours.
Security best practices
- Store keys in an env var or secret manager — never commit them to a repo, even a private one.
- Restrict scope: prefer per-project or per-deployment keys over a single root key shared across services.
- Rotate on a schedule (90 days is a sane default) and immediately on suspected leak.
- Audit usage in the provider console after rotation to confirm the old key has zero traffic.
- Set per-key spend limits where the provider supports them, so a leaked key has a bounded blast radius.
Pricing at a glance
Replicate bills per second of GPU time. Different models run on different hardware tiers (T4, A40, A100, H100) with different per-second rates.
FAQ
- Are predictions sync or async?
- Async by default — POST returns a prediction id you poll. Some smaller models support sync via Prefer: wait header.
- How do I list available models?
- Replicate has tens of thousands. Use the featured-models collection or browse replicate.com/explore.
- Can I run my own model?
- Yes — push a Cog image and it becomes an endpoint.
- Free tier?
- Sign-up credits exist; no permanent free tier.
- Does Replicate support webhooks?
- Yes — set webhook on the prediction and get a POST when it finishes.
- Can I cancel a prediction?
- Yes, via the cancel endpoint.
Test other providers
Related reading
- API key security best practices for LLMsHow to store, scope, rotate, and revoke LLM API keys without leaking them through git, logs, or shared environments.
- Free LLM API keys for testing in 2026Which providers offer free credits, how long they last, and how to stretch them for prototyping without a credit card.