Fair Use Policy
Last updated: 23/05/2026
What ‘flat fee’ means
AI Foundry Inference is sold as a flat monthly subscription. Tokens are not metered for billing. Token counts are reported in API responses for your own visibility, but you will never receive a per-token invoice.
Concurrency limits
Each API key has a hard cap of 8 concurrent in-flight requests. Requests beyond that returnHTTP 429with a Retry-After header — the standard OpenAI convention every SDK already handles.
We do not publish a customer-visible RPM or TPM cap. If you need more concurrency for one model, subscribe to the same plan again (each subscription is its own API key with its own 8 slots).
Acceptable use
You agree not to use the service to:
- Generate content that is illegal under New Zealand law.
- Operate at sustained volumes designed to circumvent concurrency limits (e.g. distributing requests across many low-effort accounts).
- Resell raw API access to third parties.
- Conduct security testing without prior written agreement.
Enforcement
We monitor for abuse at the aggregate level (concurrency, repeated 429s, account anomalies). We do not log prompt or completion content for enforcement. If we believe an account is violating this policy we will contact the billing email before taking action.