Rate Limits for Models API
The Models API applies a standard rate limit to all requests except those made through Apex methods in sandbox, demo, and trial orgs, which have separate limits.
The following limit applies to Apex and REST requests in regular production orgs.
- 300 requests per minute per org for each REST endpoint (or equivalent Apex method).
However, the Embeddings and Feedback capabilities have a higher rate limit.
- 1,000 requests per minute per org (for embeddings or feedback).
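One way to stay under these per-minute limits is to throttle on the client side before sending. The following is a minimal sketch, not part of the Models API: the class name `SlidingWindowLimiter` and its methods are hypothetical, and it assumes a simple sliding 60-second window. This section does not describe how the server measures its window, and the limit is shared by the whole org, so a limiter in one client cannot account for requests sent by other clients.

```python
import time
from collections import deque

# Limits quoted in this section (requests per minute per org).
STANDARD_LIMIT = 300
EMBEDDINGS_FEEDBACK_LIMIT = 1000


class SlidingWindowLimiter:
    """Hypothetical client-side guard: at most `max_requests` per window."""

    def __init__(self, max_requests, window_seconds=60.0, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.clock = clock      # injectable so the limiter can be tested
        self._sent = deque()    # timestamps of requests still inside the window

    def try_acquire(self):
        """Return True and record the request if it fits in the current window."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self._sent and now - self._sent[0] >= self.window_seconds:
            self._sent.popleft()
        if len(self._sent) < self.max_requests:
            self._sent.append(now)
            return True
        return False
```

A caller would create one limiter per endpoint, for example `SlidingWindowLimiter(STANDARD_LIMIT)`, and send a request only when `try_acquire()` returns `True`.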
If a rate limit is exceeded, the Models API returns a response with a 429 status and an error object.
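A client can treat a 429 status as a signal to wait and retry. The sketch below shows one common approach, exponential backoff with jitter. It is an illustration under stated assumptions: `send` is a hypothetical zero-argument callable that performs the request and returns a `(status, body)` tuple, and the delay values are arbitrary defaults. This section does not say whether the response carries any retry hint, so none is used here.

```python
import random
import time


def call_with_retry(send, max_attempts=5, base_delay=1.0, max_delay=30.0,
                    sleep=time.sleep):
    """Call `send()` until it returns a non-429 status or attempts run out.

    `send` is a hypothetical callable returning (status, body); it stands in
    for whatever code actually issues the Models API request.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, body = send()
        if status != 429:
            return status, body
        if attempt < max_attempts - 1:
            # Exponential backoff capped at max_delay, with full jitter so
            # that many clients do not retry at the same instant.
            delay = min(max_delay, base_delay * 2 ** attempt)
            sleep(random.uniform(0, delay))
    # Still rate limited after the final attempt: return the last response,
    # including its error object, so the caller can inspect it.
    return status, body
```

Retrying does not raise the limit itself. If requests are rejected regularly, reducing the request rate or requesting a limit increase is the more durable fix.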
To request a rate limit increase, reach out to your Salesforce account executive.
The Apex methods of the Models API are also subject to the Apex Callout Limits and Limitations.
All Models API requests are subject to Salesforce’s usage and billing rates. See Einstein Usage.
The following limits apply only to requests made through Apex methods in sandbox, demo, and trial orgs. Requests that these orgs make through the Models REST API are subject to the per-minute, per-org limit for each REST endpoint described above.
Sandbox Orgs
The rate limit value for Apex methods in sandbox orgs is 200 requests per hour per org.
Demo and Trial Orgs
The rate limit value for Apex methods in demo and trial orgs is 150 requests per hour per org.
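The limits in this section can be summarized as a small lookup. The function below is only a reading aid: the name `models_api_limit` and the string labels for org types and capabilities are hypothetical. It also assumes that the hourly Apex limits in sandbox, demo, and trial orgs apply to every capability, including embeddings and feedback, and that the higher embeddings and feedback limit applies to REST requests from those orgs. This section does not state either point explicitly.

```python
def models_api_limit(org_type, via_apex, capability="generation"):
    """Return (max_requests, period) per org, as read from this section.

    org_type:   "production", "sandbox", "demo", or "trial" (hypothetical labels)
    via_apex:   True for Apex methods, False for the REST API
    capability: "embeddings" and "feedback" get the higher per-minute limit;
                any other label is treated as a standard request
    """
    # Apex-only limits for non-production orgs (assumed to cover all capabilities).
    if via_apex and org_type == "sandbox":
        return 200, "hour"
    if via_apex and org_type in ("demo", "trial"):
        return 150, "hour"
    # Per-minute limits for each REST endpoint or equivalent Apex method.
    if capability in ("embeddings", "feedback"):
        return 1000, "minute"
    return 300, "minute"
```

For example, `models_api_limit("sandbox", via_apex=True)` returns `(200, "hour")`, while the same org calling the REST API gets `(300, "minute")`.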
- Models API Developer Guide: Access Models API with REST
- Models API Developer Guide: Access Models API with Apex
- Models API Developer Guide: Supported Models for Models API