Rate Limits

Rate limits for the Gooey.AI API

Rate limits are restrictions that our API imposes on the number of times a user or client can access our services within a specified period of time.

Why do we have rate limits?

Rate limits are a common practice for APIs, and they're put in place for a few different reasons:

  • They help protect against abuse or misuse of the API.

  • Rate limits help ensure that everyone has fair access to the API.

  • Rate limits can help Gooey.AI manage the aggregate load on its infrastructure.

Tier        Rate Limits
Free        2 concurrent runs, 6 requests/min
Creator     4 concurrent runs, 10 requests/min
Business    4 concurrent runs, 10 requests/min
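As a rough illustration of what these numbers mean for a client, the sketch below stores the tier limits listed above and works out the even spacing between requests that stays within each requests/min value. The names `TIER_LIMITS` and `min_spacing_seconds` are hypothetical helpers for this example and are not part of the Gooey.AI API.

```python
# Tier limits copied from the list above; the dictionary layout is an
# assumption made for this example, not a Gooey.AI data structure.
TIER_LIMITS = {
    "Free": {"concurrent_runs": 2, "requests_per_minute": 6},
    "Creator": {"concurrent_runs": 4, "requests_per_minute": 10},
    "Business": {"concurrent_runs": 4, "requests_per_minute": 10},
}


def min_spacing_seconds(tier: str) -> float:
    """Return the even gap between requests that keeps a client at the tier's RPM."""
    return 60.0 / TIER_LIMITS[tier]["requests_per_minute"]


for tier in TIER_LIMITS:
    print(f"{tier}: one request every {min_spacing_seconds(tier):.0f} s")
```

On the Free tier this works out to one request every 10 seconds, and on the Creator and Business tiers one every 6 seconds. Spacing requests evenly is a conservative choice: it stays within the per-minute figure whether the minute is counted as a fixed or a rolling window.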

How do these rate limits work?

At Gooey.AI, we measure rate limits in two ways:

  • Number of runs that can execute simultaneously (concurrent runs)

  • RPM (requests per minute).

However, because we host several models from across the GenAI ecosystem, there might also be provider-level rate limits such as Tokens Per Day or Images Per Day. Please refer to the OpenAI guide for token limits and the Eleven Labs guide for TTS rate limits.
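The two Gooey.AI limits described above (concurrent runs and RPM) can also be respected on the client side before a request is ever sent. The following is a minimal sketch, not an official client: `ClientRateLimiter` and `run_workflow` are hypothetical names, and the code assumes the per-minute limit is counted over a rolling 60-second window, which this page does not specify.

```python
import threading
import time
from collections import deque


class ClientRateLimiter:
    """Client-side guard for a concurrency cap and a requests-per-minute cap."""

    def __init__(self, max_concurrent, max_per_minute,
                 clock=time.monotonic, sleep=time.sleep):
        self._slots = threading.BoundedSemaphore(max_concurrent)
        self._max_per_minute = max_per_minute
        self._starts = deque()  # start times of requests in the last 60 s
        self._lock = threading.Lock()
        self._clock = clock
        self._sleep = sleep

    def _wait_for_rpm_slot(self):
        while True:
            with self._lock:
                now = self._clock()
                # Drop start times that have left the 60-second window.
                while self._starts and now - self._starts[0] >= 60.0:
                    self._starts.popleft()
                if len(self._starts) < self._max_per_minute:
                    self._starts.append(now)
                    return
                # Time until the oldest request leaves the window.
                wait = 60.0 - (now - self._starts[0])
            self._sleep(wait)

    def __enter__(self):
        self._slots.acquire()  # blocks while max_concurrent runs are in flight
        try:
            self._wait_for_rpm_slot()
        except BaseException:
            self._slots.release()
            raise
        return self

    def __exit__(self, exc_type, exc, tb):
        self._slots.release()
        return False


def run_workflow(payload):
    """Hypothetical placeholder; replace with your actual API request."""
    return {"echo": payload}


# Free tier values from this page: 2 concurrent runs, 6 requests/min.
limiter = ClientRateLimiter(max_concurrent=2, max_per_minute=6)
with limiter:
    result = run_workflow({"text": "hello"})
```

Sharing one limiter instance across all worker threads keeps the whole process within both limits. It does not account for the provider-level limits (tokens or images per day) mentioned above, which would need separate tracking.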

Note:

  • Rate limits are defined at the user and org level.

  • Rate limits may vary by the model being used at the model provider level.

If you require a higher number of concurrent requests, please reach out to our Sales team at sales [at] gooey [dot] ai.
