What is the concurrency limit for the API?
Last updated: June 19, 2025
A concurrency limit is the maximum number of requests that can be in progress for an account at any one time.
Each account has an assigned concurrency limit (also referred to as a rate limit). The default is 24 active tasks, except for flux-kontext-max, where requests to our API are limited to 6 active tasks.
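One way to stay under the limit is to cap the number of tasks your own client runs at once. The following is a minimal sketch, not an official client: `run_task` is a hypothetical function that you supply, assumed to submit one task and block until that task has finished, and `limit_for` and `run_all` are illustrative names. Only the numbers 24 and 6 and the model name flux-kontext-max come from this article.

```python
from concurrent.futures import ThreadPoolExecutor

DEFAULT_LIMIT = 24       # default concurrency limit per account
KONTEXT_MAX_LIMIT = 6    # lower limit that applies to flux-kontext-max


def limit_for(model: str) -> int:
    """Return the concurrency limit that applies to the given model."""
    return KONTEXT_MAX_LIMIT if model == "flux-kontext-max" else DEFAULT_LIMIT


def run_all(model, jobs, run_task):
    """Run run_task(job) for every job, with at most limit_for(model) in flight.

    run_task is a hypothetical callable: it should submit one task and
    return only once that task has finished.
    """
    # The pool size caps how many run_task calls execute simultaneously.
    with ThreadPoolExecutor(max_workers=limit_for(model)) as pool:
        # pool.map returns results in the same order as jobs.
        return list(pool.map(run_task, jobs))
```

If your account has been given a different limit, replace the constants with your own values.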
If you exceed your limit, the API returns status code 429, and you will have to wait until one of your previous tasks has finished before submitting another.
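A client can handle a 429 by waiting and then trying again. The sketch below is one possible approach under stated assumptions: `send_request` is a hypothetical callable that you supply and that returns a `(status_code, body)` pair, and the exponential backoff schedule is an illustrative choice, not something the API prescribes. The only behavior taken from this article is that a 429 means you are over your limit.

```python
import time


def call_with_retry(send_request, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call send_request() until it returns a status other than 429.

    send_request is a hypothetical callable returning (status_code, body).
    The delay doubles after each 429: base_delay, 2 * base_delay, and so on.
    If every attempt returns 429, the last (429, body) pair is returned.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, body = send_request()
        if status != 429:
            return status, body
        # Still over the limit: wait before the next attempt, but do not
        # sleep after the final attempt.
        if attempt < max_attempts - 1:
            sleep(base_delay * 2 ** attempt)
    return status, body
```

The `sleep` parameter exists only so the waiting can be replaced in tests. If you already track when your own tasks finish, submitting the next request at that point avoids the fixed delays entirely.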
If you require higher volumes, please contact us at support@blackforestlabs.ai.