How to Queue the multiple API requests at same time in FastAPI with Gunicorn


How can I queue the multiple API request in FastAPI.

For example if there is an API which takes 10 seconds to complete the request and there are 10 API requests at same time then how can We handle that part.

If anyone can help on this then it would be a great help


I tried to use the selenium using fastAPI and Gunicorn and its working fine if there one request at a time but if I will do multiple request the selemium generate error.
So I’m trying to find a solution like queue processing in FastAPI.


Queueing is done by the server (gunicorn in your case), not by FastAPI.

Gunicorn has a backlog where the requests are being queued. You can control it using --backlog=N switch. The default backlog size is 2048, so the error is probably due to a timeout on the client. Try to increase it.

You can spawn multiple workers in Gunicorn, use --workers=N switch. If you worker is CPU-bound, you should set N to the number of available cores. It’s a better solution.

And, if your worker is IO bound, use an asynchronous model to process more requests at once. This is the best solution.

Answered By – szrg

This Answer collected from stackoverflow, is licensed under cc by-sa 2.5 , cc by-sa 3.0 and cc by-sa 4.0

Leave a Reply

(*) Required, Your email will not be published