gcp clound run function - configuring scaling and concurrency

To change your cloud function cpu and scaling configurations, go to your cloud run function and select "Edit and Deploy new revision" as shown below:-


Proceed to update the memory, cpu allocations. 




Update the concurrency if required to take advantage of the higher cpus


According to the docs, each instance can handle up to 80 concurrency request but you can change this to 1000.

More details

https://cloud.google.com/run/docs/about-concurrency



Comments

Popular posts from this blog

vllm : Failed to infer device type

android studio kotlin source is null error

gemini cli getting file not defined error