Container Management
Concurrency
The recommended level of concurrency (the optimal number of simultaneous requests to make to the container) for the CPU and GPU containers.
The recommended level of concurrency, i.e. the optimal number of simultaneous requests to make to the container is covered below for the CPU and GPU containers. The recommended concurrency level is driven primarily by the compute requirement of Limina’s Neural Network models, such as for PII detection. For an example of how to make concurrent requests, please visit our examples repository.