Resources

class verta.endpoint.resources.Resources(cpu=None, memory=None, nvidia_gpu=None)

Kubernetes computational resources allowed for an endpoint’s model, to be passed to Endpoint.update().

New in version 0.24.1: The nvidia_gpu parameter.

The JSON equivalent for this is:

{
    "resources": {"cpu": 0.25, "memory": "512Mi"}
}
Parameters:
  • cpu (float > 0) – CPU cores allowed for an endpoint’s model.

  • memory (str) –

    Memory allowed for an endpoint’s model. Expects the same representation as Kubernetes:

    You can express memory as a plain integer or as a fixed-point integer using one of these suffixes: E, P, T, G, M, K. You can also use the power-of-two equivalents: Ei, Pi, Ti, Gi, Mi, Ki. For example, the following represent roughly the same value: 128974848, 129e6, 129M, 123Mi.

  • nvidia_gpu (NvidiaGPU, optional) – Nvidia GPU resources allowed for an endpoint’s model.

Examples

from verta.endpoint.resources import Resources, NvidiaGPU, NvidiaGPUModel
resources = Resources(cpu=.25, memory="512Mi", nvidia_gpu=NvidiaGPU(1, NvidiaGPUModel.V100)