update_endpoint_weights_and_capacities

SageMaker.Client.update_endpoint_weights_and_capacities(**kwargs)

Updates the variant weight of one or more variants associated with an existing endpoint, or the capacity of one variant associated with an existing endpoint. When it receives the request, SageMaker sets the endpoint status to Updating. After the update completes, it sets the status to InService. To check the status of an endpoint, use the DescribeEndpoint API (describe_endpoint on this client).

See also: AWS API Documentation

Request Syntax

response = client.update_endpoint_weights_and_capacities(
    EndpointName='string',
    DesiredWeightsAndCapacities=[
        {
            'VariantName': 'string',
            'DesiredWeight': ...,
            'DesiredInstanceCount': 123,
            'ServerlessUpdateConfig': {
                'MaxConcurrency': 123,
                'ProvisionedConcurrency': 123
            }
        },
    ]
)
Parameters:
  • EndpointName (string) –

    [REQUIRED]

    The name of an existing SageMaker endpoint.

  • DesiredWeightsAndCapacities (list) –

    [REQUIRED]

    A list of objects, each of which provides new capacity and weight values for a variant.

    • (dict) –

      Specifies weight and capacity values for a production variant.

      • VariantName (string) – [REQUIRED]

        The name of the variant to update.

      • DesiredWeight (float) –

        The variant’s weight, which determines its share of endpoint traffic relative to the other variants.

      • DesiredInstanceCount (integer) –

        The variant’s capacity, expressed as a number of instances.

      • ServerlessUpdateConfig (dict) –

        Specifies the serverless update concurrency configuration for an endpoint variant.

        • MaxConcurrency (integer) –

          The updated maximum number of concurrent invocations your serverless endpoint can process.

        • ProvisionedConcurrency (integer) –

          The updated amount of provisioned concurrency to allocate for the serverless endpoint. Should be less than or equal to MaxConcurrency.
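The parameter structure above can be assembled with a small helper. The following is a minimal sketch, not part of boto3: build_variant_update is a hypothetical function, and the endpoint and variant names are placeholders. It only builds the request dictionary and checks locally that ProvisionedConcurrency does not exceed MaxConcurrency; it does not call AWS.

```python
from typing import Any, Dict, Optional


def build_variant_update(
    variant_name: str,
    desired_weight: Optional[float] = None,
    desired_instance_count: Optional[int] = None,
    max_concurrency: Optional[int] = None,
    provisioned_concurrency: Optional[int] = None,
) -> Dict[str, Any]:
    """Build one DesiredWeightsAndCapacities entry (hypothetical helper)."""
    # VariantName is the only required key in each entry.
    entry: Dict[str, Any] = {"VariantName": variant_name}
    if desired_weight is not None:
        entry["DesiredWeight"] = float(desired_weight)
    if desired_instance_count is not None:
        entry["DesiredInstanceCount"] = int(desired_instance_count)

    # Local sanity check mirroring the documented constraint.
    if (
        max_concurrency is not None
        and provisioned_concurrency is not None
        and provisioned_concurrency > max_concurrency
    ):
        raise ValueError("ProvisionedConcurrency must not exceed MaxConcurrency")

    serverless: Dict[str, int] = {}
    if max_concurrency is not None:
        serverless["MaxConcurrency"] = int(max_concurrency)
    if provisioned_concurrency is not None:
        serverless["ProvisionedConcurrency"] = int(provisioned_concurrency)
    if serverless:
        entry["ServerlessUpdateConfig"] = serverless
    return entry


# Placeholder names; substitute a real endpoint and its variants.
request = {
    "EndpointName": "my-endpoint",
    "DesiredWeightsAndCapacities": [
        build_variant_update("variant-a", desired_weight=0.9),
        build_variant_update("variant-b", desired_weight=0.1),
    ],
}
```

The resulting dictionary can be passed as client.update_endpoint_weights_and_capacities(**request).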

Return type:

dict

Returns:

Response Syntax

{
    'EndpointArn': 'string'
}

Response Structure

  • (dict) –

    • EndpointArn (string) –

      The Amazon Resource Name (ARN) of the updated endpoint.
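As an illustration of the Updating to InService transition described at the top of this page, the sketch below issues the update, reads EndpointArn from the response, and then polls describe_endpoint until the status is InService. The function update_and_wait, its polling interval, and its timeout are assumptions made for this example, not part of boto3. Any status other than InService is simply polled again until the timeout expires.

```python
import time
from typing import Any, Dict, List


def update_and_wait(
    client: Any,
    endpoint_name: str,
    updates: List[Dict[str, Any]],
    poll_seconds: float = 30.0,
    timeout_seconds: float = 1800.0,
) -> str:
    """Apply the update, then poll until the endpoint is InService.

    `client` is expected to be a SageMaker client, for example the
    object returned by boto3.client("sagemaker"). Returns the ARN.
    """
    response = client.update_endpoint_weights_and_capacities(
        EndpointName=endpoint_name,
        DesiredWeightsAndCapacities=updates,
    )
    endpoint_arn = response["EndpointArn"]

    deadline = time.monotonic() + timeout_seconds
    while True:
        # EndpointStatus is a field of the describe_endpoint response.
        status = client.describe_endpoint(EndpointName=endpoint_name)["EndpointStatus"]
        if status == "InService":
            return endpoint_arn
        if time.monotonic() >= deadline:
            raise TimeoutError(
                f"Endpoint {endpoint_name} still in status {status} "
                f"after {timeout_seconds} seconds"
            )
        time.sleep(poll_seconds)
```

A call might look like update_and_wait(boto3.client("sagemaker"), "my-endpoint", [{"VariantName": "variant-a", "DesiredInstanceCount": 2}]), where the endpoint and variant names are placeholders. The function takes the client as an argument so that it can be exercised with a stub in tests.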

Exceptions