SageMaker / Client / update_compute_quota

update_compute_quota#

SageMaker.Client.update_compute_quota(**kwargs)#

Update the compute allocation definition.

See also: AWS API Documentation

Request Syntax

response = client.update_compute_quota(
    ComputeQuotaId='string',
    TargetVersion=123,
    ComputeQuotaConfig={
        'ComputeQuotaResources': [
            {
                'InstanceType': 'ml.p4d.24xlarge'|'ml.p4de.24xlarge'|'ml.p5.48xlarge'|'ml.trn1.32xlarge'|'ml.trn1n.32xlarge'|'ml.g5.xlarge'|'ml.g5.2xlarge'|'ml.g5.4xlarge'|'ml.g5.8xlarge'|'ml.g5.12xlarge'|'ml.g5.16xlarge'|'ml.g5.24xlarge'|'ml.g5.48xlarge'|'ml.c5.large'|'ml.c5.xlarge'|'ml.c5.2xlarge'|'ml.c5.4xlarge'|'ml.c5.9xlarge'|'ml.c5.12xlarge'|'ml.c5.18xlarge'|'ml.c5.24xlarge'|'ml.c5n.large'|'ml.c5n.2xlarge'|'ml.c5n.4xlarge'|'ml.c5n.9xlarge'|'ml.c5n.18xlarge'|'ml.m5.large'|'ml.m5.xlarge'|'ml.m5.2xlarge'|'ml.m5.4xlarge'|'ml.m5.8xlarge'|'ml.m5.12xlarge'|'ml.m5.16xlarge'|'ml.m5.24xlarge'|'ml.t3.medium'|'ml.t3.large'|'ml.t3.xlarge'|'ml.t3.2xlarge'|'ml.g6.xlarge'|'ml.g6.2xlarge'|'ml.g6.4xlarge'|'ml.g6.8xlarge'|'ml.g6.16xlarge'|'ml.g6.12xlarge'|'ml.g6.24xlarge'|'ml.g6.48xlarge'|'ml.gr6.4xlarge'|'ml.gr6.8xlarge'|'ml.g6e.xlarge'|'ml.g6e.2xlarge'|'ml.g6e.4xlarge'|'ml.g6e.8xlarge'|'ml.g6e.16xlarge'|'ml.g6e.12xlarge'|'ml.g6e.24xlarge'|'ml.g6e.48xlarge'|'ml.p5e.48xlarge'|'ml.p5en.48xlarge'|'ml.trn2.48xlarge'|'ml.c6i.large'|'ml.c6i.xlarge'|'ml.c6i.2xlarge'|'ml.c6i.4xlarge'|'ml.c6i.8xlarge'|'ml.c6i.12xlarge'|'ml.c6i.16xlarge'|'ml.c6i.24xlarge'|'ml.c6i.32xlarge'|'ml.m6i.large'|'ml.m6i.xlarge'|'ml.m6i.2xlarge'|'ml.m6i.4xlarge'|'ml.m6i.8xlarge'|'ml.m6i.12xlarge'|'ml.m6i.16xlarge'|'ml.m6i.24xlarge'|'ml.m6i.32xlarge'|'ml.r6i.large'|'ml.r6i.xlarge'|'ml.r6i.2xlarge'|'ml.r6i.4xlarge'|'ml.r6i.8xlarge'|'ml.r6i.12xlarge'|'ml.r6i.16xlarge'|'ml.r6i.24xlarge'|'ml.r6i.32xlarge',
                'Count': 123
            },
        ],
        'ResourceSharingConfig': {
            'Strategy': 'Lend'|'DontLend'|'LendAndBorrow',
            'BorrowLimit': 123
        },
        'PreemptTeamTasks': 'Never'|'LowerPriority'
    },
    ComputeQuotaTarget={
        'TeamName': 'string',
        'FairShareWeight': 123
    },
    ActivationState='Enabled'|'Disabled',
    Description='string'
)
Parameters:
  • ComputeQuotaId (string) –

    [REQUIRED]

    ID of the compute allocation definition.

  • TargetVersion (integer) –

    [REQUIRED]

    Target version.

  • ComputeQuotaConfig (dict) –

    Configuration of the compute allocation definition. This includes the resource sharing option, and the setting to preempt low priority tasks.

    • ComputeQuotaResources (list) –

      Allocate compute resources by instance types.

      • (dict) –

        Configuration of the resources used for the compute allocation definition.

        • InstanceType (string) – [REQUIRED]

          The instance type of the instance group for the cluster.

        • Count (integer) – [REQUIRED]

          The number of instances to add to the instance group of a SageMaker HyperPod cluster.

    • ResourceSharingConfig (dict) –

      Resource sharing configuration. This defines how an entity can lend and borrow idle compute with other entities within the cluster.

      • Strategy (string) – [REQUIRED]

        The strategy of how idle compute is shared within the cluster. The following are the options of strategies.

        • DontLend: entities do not lend idle compute.

        • Lend: entities can lend idle compute to entities that can borrow.

        • LendandBorrow: entities can lend idle compute and borrow idle compute from other entities.

        Default is LendandBorrow.

      • BorrowLimit (integer) –

        The limit on how much idle compute can be borrowed.The values can be 1 - 500 percent of idle compute that the team is allowed to borrow.

        Default is 50.

    • PreemptTeamTasks (string) –

      Allows workloads from within an entity to preempt same-team workloads. When set to LowerPriority, the entity’s lower priority tasks are preempted by their own higher priority tasks.

      Default is LowerPriority.

  • ComputeQuotaTarget (dict) –

    The target entity to allocate compute resources to.

    • TeamName (string) – [REQUIRED]

      Name of the team to allocate compute resources to.

    • FairShareWeight (integer) –

      Assigned entity fair-share weight. Idle compute will be shared across entities based on these assigned weights. This weight is only used when FairShare is enabled.

      A weight of 0 is the lowest priority and 100 is the highest. Weight 0 is the default.

  • ActivationState (string) –

    The state of the compute allocation being described. Use to enable or disable compute allocation.

    Default is Enabled.

  • Description (string) – Description of the compute allocation definition.

Return type:

dict

Returns:

Response Syntax

{
    'ComputeQuotaArn': 'string',
    'ComputeQuotaVersion': 123
}

Response Structure

  • (dict) –

    • ComputeQuotaArn (string) –

      ARN of the compute allocation definition.

    • ComputeQuotaVersion (integer) –

      Version of the compute allocation definition.

Exceptions