AWS Batch

2019/04/03 - AWS Batch - 4 updated api methods

Changes: Support for GPU resource requirements in RegisterJobDefinition and SubmitJob.

DescribeJobDefinitions (updated)
Changes (response)
{'jobDefinitions': {'containerProperties': {'resourceRequirements': [{'type': 'GPU',
                                                                      'value': 'string'}]},
                    'nodeProperties': {'nodeRangeProperties': {'container': {'resourceRequirements': [{'type': 'GPU',
                                                                                                       'value': 'string'}]}}}}}

Describes a list of job definitions. You can specify a status (such as ACTIVE) to return only job definitions that match that status.
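As a minimal sketch of the status filter, the hypothetical helper below builds keyword arguments for describe_job_definitions and leaves out any option that was not set, on the assumption that botocore's parameter validation rejects None values. The helper name and the example job definition name are illustrative and are not part of the API.

```python
def build_describe_kwargs(name=None, status=None, max_results=None):
    """Build kwargs for client.describe_job_definitions, skipping unset options."""
    candidates = {
        "jobDefinitionName": name,
        "status": status,
        "maxResults": max_results,
    }
    # Keep only the options the caller actually supplied.
    return {key: value for key, value in candidates.items() if value is not None}


# Hypothetical usage with a boto3 Batch client (not executed here):
#   client = boto3.client("batch")
#   client.describe_job_definitions(**build_describe_kwargs(status="ACTIVE"))
kwargs = build_describe_kwargs(name="example-job-def", status="ACTIVE")
```

Passing the result with ** keeps the call site the same whichever filters are in use.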

See also: AWS API Documentation

Request Syntax

client.describe_job_definitions(
    jobDefinitions=[
        'string',
    ],
    maxResults=123,
    jobDefinitionName='string',
    status='string',
    nextToken='string'
)
type jobDefinitions

list

param jobDefinitions

A list of up to 100 job definition names or full Amazon Resource Name (ARN) entries.

  • (string) --

type maxResults

integer

param maxResults

The maximum number of results returned by DescribeJobDefinitions in paginated output. When this parameter is used, DescribeJobDefinitions only returns maxResults results in a single page along with a nextToken response element. The remaining results of the initial request can be seen by sending another DescribeJobDefinitions request with the returned nextToken value. This value can be between 1 and 100. If this parameter is not used, then DescribeJobDefinitions returns up to 100 results and a nextToken value if applicable.

type jobDefinitionName

string

param jobDefinitionName

The name of the job definition to describe.

type status

string

param status

The status with which to filter job definitions.

type nextToken

string

param nextToken

The nextToken value returned from a previous paginated DescribeJobDefinitions request where maxResults was used and the results exceeded the value of that parameter. Pagination continues from the end of the previous results that returned the nextToken value. This value is null when there are no more results to return.

Note

This token should be treated as an opaque identifier that is only used to retrieve the next items in a list and not for other programmatic purposes.
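The maxResults and nextToken parameters described above combine into a simple loop. The sketch below is one possible way to write it: iter_job_definitions is a hypothetical helper, and client is assumed to be any object that exposes describe_job_definitions with the request syntax shown earlier (for example, a boto3 Batch client).

```python
def iter_job_definitions(client, **filters):
    """Yield every job definition, following nextToken until none is returned."""
    kwargs = dict(filters)
    while True:
        page = client.describe_job_definitions(**kwargs)
        yield from page.get("jobDefinitions", [])
        # The token is opaque: pass it back unchanged, never parse it.
        token = page.get("nextToken")
        if not token:
            break
        kwargs["nextToken"] = token
```

Because the helper is a generator, callers can stop early without fetching the remaining pages.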

rtype

dict

returns

Response Syntax

{
    'jobDefinitions': [
        {
            'jobDefinitionName': 'string',
            'jobDefinitionArn': 'string',
            'revision': 123,
            'status': 'string',
            'type': 'string',
            'parameters': {
                'string': 'string'
            },
            'retryStrategy': {
                'attempts': 123
            },
            'containerProperties': {
                'image': 'string',
                'vcpus': 123,
                'memory': 123,
                'command': [
                    'string',
                ],
                'jobRoleArn': 'string',
                'volumes': [
                    {
                        'host': {
                            'sourcePath': 'string'
                        },
                        'name': 'string'
                    },
                ],
                'environment': [
                    {
                        'name': 'string',
                        'value': 'string'
                    },
                ],
                'mountPoints': [
                    {
                        'containerPath': 'string',
                        'readOnly': True|False,
                        'sourceVolume': 'string'
                    },
                ],
                'readonlyRootFilesystem': True|False,
                'privileged': True|False,
                'ulimits': [
                    {
                        'hardLimit': 123,
                        'name': 'string',
                        'softLimit': 123
                    },
                ],
                'user': 'string',
                'instanceType': 'string',
                'resourceRequirements': [
                    {
                        'value': 'string',
                        'type': 'GPU'
                    },
                ]
            },
            'timeout': {
                'attemptDurationSeconds': 123
            },
            'nodeProperties': {
                'numNodes': 123,
                'mainNode': 123,
                'nodeRangeProperties': [
                    {
                        'targetNodes': 'string',
                        'container': {
                            'image': 'string',
                            'vcpus': 123,
                            'memory': 123,
                            'command': [
                                'string',
                            ],
                            'jobRoleArn': 'string',
                            'volumes': [
                                {
                                    'host': {
                                        'sourcePath': 'string'
                                    },
                                    'name': 'string'
                                },
                            ],
                            'environment': [
                                {
                                    'name': 'string',
                                    'value': 'string'
                                },
                            ],
                            'mountPoints': [
                                {
                                    'containerPath': 'string',
                                    'readOnly': True|False,
                                    'sourceVolume': 'string'
                                },
                            ],
                            'readonlyRootFilesystem': True|False,
                            'privileged': True|False,
                            'ulimits': [
                                {
                                    'hardLimit': 123,
                                    'name': 'string',
                                    'softLimit': 123
                                },
                            ],
                            'user': 'string',
                            'instanceType': 'string',
                            'resourceRequirements': [
                                {
                                    'value': 'string',
                                    'type': 'GPU'
                                },
                            ]
                        }
                    },
                ]
            }
        },
    ],
    'nextToken': 'string'
}
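The resourceRequirements entries in the response syntax above carry the GPU count as a string. As a hedged illustration, the hypothetical helpers below read that count for both the single-container and the multi-node shapes; the function names are assumptions, and the input is taken to be one element of the jobDefinitions list.

```python
def gpu_count(container):
    """Return the total GPUs requested in a container-properties dict."""
    total = 0
    for requirement in container.get("resourceRequirements", []):
        if requirement.get("type") == "GPU":
            total += int(requirement["value"])  # value arrives as a string
    return total


def summarize_gpus(job_definition):
    """Map 'container' and each targetNodes string to its GPU count."""
    summary = {}
    if "containerProperties" in job_definition:
        summary["container"] = gpu_count(job_definition["containerProperties"])
    node_properties = job_definition.get("nodeProperties", {})
    for node_range in node_properties.get("nodeRangeProperties", []):
        summary[node_range["targetNodes"]] = gpu_count(node_range.get("container", {}))
    return summary
```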

Response Structure

  • (dict) --

    • jobDefinitions (list) --

      The list of job definitions.

      • (dict) --

        An object representing an AWS Batch job definition.

        • jobDefinitionName (string) --

          The name of the job definition.

        • jobDefinitionArn (string) --

          The Amazon Resource Name (ARN) for the job definition.

        • revision (integer) --

          The revision of the job definition.

        • status (string) --

          The status of the job definition.

        • type (string) --

          The type of job definition.

        • parameters (dict) --

          Default parameters or parameter substitution placeholders that are set in the job definition. Parameters are specified as a key-value pair mapping. Parameters in a SubmitJob request override any corresponding parameter defaults from the job definition. For more information about specifying parameters, see Job Definition Parameters in the AWS Batch User Guide.

          • (string) --

            • (string) --

        • retryStrategy (dict) --

          The retry strategy to use for failed jobs that are submitted with this job definition.

          • attempts (integer) --

            The number of times to move a job to the RUNNABLE status. You may specify between 1 and 10 attempts. If the value of attempts is greater than one, a failed job is retried until it has been attempted that many times in total.

        • containerProperties (dict) --

          An object with various properties specific to container-based jobs.

          • image (string) --

            The image used to start a container. This string is passed directly to the Docker daemon. Images in the Docker Hub registry are available by default. Other repositories are specified with repository-url/image:tag. Up to 255 letters (uppercase and lowercase), numbers, hyphens, underscores, colons, periods, forward slashes, and number signs are allowed. This parameter maps to Image in the Create a container section of the Docker Remote API and the IMAGE parameter of docker run.

            • Images in Amazon ECR repositories use the full registry and repository URI (for example, 012345678910.dkr.ecr.<region-name>.amazonaws.com/<repository-name>).

            • Images in official repositories on Docker Hub use a single name (for example, ubuntu or mongo).

            • Images in other repositories on Docker Hub are qualified with an organization name (for example, amazon/amazon-ecs-agent).

            • Images in other online repositories are qualified further by a domain name (for example, quay.io/assemblyline/ubuntu).

          • vcpus (integer) --

            The number of vCPUs reserved for the container. This parameter maps to CpuShares in the Create a container section of the Docker Remote API and the --cpu-shares option to docker run. Each vCPU is equivalent to 1,024 CPU shares. You must specify at least one vCPU.

          • memory (integer) --

            The hard limit (in MiB) of memory to present to the container. If your container attempts to exceed the memory specified here, the container is killed. This parameter maps to Memory in the Create a container section of the Docker Remote API and the --memory option to docker run. You must specify at least 4 MiB of memory for a job.

            Note

            If you are trying to maximize your resource utilization by providing your jobs as much memory as possible for a particular instance type, see Memory Management in the AWS Batch User Guide.

          • command (list) --

            The command that is passed to the container. This parameter maps to Cmd in the Create a container section of the Docker Remote API and the COMMAND parameter to docker run. For more information, see https://docs.docker.com/engine/reference/builder/#cmd.

            • (string) --

          • jobRoleArn (string) --

            The Amazon Resource Name (ARN) of the IAM role that the container can assume for AWS permissions.

          • volumes (list) --

            A list of data volumes used in a job.

            • (dict) --

              A data volume used in a job's container properties.

              • host (dict) --

                The contents of the host parameter determine whether your data volume persists on the host container instance and where it is stored. If the host parameter is empty, then the Docker daemon assigns a host path for your data volume. However, the data is not guaranteed to persist after the containers associated with it stop running.

                • sourcePath (string) --

                  The path on the host container instance that is presented to the container. If this parameter is empty, then the Docker daemon has assigned a host path for you. If the host parameter contains a sourcePath file location, then the data volume persists at the specified location on the host container instance until you delete it manually. If the sourcePath value does not exist on the host container instance, the Docker daemon creates it. If the location does exist, the contents of the source path folder are exported.

              • name (string) --

                The name of the volume. Up to 255 letters (uppercase and lowercase), numbers, hyphens, and underscores are allowed. This name is referenced in the sourceVolume parameter of container definition mountPoints.

          • environment (list) --

            The environment variables to pass to a container. This parameter maps to Env in the Create a container section of the Docker Remote API and the --env option to docker run.

            Warning

            We do not recommend using plaintext environment variables for sensitive information, such as credential data.

            Note

            Environment variables must not start with AWS_BATCH; this naming convention is reserved for variables that are set by the AWS Batch service.

            • (dict) --

              A key-value pair object.

              • name (string) --

                The name of the key-value pair. For environment variables, this is the name of the environment variable.

              • value (string) --

                The value of the key-value pair. For environment variables, this is the value of the environment variable.

          • mountPoints (list) --

            The mount points for data volumes in your container. This parameter maps to Volumes in the Create a container section of the Docker Remote API and the --volume option to docker run.

            • (dict) --

              Details on a Docker volume mount point that is used in a job's container properties.

              • containerPath (string) --

                The path on the container at which to mount the host volume.

              • readOnly (boolean) --

                If this value is true, the container has read-only access to the volume; otherwise, the container can write to the volume. The default value is false.

              • sourceVolume (string) --

                The name of the volume to mount.

          • readonlyRootFilesystem (boolean) --

            When this parameter is true, the container is given read-only access to its root file system. This parameter maps to ReadonlyRootfs in the Create a container section of the Docker Remote API and the --read-only option to docker run.

          • privileged (boolean) --

            When this parameter is true, the container is given elevated privileges on the host container instance (similar to the root user). This parameter maps to Privileged in the Create a container section of the Docker Remote API and the --privileged option to docker run.

          • ulimits (list) --

            A list of ulimits to set in the container. This parameter maps to Ulimits in the Create a container section of the Docker Remote API and the --ulimit option to docker run.

            • (dict) --

              The ulimit settings to pass to the container.

              • hardLimit (integer) --

                The hard limit for the ulimit type.

              • name (string) --

                The type of the ulimit.

              • softLimit (integer) --

                The soft limit for the ulimit type.

          • user (string) --

            The user name to use inside the container. This parameter maps to User in the Create a container section of the Docker Remote API and the --user option to docker run.

          • instanceType (string) --

            The instance type to use for a multi-node parallel job. Currently all node groups in a multi-node parallel job must use the same instance type. This parameter is not valid for single-node container jobs.

          • resourceRequirements (list) --

            The type and amount of a resource to assign to a container. Currently, the only supported resource is GPU.

            • (dict) --

              The type and amount of a resource to assign to a container. Currently, the only supported resource type is GPU.

              • value (string) --

                The number of physical GPUs to reserve for the container. The number of GPUs reserved for all containers in a job should not exceed the number of available GPUs on the compute resource that the job is launched on.

              • type (string) --

                The type of resource to assign to a container. Currently, the only supported resource type is GPU.

        • timeout (dict) --

          The timeout configuration for jobs that are submitted with this job definition. You can specify a timeout duration after which AWS Batch terminates your jobs if they have not finished.

          • attemptDurationSeconds (integer) --

            The time duration in seconds (measured from the job attempt's startedAt timestamp) after which AWS Batch terminates your jobs if they have not finished.

        • nodeProperties (dict) --

          An object with various properties specific to multi-node parallel jobs.

          • numNodes (integer) --

            The number of nodes associated with a multi-node parallel job.

          • mainNode (integer) --

            Specifies the node index for the main node of a multi-node parallel job. This node index value must be less than the number of nodes.

          • nodeRangeProperties (list) --

            A list of node ranges and their properties associated with a multi-node parallel job.

            • (dict) --

              An object representing the properties of the node range for a multi-node parallel job.

              • targetNodes (string) --

                The range of nodes, using node index values. A range of 0:3 indicates nodes with index values of 0 through 3. If the starting range value is omitted (:n), then 0 is used to start the range. If the ending range value is omitted (n:), then the highest possible node index is used to end the range. Your cumulative node ranges must account for all nodes (0:n). You may nest node ranges, for example 0:10 and 4:5, in which case the 4:5 range properties override the 0:10 properties.

              • container (dict) --

                The container details for the node range.

                • image (string) --

                  The image used to start a container. This string is passed directly to the Docker daemon. Images in the Docker Hub registry are available by default. Other repositories are specified with repository-url/image:tag. Up to 255 letters (uppercase and lowercase), numbers, hyphens, underscores, colons, periods, forward slashes, and number signs are allowed. This parameter maps to Image in the Create a container section of the Docker Remote API and the IMAGE parameter of docker run.

                  • Images in Amazon ECR repositories use the full registry and repository URI (for example, 012345678910.dkr.ecr.<region-name>.amazonaws.com/<repository-name>).

                  • Images in official repositories on Docker Hub use a single name (for example, ubuntu or mongo).

                  • Images in other repositories on Docker Hub are qualified with an organization name (for example, amazon/amazon-ecs-agent).

                  • Images in other online repositories are qualified further by a domain name (for example, quay.io/assemblyline/ubuntu).

                • vcpus (integer) --

                  The number of vCPUs reserved for the container. This parameter maps to CpuShares in the Create a container section of the Docker Remote API and the --cpu-shares option to docker run. Each vCPU is equivalent to 1,024 CPU shares. You must specify at least one vCPU.

                • memory (integer) --

                  The hard limit (in MiB) of memory to present to the container. If your container attempts to exceed the memory specified here, the container is killed. This parameter maps to Memory in the Create a container section of the Docker Remote API and the --memory option to docker run. You must specify at least 4 MiB of memory for a job.

                  Note

                  If you are trying to maximize your resource utilization by providing your jobs as much memory as possible for a particular instance type, see Memory Management in the AWS Batch User Guide.

                • command (list) --

                  The command that is passed to the container. This parameter maps to Cmd in the Create a container section of the Docker Remote API and the COMMAND parameter to docker run. For more information, see https://docs.docker.com/engine/reference/builder/#cmd.

                  • (string) --

                • jobRoleArn (string) --

                  The Amazon Resource Name (ARN) of the IAM role that the container can assume for AWS permissions.

                • volumes (list) --

                  A list of data volumes used in a job.

                  • (dict) --

                    A data volume used in a job's container properties.

                    • host (dict) --

                      The contents of the host parameter determine whether your data volume persists on the host container instance and where it is stored. If the host parameter is empty, then the Docker daemon assigns a host path for your data volume. However, the data is not guaranteed to persist after the containers associated with it stop running.

                      • sourcePath (string) --

                        The path on the host container instance that is presented to the container. If this parameter is empty, then the Docker daemon has assigned a host path for you. If the host parameter contains a sourcePath file location, then the data volume persists at the specified location on the host container instance until you delete it manually. If the sourcePath value does not exist on the host container instance, the Docker daemon creates it. If the location does exist, the contents of the source path folder are exported.

                    • name (string) --

                      The name of the volume. Up to 255 letters (uppercase and lowercase), numbers, hyphens, and underscores are allowed. This name is referenced in the sourceVolume parameter of container definition mountPoints.

                • environment (list) --

                  The environment variables to pass to a container. This parameter maps to Env in the Create a container section of the Docker Remote API and the --env option to docker run.

                  Warning

                  We do not recommend using plaintext environment variables for sensitive information, such as credential data.

                  Note

                  Environment variables must not start with AWS_BATCH; this naming convention is reserved for variables that are set by the AWS Batch service.

                  • (dict) --

                    A key-value pair object.

                    • name (string) --

                      The name of the key-value pair. For environment variables, this is the name of the environment variable.

                    • value (string) --

                      The value of the key-value pair. For environment variables, this is the value of the environment variable.

                • mountPoints (list) --

                  The mount points for data volumes in your container. This parameter maps to Volumes in the Create a container section of the Docker Remote API and the --volume option to docker run.

                  • (dict) --

                    Details on a Docker volume mount point that is used in a job's container properties.

                    • containerPath (string) --

                      The path on the container at which to mount the host volume.

                    • readOnly (boolean) --

                      If this value is true, the container has read-only access to the volume; otherwise, the container can write to the volume. The default value is false.

                    • sourceVolume (string) --

                      The name of the volume to mount.

                • readonlyRootFilesystem (boolean) --

                  When this parameter is true, the container is given read-only access to its root file system. This parameter maps to ReadonlyRootfs in the Create a container section of the Docker Remote API and the --read-only option to docker run.

                • privileged (boolean) --

                  When this parameter is true, the container is given elevated privileges on the host container instance (similar to the root user). This parameter maps to Privileged in the Create a container section of the Docker Remote API and the --privileged option to docker run.

                • ulimits (list) --

                  A list of ulimits to set in the container. This parameter maps to Ulimits in the Create a container section of the Docker Remote API and the --ulimit option to docker run.

                  • (dict) --

                    The ulimit settings to pass to the container.

                    • hardLimit (integer) --

                      The hard limit for the ulimit type.

                    • name (string) --

                      The type of the ulimit.

                    • softLimit (integer) --

                      The soft limit for the ulimit type.

                • user (string) --

                  The user name to use inside the container. This parameter maps to User in the Create a container section of the Docker Remote API and the --user option to docker run.

                • instanceType (string) --

                  The instance type to use for a multi-node parallel job. Currently all node groups in a multi-node parallel job must use the same instance type. This parameter is not valid for single-node container jobs.

                • resourceRequirements (list) --

                  The type and amount of a resource to assign to a container. Currently, the only supported resource is GPU.

                  • (dict) --

                    The type and amount of a resource to assign to a container. Currently, the only supported resource type is GPU.

                    • value (string) --

                      The number of physical GPUs to reserve for the container. The number of GPUs reserved for all containers in a job should not exceed the number of available GPUs on the compute resource that the job is launched on.

                    • type (string) --

                      The type of resource to assign to a container. Currently, the only supported resource type is GPU.

    • nextToken (string) --

      The nextToken value to include in a future DescribeJobDefinitions request. When the results of a DescribeJobDefinitions request exceed maxResults, this value can be used to retrieve the next page of results. This value is null when there are no more results to return.
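The targetNodes range syntax described under nodeRangeProperties can be checked on the client side. The sketch below is an illustration only: both function names are hypothetical, and the rule that a narrower range wins is an assumption generalized from the 0:10 and 4:5 example in the text.

```python
def parse_node_range(target_nodes, num_nodes):
    """Return (start, end), inclusive, for a targetNodes string such as '0:3'."""
    start_text, separator, end_text = target_nodes.partition(":")
    if not separator:
        # Only the colon forms (0:3, :n, n:) are described in the text above.
        raise ValueError(f"expected a range with ':', got {target_nodes!r}")
    start = int(start_text) if start_text else 0
    end = int(end_text) if end_text else num_nodes - 1
    return start, end


def resolve_node_ranges(target_node_strings, num_nodes):
    """Map each node index to the targetNodes string whose properties apply."""
    parsed = [(parse_node_range(text, num_nodes), text) for text in target_node_strings]
    # Apply the widest ranges first so that narrower, nested ranges overwrite them.
    parsed.sort(key=lambda item: item[0][1] - item[0][0], reverse=True)
    owner = {}
    for (start, end), text in parsed:
        for index in range(start, end + 1):
            owner[index] = text
    missing = [index for index in range(num_nodes) if index not in owner]
    if missing:
        raise ValueError(f"node ranges do not cover nodes {missing}")
    return owner
```

With ranges 0:10 and 4:5 on an 11-node job, nodes 4 and 5 resolve to 4:5 and every other node resolves to 0:10.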

DescribeJobs (updated)
Changes (response)
{'jobs': {'container': {'resourceRequirements': [{'type': 'GPU',
                                                  'value': 'string'}]},
          'nodeProperties': {'nodeRangeProperties': {'container': {'resourceRequirements': [{'type': 'GPU',
                                                                                             'value': 'string'}]}}}}}

Describes a list of AWS Batch jobs.

See also: AWS API Documentation

Request Syntax

client.describe_jobs(
    jobs=[
        'string',
    ]
)
type jobs

list

param jobs

[REQUIRED]

A list of up to 100 job IDs.

  • (string) --
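Because a single request accepts at most 100 job IDs, longer lists have to be split across calls. The following is a minimal sketch of that batching; chunked and describe_all_jobs are hypothetical helpers, and client is assumed to be any object that exposes describe_jobs with the request syntax above.

```python
def chunked(items, size=100):
    """Yield consecutive slices of items, each holding at most size entries."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def describe_all_jobs(client, job_ids):
    """Call describe_jobs once per batch of up to 100 IDs and merge the results."""
    jobs = []
    for batch in chunked(list(job_ids)):
        response = client.describe_jobs(jobs=batch)
        jobs.extend(response.get("jobs", []))
    return jobs
```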

rtype

dict

returns

Response Syntax

{
    'jobs': [
        {
            'jobName': 'string',
            'jobId': 'string',
            'jobQueue': 'string',
            'status': 'SUBMITTED'|'PENDING'|'RUNNABLE'|'STARTING'|'RUNNING'|'SUCCEEDED'|'FAILED',
            'attempts': [
                {
                    'container': {
                        'containerInstanceArn': 'string',
                        'taskArn': 'string',
                        'exitCode': 123,
                        'reason': 'string',
                        'logStreamName': 'string',
                        'networkInterfaces': [
                            {
                                'attachmentId': 'string',
                                'ipv6Address': 'string',
                                'privateIpv4Address': 'string'
                            },
                        ]
                    },
                    'startedAt': 123,
                    'stoppedAt': 123,
                    'statusReason': 'string'
                },
            ],
            'statusReason': 'string',
            'createdAt': 123,
            'retryStrategy': {
                'attempts': 123
            },
            'startedAt': 123,
            'stoppedAt': 123,
            'dependsOn': [
                {
                    'jobId': 'string',
                    'type': 'N_TO_N'|'SEQUENTIAL'
                },
            ],
            'jobDefinition': 'string',
            'parameters': {
                'string': 'string'
            },
            'container': {
                'image': 'string',
                'vcpus': 123,
                'memory': 123,
                'command': [
                    'string',
                ],
                'jobRoleArn': 'string',
                'volumes': [
                    {
                        'host': {
                            'sourcePath': 'string'
                        },
                        'name': 'string'
                    },
                ],
                'environment': [
                    {
                        'name': 'string',
                        'value': 'string'
                    },
                ],
                'mountPoints': [
                    {
                        'containerPath': 'string',
                        'readOnly': True|False,
                        'sourceVolume': 'string'
                    },
                ],
                'readonlyRootFilesystem': True|False,
                'ulimits': [
                    {
                        'hardLimit': 123,
                        'name': 'string',
                        'softLimit': 123
                    },
                ],
                'privileged': True|False,
                'user': 'string',
                'exitCode': 123,
                'reason': 'string',
                'containerInstanceArn': 'string',
                'taskArn': 'string',
                'logStreamName': 'string',
                'instanceType': 'string',
                'networkInterfaces': [
                    {
                        'attachmentId': 'string',
                        'ipv6Address': 'string',
                        'privateIpv4Address': 'string'
                    },
                ],
                'resourceRequirements': [
                    {
                        'value': 'string',
                        'type': 'GPU'
                    },
                ]
            },
            'nodeDetails': {
                'nodeIndex': 123,
                'isMainNode': True|False
            },
            'nodeProperties': {
                'numNodes': 123,
                'mainNode': 123,
                'nodeRangeProperties': [
                    {
                        'targetNodes': 'string',
                        'container': {
                            'image': 'string',
                            'vcpus': 123,
                            'memory': 123,
                            'command': [
                                'string',
                            ],
                            'jobRoleArn': 'string',
                            'volumes': [
                                {
                                    'host': {
                                        'sourcePath': 'string'
                                    },
                                    'name': 'string'
                                },
                            ],
                            'environment': [
                                {
                                    'name': 'string',
                                    'value': 'string'
                                },
                            ],
                            'mountPoints': [
                                {
                                    'containerPath': 'string',
                                    'readOnly': True|False,
                                    'sourceVolume': 'string'
                                },
                            ],
                            'readonlyRootFilesystem': True|False,
                            'privileged': True|False,
                            'ulimits': [
                                {
                                    'hardLimit': 123,
                                    'name': 'string',
                                    'softLimit': 123
                                },
                            ],
                            'user': 'string',
                            'instanceType': 'string',
                            'resourceRequirements': [
                                {
                                    'value': 'string',
                                    'type': 'GPU'
                                },
                            ]
                        }
                    },
                ]
            },
            'arrayProperties': {
                'statusSummary': {
                    'string': 123
                },
                'size': 123,
                'index': 123
            },
            'timeout': {
                'attemptDurationSeconds': 123
            }
        },
    ]
}

Response Structure

  • (dict) --

    • jobs (list) --

      The list of jobs.

      • (dict) --

        An object representing an AWS Batch job.

        • jobName (string) --

          The name of the job.

        • jobId (string) --

          The ID for the job.

        • jobQueue (string) --

          The Amazon Resource Name (ARN) of the job queue with which the job is associated.

        • status (string) --

          The current status for the job.

          Note

          If your jobs do not progress to STARTING , see Jobs Stuck in RUNNABLE Status (https://docs.aws.amazon.com/batch/latest/userguide/troubleshooting.html#job_stuck_in_runnable) in the troubleshooting section of the AWS Batch User Guide .

        • attempts (list) --

          A list of job attempts associated with this job.

          • (dict) --

            An object representing a job attempt.

            • container (dict) --

              Details about the container in this job attempt.

              • containerInstanceArn (string) --

                The Amazon Resource Name (ARN) of the Amazon ECS container instance that hosts the job attempt.

              • taskArn (string) --

                The Amazon Resource Name (ARN) of the Amazon ECS task that is associated with the job attempt. Each container attempt receives a task ARN when it reaches the STARTING status.

              • exitCode (integer) --

                The exit code for the job attempt. A non-zero exit code is considered a failure.

              • reason (string) --

                A short (255 max characters) human-readable string to provide additional details about a running or stopped container.

              • logStreamName (string) --

                The name of the CloudWatch Logs log stream associated with the container. The log group for AWS Batch jobs is /aws/batch/job . Each container attempt receives a log stream name when it reaches the RUNNING status.

              • networkInterfaces (list) --

                The network interfaces associated with the job attempt.

                • (dict) --

                  An object representing the elastic network interface for a multi-node parallel job node.

                  • attachmentId (string) --

                    The attachment ID for the network interface.

                  • ipv6Address (string) --

                    The private IPv6 address for the network interface.

                  • privateIpv4Address (string) --

                    The private IPv4 address for the network interface.

            • startedAt (integer) --

              The Unix timestamp (in milliseconds) for when the attempt was started (when the attempt transitioned from the STARTING state to the RUNNING state).

            • stoppedAt (integer) --

              The Unix timestamp (in milliseconds) for when the attempt was stopped (when the attempt transitioned from the RUNNING state to a terminal state, such as SUCCEEDED or FAILED ).

            • statusReason (string) --

              A short, human-readable string to provide additional details about the current status of the job attempt.

        • statusReason (string) --

          A short, human-readable string to provide additional details about the current status of the job.

        • createdAt (integer) --

          The Unix timestamp (in milliseconds) for when the job was created. For non-array jobs and parent array jobs, this is when the job entered the SUBMITTED state (at the time SubmitJob was called). For array child jobs, this is when the child job was spawned by its parent and entered the PENDING state.

        • retryStrategy (dict) --

          The retry strategy to use for this job if an attempt fails.

          • attempts (integer) --

            The number of times to move a job to the RUNNABLE status. You may specify between 1 and 10 attempts. If the value of attempts is greater than one, a failed job is retried until the total number of attempts reaches that value.

        • startedAt (integer) --

          The Unix timestamp (in milliseconds) for when the job was started (when the job transitioned from the STARTING state to the RUNNING state).

        • stoppedAt (integer) --

          The Unix timestamp (in milliseconds) for when the job was stopped (when the job transitioned from the RUNNING state to a terminal state, such as SUCCEEDED or FAILED ).

        • dependsOn (list) --

          A list of job names or IDs on which this job depends.

          • (dict) --

            An object representing an AWS Batch job dependency.

            • jobId (string) --

              The job ID of the AWS Batch job associated with this dependency.

            • type (string) --

              The type of the job dependency.

        • jobDefinition (string) --

          The job definition that is used by this job.

        • parameters (dict) --

          Additional parameters passed to the job that replace parameter substitution placeholders or override any corresponding parameter defaults from the job definition.

          • (string) --

            • (string) --

        • container (dict) --

          An object representing the details of the container that is associated with the job.

          • image (string) --

            The image used to start the container.

          • vcpus (integer) --

            The number of vCPUs allocated for the job.

          • memory (integer) --

            The number of MiB of memory reserved for the job.

          • command (list) --

            The command that is passed to the container.

            • (string) --

          • jobRoleArn (string) --

            The Amazon Resource Name (ARN) associated with the job upon execution.

          • volumes (list) --

            A list of volumes associated with the job.

            • (dict) --

              A data volume used in a job's container properties.

              • host (dict) --

                The contents of the host parameter determine whether your data volume persists on the host container instance and where it is stored. If the host parameter is empty, then the Docker daemon assigns a host path for your data volume. However, the data is not guaranteed to persist after the containers associated with it stop running.

                • sourcePath (string) --

                  The path on the host container instance that is presented to the container. If this parameter is empty, then the Docker daemon has assigned a host path for you. If the host parameter contains a sourcePath file location, then the data volume persists at the specified location on the host container instance until you delete it manually. If the sourcePath value does not exist on the host container instance, the Docker daemon creates it. If the location does exist, the contents of the source path folder are exported.

              • name (string) --

                The name of the volume. Up to 255 letters (uppercase and lowercase), numbers, hyphens, and underscores are allowed. This name is referenced in the sourceVolume parameter of container definition mountPoints .

          • environment (list) --

            The environment variables to pass to a container.

            Note

            Environment variables must not start with AWS_BATCH ; this naming convention is reserved for variables that are set by the AWS Batch service.

            • (dict) --

              A key-value pair object.

              • name (string) --

                The name of the key-value pair. For environment variables, this is the name of the environment variable.

              • value (string) --

                The value of the key-value pair. For environment variables, this is the value of the environment variable.

          • mountPoints (list) --

            The mount points for data volumes in your container.

            • (dict) --

              Details on a Docker volume mount point that is used in a job's container properties.

              • containerPath (string) --

                The path on the container at which to mount the host volume.

              • readOnly (boolean) --

                If this value is true , the container has read-only access to the volume; otherwise, the container can write to the volume. The default value is false .

              • sourceVolume (string) --

                The name of the volume to mount.

          • readonlyRootFilesystem (boolean) --

            When this parameter is true, the container is given read-only access to its root file system.

          • ulimits (list) --

            A list of ulimit values to set in the container.

            • (dict) --

              The ulimit settings to pass to the container.

              • hardLimit (integer) --

                The hard limit for the ulimit type.

              • name (string) --

                The type of the ulimit .

              • softLimit (integer) --

                The soft limit for the ulimit type.

          • privileged (boolean) --

            When this parameter is true, the container is given elevated privileges on the host container instance (similar to the root user).

          • user (string) --

            The user name to use inside the container.

          • exitCode (integer) --

            The exit code to return upon completion.

          • reason (string) --

            A short (255 max characters) human-readable string to provide additional details about a running or stopped container.

          • containerInstanceArn (string) --

            The Amazon Resource Name (ARN) of the container instance on which the container is running.

          • taskArn (string) --

            The Amazon Resource Name (ARN) of the Amazon ECS task that is associated with the container job. Each container attempt receives a task ARN when it reaches the STARTING status.

          • logStreamName (string) --

            The name of the CloudWatch Logs log stream associated with the container. The log group for AWS Batch jobs is /aws/batch/job . Each container attempt receives a log stream name when it reaches the RUNNING status.

          • instanceType (string) --

            The instance type of the underlying host infrastructure of a multi-node parallel job.

          • networkInterfaces (list) --

            The network interfaces associated with the job.

            • (dict) --

              An object representing the elastic network interface for a multi-node parallel job node.

              • attachmentId (string) --

                The attachment ID for the network interface.

              • ipv6Address (string) --

                The private IPv6 address for the network interface.

              • privateIpv4Address (string) --

                The private IPv4 address for the network interface.

          • resourceRequirements (list) --

            The type and amount of a resource to assign to a container. Currently, the only supported resource is GPU .

            • (dict) --

              The type and amount of a resource to assign to a container. Currently, the only supported resource type is GPU .

              • value (string) --

                The number of physical GPUs to reserve for the container. The number of GPUs reserved for all containers in a job should not exceed the number of available GPUs on the compute resource that the job is launched on.

              • type (string) --

                The type of resource to assign to a container. Currently, the only supported resource type is GPU .

        • nodeDetails (dict) --

          An object representing the details of a node that is associated with a multi-node parallel job.

          • nodeIndex (integer) --

            The node index for the node. Node index numbering begins at zero. This index is also available on the node with the AWS_BATCH_JOB_NODE_INDEX environment variable.

          • isMainNode (boolean) --

            Specifies whether the current node is the main node for a multi-node parallel job.

        • nodeProperties (dict) --

          An object representing the node properties of a multi-node parallel job.

          • numNodes (integer) --

            The number of nodes associated with a multi-node parallel job.

          • mainNode (integer) --

            Specifies the node index for the main node of a multi-node parallel job. This node index value must be less than the number of nodes.

          • nodeRangeProperties (list) --

            A list of node ranges and their properties associated with a multi-node parallel job.

            • (dict) --

              An object representing the properties of the node range for a multi-node parallel job.

              • targetNodes (string) --

                The range of nodes, using node index values. A range of 0:3 indicates nodes with index values of 0 through 3 . If the starting range value is omitted (:n ), then 0 is used to start the range. If the ending range value is omitted (n: ), then the highest possible node index is used to end the range. Your cumulative node ranges must account for all nodes (0:n). You may nest node ranges, for example 0:10 and 4:5, in which case the 4:5 range properties override the 0:10 properties.

              • container (dict) --

                The container details for the node range.

                • image (string) --

                  The image used to start a container. This string is passed directly to the Docker daemon. Images in the Docker Hub registry are available by default. Other repositories are specified with repository-url/image:tag . Up to 255 letters (uppercase and lowercase), numbers, hyphens, underscores, colons, periods, forward slashes, and number signs are allowed. This parameter maps to Image in the Create a container section of the Docker Remote API and the IMAGE parameter of docker run .

                  • Images in Amazon ECR repositories use the full registry and repository URI (for example, 012345678910.dkr.ecr.<region-name>.amazonaws.com/<repository-name> ).

                  • Images in official repositories on Docker Hub use a single name (for example, ubuntu or mongo ).

                  • Images in other repositories on Docker Hub are qualified with an organization name (for example, amazon/amazon-ecs-agent ).

                  • Images in other online repositories are qualified further by a domain name (for example, quay.io/assemblyline/ubuntu ).

                • vcpus (integer) --

                  The number of vCPUs reserved for the container. This parameter maps to CpuShares in the Create a container section of the Docker Remote API and the --cpu-shares option to docker run . Each vCPU is equivalent to 1,024 CPU shares. You must specify at least one vCPU.

                • memory (integer) --

                  The hard limit (in MiB) of memory to present to the container. If your container attempts to exceed the memory specified here, the container is killed. This parameter maps to Memory in the Create a container section of the Docker Remote API and the --memory option to docker run . You must specify at least 4 MiB of memory for a job.

                  Note

                  If you are trying to maximize your resource utilization by providing your jobs as much memory as possible for a particular instance type, see Memory Management in the AWS Batch User Guide .

                • command (list) --

                  The command that is passed to the container. This parameter maps to Cmd in the Create a container section of the Docker Remote API and the COMMAND parameter to docker run . For more information, see https://docs.docker.com/engine/reference/builder/#cmd .

                  • (string) --

                • jobRoleArn (string) --

                  The Amazon Resource Name (ARN) of the IAM role that the container can assume for AWS permissions.

                • volumes (list) --

                  A list of data volumes used in a job.

                  • (dict) --

                    A data volume used in a job's container properties.

                    • host (dict) --

                      The contents of the host parameter determine whether your data volume persists on the host container instance and where it is stored. If the host parameter is empty, then the Docker daemon assigns a host path for your data volume. However, the data is not guaranteed to persist after the containers associated with it stop running.

                      • sourcePath (string) --

                        The path on the host container instance that is presented to the container. If this parameter is empty, then the Docker daemon has assigned a host path for you. If the host parameter contains a sourcePath file location, then the data volume persists at the specified location on the host container instance until you delete it manually. If the sourcePath value does not exist on the host container instance, the Docker daemon creates it. If the location does exist, the contents of the source path folder are exported.

                    • name (string) --

                      The name of the volume. Up to 255 letters (uppercase and lowercase), numbers, hyphens, and underscores are allowed. This name is referenced in the sourceVolume parameter of container definition mountPoints .

                • environment (list) --

                  The environment variables to pass to a container. This parameter maps to Env in the Create a container section of the Docker Remote API and the --env option to docker run .

                  Warning

                  We do not recommend using plaintext environment variables for sensitive information, such as credential data.

                  Note

                  Environment variables must not start with AWS_BATCH ; this naming convention is reserved for variables that are set by the AWS Batch service.

                  • (dict) --

                    A key-value pair object.

                    • name (string) --

                      The name of the key-value pair. For environment variables, this is the name of the environment variable.

                    • value (string) --

                      The value of the key-value pair. For environment variables, this is the value of the environment variable.

                • mountPoints (list) --

                  The mount points for data volumes in your container. This parameter maps to Volumes in the Create a container section of the Docker Remote API and the --volume option to docker run .

                  • (dict) --

                    Details on a Docker volume mount point that is used in a job's container properties.

                    • containerPath (string) --

                      The path on the container at which to mount the host volume.

                    • readOnly (boolean) --

                      If this value is true , the container has read-only access to the volume; otherwise, the container can write to the volume. The default value is false .

                    • sourceVolume (string) --

                      The name of the volume to mount.

                • readonlyRootFilesystem (boolean) --

                  When this parameter is true, the container is given read-only access to its root file system. This parameter maps to ReadonlyRootfs in the Create a container section of the Docker Remote API and the --read-only option to docker run .

                • privileged (boolean) --

                  When this parameter is true, the container is given elevated privileges on the host container instance (similar to the root user). This parameter maps to Privileged in the Create a container section of the Docker Remote API and the --privileged option to docker run .

                • ulimits (list) --

                  A list of ulimits to set in the container. This parameter maps to Ulimits in the Create a container section of the Docker Remote API and the --ulimit option to docker run .

                  • (dict) --

                    The ulimit settings to pass to the container.

                    • hardLimit (integer) --

                      The hard limit for the ulimit type.

                    • name (string) --

                      The type of the ulimit .

                    • softLimit (integer) --

                      The soft limit for the ulimit type.

                • user (string) --

                  The user name to use inside the container. This parameter maps to User in the Create a container section of the Docker Remote API and the --user option to docker run .

                • instanceType (string) --

                  The instance type to use for a multi-node parallel job. Currently all node groups in a multi-node parallel job must use the same instance type. This parameter is not valid for single-node container jobs.

                • resourceRequirements (list) --

                  The type and amount of a resource to assign to a container. Currently, the only supported resource is GPU .

                  • (dict) --

                    The type and amount of a resource to assign to a container. Currently, the only supported resource type is GPU .

                    • value (string) --

                      The number of physical GPUs to reserve for the container. The number of GPUs reserved for all containers in a job should not exceed the number of available GPUs on the compute resource that the job is launched on.

                    • type (string) --

                      The type of resource to assign to a container. Currently, the only supported resource type is GPU .
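The targetNodes rules described above (omitted start or end values, full coverage of all nodes, nested ranges overriding wider ones) can be illustrated with a small client-side sketch. The helper names parse_target_nodes and resolve_node_ranges are hypothetical and are not part of boto3 or the AWS Batch API. The sketch assumes that a narrower range always overrides a wider one; how the service treats partially overlapping ranges is not stated here, so that case is not modeled.

```python
def parse_target_nodes(target_nodes, num_nodes):
    """Turn a targetNodes string such as '0:3', ':2' or '4:' into (start, end)."""
    start_text, sep, end_text = target_nodes.partition(":")
    if not sep:
        raise ValueError(f"expected a 'start:end' range, got {target_nodes!r}")
    # An omitted start means 0; an omitted end means the highest node index.
    start = int(start_text) if start_text else 0
    end = int(end_text) if end_text else num_nodes - 1
    if not 0 <= start <= end < num_nodes:
        raise ValueError(f"range {target_nodes!r} is invalid for {num_nodes} nodes")
    return start, end


def resolve_node_ranges(node_range_properties, num_nodes):
    """Map every node index to the container properties that apply to it."""
    parsed = [
        (parse_target_nodes(entry["targetNodes"], num_nodes), entry["container"])
        for entry in node_range_properties
    ]
    # Assumption: apply the widest ranges first so narrower (nested) ranges win.
    parsed.sort(key=lambda item: item[0][1] - item[0][0], reverse=True)
    resolved = {}
    for (start, end), container in parsed:
        for index in range(start, end + 1):
            resolved[index] = container
    missing = [index for index in range(num_nodes) if index not in resolved]
    if missing:
        raise ValueError(f"node ranges do not cover nodes {missing}")
    return resolved


# Hypothetical 11-node job: 4:5 is nested inside 0:10 and overrides it.
example_ranges = [
    {"targetNodes": "0:10", "container": {"image": "base-image"}},
    {"targetNodes": "4:5", "container": {"image": "gpu-image"}},
]
resolved_nodes = resolve_node_ranges(example_ranges, num_nodes=11)
```

Running a check like this before registering a multi-node job definition is one way to catch ranges that leave some node index uncovered.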

        • arrayProperties (dict) --

          The array properties of the job, if it is an array job.

          • statusSummary (dict) --

            A summary of the number of array job children in each available job status. This parameter is returned for parent array jobs.

            • (string) --

              • (integer) --

          • size (integer) --

            The size of the array job. This parameter is returned for parent array jobs.

          • index (integer) --

            The job index within the array that is associated with this job. This parameter is returned for array job children.

        • timeout (dict) --

          The timeout configuration for the job.

          • attemptDurationSeconds (integer) --

            The time duration in seconds (measured from the job attempt's startedAt timestamp) after which AWS Batch terminates your jobs if they have not finished.
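
As an illustration of how the response structure above might be consumed, the following sketch reduces one entry of the jobs list to a few commonly needed fields. The helper summarize_job and the sample data are hypothetical. The sketch assumes the timestamp fields hold Unix epoch milliseconds and that each GPU resource requirement carries its count as a numeric string, as the value (string) type above suggests.

```python
from datetime import datetime, timezone


def summarize_job(job):
    """Return a small summary dict for one entry of the 'jobs' list."""

    def to_datetime(milliseconds):
        # Timestamps are absent until the job reaches the matching state.
        if milliseconds is None:
            return None
        return datetime.fromtimestamp(milliseconds / 1000, tz=timezone.utc)

    container = job.get("container", {})
    # resourceRequirements values are strings, so convert before summing.
    gpus = sum(
        int(requirement["value"])
        for requirement in container.get("resourceRequirements", [])
        if requirement.get("type") == "GPU"
    )
    return {
        "jobId": job["jobId"],
        "status": job["status"],
        "createdAt": to_datetime(job.get("createdAt")),
        "startedAt": to_datetime(job.get("startedAt")),
        "stoppedAt": to_datetime(job.get("stoppedAt")),
        "gpus": gpus,
        "exitCode": container.get("exitCode"),
    }


# Hypothetical response fragment shaped like the structure documented above.
sample_response = {
    "jobs": [
        {
            "jobName": "train-model",
            "jobId": "example-job-id",
            "status": "SUCCEEDED",
            "createdAt": 1554249600000,
            "startedAt": 1554249660000,
            "stoppedAt": 1554253260000,
            "container": {
                "exitCode": 0,
                "resourceRequirements": [{"value": "2", "type": "GPU"}],
            },
        }
    ]
}

summaries = [summarize_job(job) for job in sample_response["jobs"]]
```

With a real client, the same function could be applied to each element of client.describe_jobs(jobs=[...])['jobs'].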

RegisterJobDefinition (updated)
Changes (request)
{'containerProperties': {'resourceRequirements': [{'type': 'GPU',
                                                   'value': 'string'}]},
 'nodeProperties': {'nodeRangeProperties': {'container': {'resourceRequirements': [{'type': 'GPU',
                                                                                    'value': 'string'}]}}}}

Registers an AWS Batch job definition.

See also: AWS API Documentation

Request Syntax

client.register_job_definition(
    jobDefinitionName='string',
    type='container'|'multinode',
    parameters={
        'string': 'string'
    },
    containerProperties={
        'image': 'string',
        'vcpus': 123,
        'memory': 123,
        'command': [
            'string',
        ],
        'jobRoleArn': 'string',
        'volumes': [
            {
                'host': {
                    'sourcePath': 'string'
                },
                'name': 'string'
            },
        ],
        'environment': [
            {
                'name': 'string',
                'value': 'string'
            },
        ],
        'mountPoints': [
            {
                'containerPath': 'string',
                'readOnly': True|False,
                'sourceVolume': 'string'
            },
        ],
        'readonlyRootFilesystem': True|False,
        'privileged': True|False,
        'ulimits': [
            {
                'hardLimit': 123,
                'name': 'string',
                'softLimit': 123
            },
        ],
        'user': 'string',
        'instanceType': 'string',
        'resourceRequirements': [
            {
                'value': 'string',
                'type': 'GPU'
            },
        ]
    },
    nodeProperties={
        'numNodes': 123,
        'mainNode': 123,
        'nodeRangeProperties': [
            {
                'targetNodes': 'string',
                'container': {
                    'image': 'string',
                    'vcpus': 123,
                    'memory': 123,
                    'command': [
                        'string',
                    ],
                    'jobRoleArn': 'string',
                    'volumes': [
                        {
                            'host': {
                                'sourcePath': 'string'
                            },
                            'name': 'string'
                        },
                    ],
                    'environment': [
                        {
                            'name': 'string',
                            'value': 'string'
                        },
                    ],
                    'mountPoints': [
                        {
                            'containerPath': 'string',
                            'readOnly': True|False,
                            'sourceVolume': 'string'
                        },
                    ],
                    'readonlyRootFilesystem': True|False,
                    'privileged': True|False,
                    'ulimits': [
                        {
                            'hardLimit': 123,
                            'name': 'string',
                            'softLimit': 123
                        },
                    ],
                    'user': 'string',
                    'instanceType': 'string',
                    'resourceRequirements': [
                        {
                            'value': 'string',
                            'type': 'GPU'
                        },
                    ]
                }
            },
        ]
    },
    retryStrategy={
        'attempts': 123
    },
    timeout={
        'attemptDurationSeconds': 123
    }
)
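
As a minimal sketch of the request syntax above, the following builds the arguments for a single-node job definition that reserves GPUs through the new resourceRequirements field. The helper names build_gpu_job_definition and register, the job definition name, and the image name are hypothetical. The register function needs boto3 and AWS credentials, so it is defined but not called here.

```python
def build_gpu_job_definition(name, image, vcpus, memory_mib, gpus, command):
    """Build keyword arguments for register_job_definition with a GPU requirement."""
    if gpus < 1:
        raise ValueError("gpus must be at least 1")
    return {
        "jobDefinitionName": name,
        "type": "container",
        "containerProperties": {
            "image": image,
            "vcpus": vcpus,
            "memory": memory_mib,
            "command": list(command),
            # The GPU count is passed as a string, per the request syntax.
            "resourceRequirements": [{"type": "GPU", "value": str(gpus)}],
        },
    }


def register(request):
    """Send the request to AWS Batch (requires boto3 and valid credentials)."""
    import boto3  # imported lazily so the builder works without boto3 installed

    client = boto3.client("batch")
    return client.register_job_definition(**request)


# Hypothetical values for illustration only.
request = build_gpu_job_definition(
    name="gpu-training-example",
    image="my-registry/my-gpu-image:latest",
    vcpus=4,
    memory_mib=16384,
    gpus=1,
    command=["nvidia-smi"],
)
```

Keeping the request construction separate from the network call makes the payload easy to inspect or unit test before anything is registered.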
type jobDefinitionName

string

param jobDefinitionName

[REQUIRED]

The name of the job definition to register. Up to 128 letters (uppercase and lowercase), numbers, hyphens, and underscores are allowed.

type type

string

param type

[REQUIRED]

The type of job definition.

type parameters

dict

param parameters

Default parameter substitution placeholders to set in the job definition. Parameters are specified as a key-value pair mapping. Parameters in a SubmitJob request override any corresponding parameter defaults from the job definition.

  • (string) --

    • (string) --
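
The override behaviour described for parameters can be sketched as a dictionary merge. The substitution itself is performed by the AWS Batch service, so the code below only mimics the semantics locally. It assumes the Ref::name placeholder form described in the AWS Batch User Guide (not shown on this page) and, as a simplification, replaces only whole command arguments. The function names are hypothetical.

```python
def effective_parameters(definition_defaults, submit_overrides=None):
    """Merge job definition defaults with SubmitJob parameters; overrides win."""
    merged = dict(definition_defaults)
    merged.update(submit_overrides or {})
    return merged


def substitute_placeholders(command, parameters):
    """Replace 'Ref::name' arguments with the matching parameter value."""
    prefix = "Ref::"
    result = []
    for argument in command:
        key = argument[len(prefix):] if argument.startswith(prefix) else None
        if key is not None and key in parameters:
            result.append(parameters[key])
        else:
            # Arguments without a known placeholder are passed through unchanged.
            result.append(argument)
    return result


# Hypothetical defaults from a job definition and overrides from SubmitJob.
defaults = {"codec": "mp4", "input": "in.mov"}
overrides = {"codec": "webm"}
parameters = effective_parameters(defaults, overrides)
command = substitute_placeholders(
    ["ffmpeg", "-i", "Ref::input", "-c", "Ref::codec"], parameters
)
```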

type containerProperties

dict

param containerProperties

An object with various properties specific to single-node container-based jobs. If the job definition's type parameter is container , then you must specify either containerProperties or nodeProperties .

  • image (string) --

    The image used to start a container. This string is passed directly to the Docker daemon. Images in the Docker Hub registry are available by default. Other repositories are specified with repository-url/image:tag . Up to 255 letters (uppercase and lowercase), numbers, hyphens, underscores, colons, periods, forward slashes, and number signs are allowed. This parameter maps to Image in the Create a container section of the Docker Remote API and the IMAGE parameter of docker run .

    • Images in Amazon ECR repositories use the full registry and repository URI (for example, 012345678910.dkr.ecr.<region-name>.amazonaws.com/<repository-name> ).

    • Images in official repositories on Docker Hub use a single name (for example, ubuntu or mongo ).

    • Images in other repositories on Docker Hub are qualified with an organization name (for example, amazon/amazon-ecs-agent ).

    • Images in other online repositories are qualified further by a domain name (for example, quay.io/assemblyline/ubuntu ).

  • vcpus (integer) --

    The number of vCPUs reserved for the container. This parameter maps to CpuShares in the Create a container section of the Docker Remote API and the --cpu-shares option to docker run . Each vCPU is equivalent to 1,024 CPU shares. You must specify at least one vCPU.

  • memory (integer) --

    The hard limit (in MiB) of memory to present to the container. If your container attempts to exceed the memory specified here, the container is killed. This parameter maps to Memory in the Create a container section of the Docker Remote API and the --memory option to docker run . You must specify at least 4 MiB of memory for a job.

    Note

    If you are trying to maximize your resource utilization by providing your jobs as much memory as possible for a particular instance type, see Memory Management in the AWS Batch User Guide .

  • command (list) --

    The command that is passed to the container. This parameter maps to Cmd in the Create a container section of the Docker Remote API and the COMMAND parameter to docker run . For more information, see https://docs.docker.com/engine/reference/builder/#cmd .

    • (string) --

  • jobRoleArn (string) --

    The Amazon Resource Name (ARN) of the IAM role that the container can assume for AWS permissions.

  • volumes (list) --

    A list of data volumes used in a job.

    • (dict) --

      A data volume used in a job's container properties.

      • host (dict) --

        The contents of the host parameter determine whether your data volume persists on the host container instance and where it is stored. If the host parameter is empty, then the Docker daemon assigns a host path for your data volume. However, the data is not guaranteed to persist after the containers associated with it stop running.

        • sourcePath (string) --

          The path on the host container instance that is presented to the container. If this parameter is empty, then the Docker daemon has assigned a host path for you. If the host parameter contains a sourcePath file location, then the data volume persists at the specified location on the host container instance until you delete it manually. If the sourcePath value does not exist on the host container instance, the Docker daemon creates it. If the location does exist, the contents of the source path folder are exported.

      • name (string) --

        The name of the volume. Up to 255 letters (uppercase and lowercase), numbers, hyphens, and underscores are allowed. This name is referenced in the sourceVolume parameter of container definition mountPoints .

  • environment (list) --

    The environment variables to pass to a container. This parameter maps to Env in the Create a container section of the Docker Remote API and the --env option to docker run .

    Warning

    We do not recommend using plaintext environment variables for sensitive information, such as credential data.

    Note

    Environment variables must not start with AWS_BATCH ; this naming convention is reserved for variables that are set by the AWS Batch service.

    • (dict) --

      A key-value pair object.

      • name (string) --

        The name of the key-value pair. For environment variables, this is the name of the environment variable.

      • value (string) --

        The value of the key-value pair. For environment variables, this is the value of the environment variable.

  • mountPoints (list) --

    The mount points for data volumes in your container. This parameter maps to Volumes in the Create a container section of the Docker Remote API and the --volume option to docker run .

    • (dict) --

      Details on a Docker volume mount point that is used in a job's container properties.

      • containerPath (string) --

        The path on the container at which to mount the host volume.

      • readOnly (boolean) --

        If this value is true , the container has read-only access to the volume; otherwise, the container can write to the volume. The default value is false .

      • sourceVolume (string) --

        The name of the volume to mount.

  • readonlyRootFilesystem (boolean) --

    When this parameter is true, the container is given read-only access to its root file system. This parameter maps to ReadonlyRootfs in the Create a container section of the Docker Remote API and the --read-only option to docker run .

  • privileged (boolean) --

    When this parameter is true, the container is given elevated privileges on the host container instance (similar to the root user). This parameter maps to Privileged in the Create a container section of the Docker Remote API and the --privileged option to docker run .

  • ulimits (list) --

    A list of ulimits to set in the container. This parameter maps to Ulimits in the Create a container section of the Docker Remote API and the --ulimit option to docker run .

    • (dict) --

      The ulimit settings to pass to the container.

      • hardLimit (integer) -- [REQUIRED]

        The hard limit for the ulimit type.

      • name (string) -- [REQUIRED]

        The type of the ulimit .

      • softLimit (integer) -- [REQUIRED]

        The soft limit for the ulimit type.

  • user (string) --

    The user name to use inside the container. This parameter maps to User in the Create a container section of the Docker Remote API and the --user option to docker run .

  • instanceType (string) --

    The instance type to use for a multi-node parallel job. Currently all node groups in a multi-node parallel job must use the same instance type. This parameter is not valid for single-node container jobs.

  • resourceRequirements (list) --

    The type and amount of a resource to assign to a container. Currently, the only supported resource is GPU .

    • (dict) --

      The type and amount of a resource to assign to a container. Currently, the only supported resource type is GPU .

      • value (string) -- [REQUIRED]

        The number of physical GPUs to reserve for the container. The number of GPUs reserved for all containers in a job should not exceed the number of available GPUs on the compute resource that the job is launched on.

      • type (string) -- [REQUIRED]

        The type of resource to assign to a container. Currently, the only supported resource type is GPU .
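As a minimal sketch of the fields above, the following helper builds a containerProperties dict that reserves GPUs. The helper name, the image, the command, and the job definition name in the trailing comment are illustrative assumptions, as is the rule that the GPU count must be positive; the vCPU and memory minimums come from the descriptions above.

```python
def gpu_container_properties(image, vcpus, memory_mib, gpus, command=None):
    """Build a containerProperties dict that reserves `gpus` physical GPUs."""
    if vcpus < 1:
        raise ValueError("at least one vCPU is required")
    if memory_mib < 4:
        raise ValueError("at least 4 MiB of memory is required")
    if gpus < 1:
        # Assumption for this sketch: a GPU requirement of zero is not useful.
        raise ValueError("gpus must be a positive integer")
    props = {
        "image": image,
        "vcpus": vcpus,
        "memory": memory_mib,
        # 'value' is typed as a string in the API, so the count is stringified.
        "resourceRequirements": [{"type": "GPU", "value": str(gpus)}],
    }
    if command:
        props["command"] = list(command)
    return props


props = gpu_container_properties(
    "ubuntu", vcpus=4, memory_mib=8192, gpus=1, command=["nvidia-smi"]
)
# With a Batch client, the dict could then be passed along these lines:
# client.register_job_definition(jobDefinitionName="gpu-example",
#                                type="container", containerProperties=props)
```

Keeping the dict construction separate from the API call makes it possible to check the payload without AWS access.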

type nodeProperties

dict

param nodeProperties

An object with various properties specific to multi-node parallel jobs. If you specify node properties for a job, it becomes a multi-node parallel job. For more information, see Multi-node Parallel Jobs in the AWS Batch User Guide . If the job definition's type parameter is container , then you must specify either containerProperties or nodeProperties .

  • numNodes (integer) -- [REQUIRED]

    The number of nodes associated with a multi-node parallel job.

  • mainNode (integer) -- [REQUIRED]

    Specifies the node index for the main node of a multi-node parallel job. This node index value must be less than the number of nodes.

  • nodeRangeProperties (list) -- [REQUIRED]

    A list of node ranges and their properties associated with a multi-node parallel job.

    • (dict) --

      An object representing the properties of the node range for a multi-node parallel job.

      • targetNodes (string) -- [REQUIRED]

        The range of nodes, using node index values. A range of 0:3 indicates nodes with index values of 0 through 3 . If the starting range value is omitted (:n ), then 0 is used to start the range. If the ending range value is omitted (n: ), then the highest possible node index is used to end the range. Your cumulative node ranges must account for all nodes (0:n). You may nest node ranges, for example 0:10 and 4:5, in which case the 4:5 range properties override the 0:10 properties.

      • container (dict) --

        The container details for the node range.

        • image (string) --

          The image used to start a container. This string is passed directly to the Docker daemon. Images in the Docker Hub registry are available by default. Other repositories are specified with repository-url/image:tag . Up to 255 letters (uppercase and lowercase), numbers, hyphens, underscores, colons, periods, forward slashes, and number signs are allowed. This parameter maps to Image in the Create a container section of the Docker Remote API and the IMAGE parameter of docker run .

          • Images in Amazon ECR repositories use the full registry and repository URI (for example, 012345678910.dkr.ecr.<region-name>.amazonaws.com/<repository-name> ).

          • Images in official repositories on Docker Hub use a single name (for example, ubuntu or mongo ).

          • Images in other repositories on Docker Hub are qualified with an organization name (for example, amazon/amazon-ecs-agent ).

          • Images in other online repositories are qualified further by a domain name (for example, quay.io/assemblyline/ubuntu ).

        • vcpus (integer) --

          The number of vCPUs reserved for the container. This parameter maps to CpuShares in the Create a container section of the Docker Remote API and the --cpu-shares option to docker run . Each vCPU is equivalent to 1,024 CPU shares. You must specify at least one vCPU.

        • memory (integer) --

          The hard limit (in MiB) of memory to present to the container. If your container attempts to exceed the memory specified here, the container is killed. This parameter maps to Memory in the Create a container section of the Docker Remote API and the --memory option to docker run . You must specify at least 4 MiB of memory for a job.

          Note

          If you are trying to maximize your resource utilization by providing your jobs as much memory as possible for a particular instance type, see Memory Management in the AWS Batch User Guide .

        • command (list) --

          The command that is passed to the container. This parameter maps to Cmd in the Create a container section of the Docker Remote API and the COMMAND parameter to docker run . For more information, see https://docs.docker.com/engine/reference/builder/#cmd .

          • (string) --

        • jobRoleArn (string) --

          The Amazon Resource Name (ARN) of the IAM role that the container can assume for AWS permissions.

        • volumes (list) --

          A list of data volumes used in a job.

          • (dict) --

            A data volume used in a job's container properties.

            • host (dict) --

              The contents of the host parameter determine whether your data volume persists on the host container instance and where it is stored. If the host parameter is empty, then the Docker daemon assigns a host path for your data volume. However, the data is not guaranteed to persist after the containers associated with it stop running.

              • sourcePath (string) --

                The path on the host container instance that is presented to the container. If this parameter is empty, then the Docker daemon has assigned a host path for you. If the host parameter contains a sourcePath file location, then the data volume persists at the specified location on the host container instance until you delete it manually. If the sourcePath value does not exist on the host container instance, the Docker daemon creates it. If the location does exist, the contents of the source path folder are exported.

            • name (string) --

              The name of the volume. Up to 255 letters (uppercase and lowercase), numbers, hyphens, and underscores are allowed. This name is referenced in the sourceVolume parameter of container definition mountPoints .

        • environment (list) --

          The environment variables to pass to a container. This parameter maps to Env in the Create a container section of the Docker Remote API and the --env option to docker run .

          Warning

          We do not recommend using plaintext environment variables for sensitive information, such as credential data.

          Note

          Environment variables must not start with AWS_BATCH ; this naming convention is reserved for variables that are set by the AWS Batch service.

          • (dict) --

            A key-value pair object.

            • name (string) --

              The name of the key-value pair. For environment variables, this is the name of the environment variable.

            • value (string) --

              The value of the key-value pair. For environment variables, this is the value of the environment variable.

        • mountPoints (list) --

          The mount points for data volumes in your container. This parameter maps to Volumes in the Create a container section of the Docker Remote API and the --volume option to docker run .

          • (dict) --

            Details on a Docker volume mount point that is used in a job's container properties.

            • containerPath (string) --

              The path on the container at which to mount the host volume.

            • readOnly (boolean) --

              If this value is true , the container has read-only access to the volume; otherwise, the container can write to the volume. The default value is false .

            • sourceVolume (string) --

              The name of the volume to mount.

        • readonlyRootFilesystem (boolean) --

          When this parameter is true, the container is given read-only access to its root file system. This parameter maps to ReadonlyRootfs in the Create a container section of the Docker Remote API and the --read-only option to docker run .

        • privileged (boolean) --

          When this parameter is true, the container is given elevated privileges on the host container instance (similar to the root user). This parameter maps to Privileged in the Create a container section of the Docker Remote API and the --privileged option to docker run .

        • ulimits (list) --

          A list of ulimits to set in the container. This parameter maps to Ulimits in the Create a container section of the Docker Remote API and the --ulimit option to docker run .

          • (dict) --

            The ulimit settings to pass to the container.

            • hardLimit (integer) -- [REQUIRED]

              The hard limit for the ulimit type.

            • name (string) -- [REQUIRED]

              The type of the ulimit .

            • softLimit (integer) -- [REQUIRED]

              The soft limit for the ulimit type.

        • user (string) --

          The user name to use inside the container. This parameter maps to User in the Create a container section of the Docker Remote API and the --user option to docker run .

        • instanceType (string) --

          The instance type to use for a multi-node parallel job. Currently all node groups in a multi-node parallel job must use the same instance type. This parameter is not valid for single-node container jobs.

        • resourceRequirements (list) --

          The type and amount of a resource to assign to a container. Currently, the only supported resource is GPU .

          • (dict) --

            The type and amount of a resource to assign to a container. Currently, the only supported resource type is GPU .

            • value (string) -- [REQUIRED]

              The number of physical GPUs to reserve for the container. The number of GPUs reserved for all containers in a job should not exceed the number of available GPUs on the compute resource that the job is launched on.

            • type (string) -- [REQUIRED]

              The type of resource to assign to a container. Currently, the only supported resource type is GPU .
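The targetNodes rules described for nodeRangeProperties can be sketched as a small resolver. This is an illustration under stated assumptions, not the service's implementation: the function names are hypothetical, every range is assumed to contain a colon, and "nested ranges override" is read here as "the narrowest range containing a node wins".

```python
def parse_node_range(target_nodes, num_nodes):
    """Return an inclusive (start, end) pair; omitted ends are filled in."""
    start_text, sep, end_text = target_nodes.partition(":")
    if not sep:
        # Assumption for this sketch: ranges are always written with a colon.
        raise ValueError(f"expected 'start:end', got {target_nodes!r}")
    start = int(start_text) if start_text else 0
    end = int(end_text) if end_text else num_nodes - 1
    if not 0 <= start <= end < num_nodes:
        raise ValueError(f"range {target_nodes!r} is invalid for {num_nodes} nodes")
    return start, end


def resolve_node_ranges(target_ranges, num_nodes):
    """Map each node index to the narrowest range that contains it."""
    parsed = [(r, parse_node_range(r, num_nodes)) for r in target_ranges]
    resolved = {}
    for index in range(num_nodes):
        matches = [
            (end - start, r) for r, (start, end) in parsed if start <= index <= end
        ]
        if not matches:
            # The ranges taken together must account for every node.
            raise ValueError(f"node {index} is not covered by any range")
        resolved[index] = min(matches)[1]
    return resolved


assignment = resolve_node_ranges(["0:10", "4:5"], num_nodes=11)
# Nodes 4 and 5 take their properties from "4:5"; all others from "0:10".
```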

type retryStrategy

dict

param retryStrategy

The retry strategy to use for failed jobs that are submitted with this job definition. Any retry strategy that is specified during a SubmitJob operation overrides the retry strategy defined here. If a job is terminated due to a timeout, it is not retried.

  • attempts (integer) --

    The number of times to move a job to the RUNNABLE status. You may specify between 1 and 10 attempts. If the value of attempts is greater than one, a job that fails is retried until it has made that many attempts in total.

type timeout

dict

param timeout

The timeout configuration for jobs that are submitted with this job definition, after which AWS Batch terminates your jobs if they have not finished. If a job is terminated due to a timeout, it is not retried. The minimum value for the timeout is 60 seconds. Any timeout configuration that is specified during a SubmitJob operation overrides the timeout configuration defined here. For more information, see Job Timeouts in the AWS Batch User Guide .

  • attemptDurationSeconds (integer) --

    The time duration in seconds (measured from the job attempt's startedAt timestamp) after which AWS Batch terminates your jobs if they have not finished.
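The limits stated for retryStrategy and timeout (1 to 10 attempts, a minimum timeout of 60 seconds) can be checked before a request is sent. The helper below is a hypothetical sketch; only the dict keys come from the request syntax.

```python
def retry_and_timeout(attempts=1, attempt_duration_seconds=None):
    """Return keyword arguments carrying retryStrategy and, optionally, timeout."""
    if not 1 <= attempts <= 10:
        raise ValueError("attempts must be between 1 and 10")
    kwargs = {"retryStrategy": {"attempts": attempts}}
    if attempt_duration_seconds is not None:
        if attempt_duration_seconds < 60:
            raise ValueError("the minimum timeout is 60 seconds")
        kwargs["timeout"] = {"attemptDurationSeconds": attempt_duration_seconds}
    return kwargs


settings = retry_and_timeout(attempts=3, attempt_duration_seconds=600)
# The result can be splatted into a request, for example:
# client.register_job_definition(..., **settings)
```

Note that a job terminated by the timeout is not retried, so a large attempts value does not extend the total run time of a job that keeps timing out.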

rtype

dict

returns

Response Syntax

{
    'jobDefinitionName': 'string',
    'jobDefinitionArn': 'string',
    'revision': 123
}

Response Structure

  • (dict) --

    • jobDefinitionName (string) --

      The name of the job definition.

    • jobDefinitionArn (string) --

      The Amazon Resource Name (ARN) of the job definition.

    • revision (integer) --

      The revision of the job definition.

SubmitJob (updated)
Changes (request)
{'containerOverrides': {'resourceRequirements': [{'type': 'GPU',
                                                  'value': 'string'}]},
 'nodeOverrides': {'nodePropertyOverrides': {'containerOverrides': {'resourceRequirements': [{'type': 'GPU',
                                                                                              'value': 'string'}]}},
                   'numNodes': 'integer'}}

Submits an AWS Batch job from a job definition. Parameters specified during SubmitJob override parameters defined in the job definition.

See also: AWS API Documentation

Request Syntax

client.submit_job(
    jobName='string',
    jobQueue='string',
    arrayProperties={
        'size': 123
    },
    dependsOn=[
        {
            'jobId': 'string',
            'type': 'N_TO_N'|'SEQUENTIAL'
        },
    ],
    jobDefinition='string',
    parameters={
        'string': 'string'
    },
    containerOverrides={
        'vcpus': 123,
        'memory': 123,
        'command': [
            'string',
        ],
        'instanceType': 'string',
        'environment': [
            {
                'name': 'string',
                'value': 'string'
            },
        ],
        'resourceRequirements': [
            {
                'value': 'string',
                'type': 'GPU'
            },
        ]
    },
    nodeOverrides={
        'numNodes': 123,
        'nodePropertyOverrides': [
            {
                'targetNodes': 'string',
                'containerOverrides': {
                    'vcpus': 123,
                    'memory': 123,
                    'command': [
                        'string',
                    ],
                    'instanceType': 'string',
                    'environment': [
                        {
                            'name': 'string',
                            'value': 'string'
                        },
                    ],
                    'resourceRequirements': [
                        {
                            'value': 'string',
                            'type': 'GPU'
                        },
                    ]
                }
            },
        ]
    },
    retryStrategy={
        'attempts': 123
    },
    timeout={
        'attemptDurationSeconds': 123
    }
)
type jobName

string

param jobName

[REQUIRED]

The name of the job. The first character must be alphanumeric, and up to 128 letters (uppercase and lowercase), numbers, hyphens, and underscores are allowed.

type jobQueue

string

param jobQueue

[REQUIRED]

The job queue into which the job is submitted. You can specify either the name or the Amazon Resource Name (ARN) of the queue.

type arrayProperties

dict

param arrayProperties

The array properties for the submitted job, such as the size of the array. The array size can be between 2 and 10,000. If you specify array properties for a job, it becomes an array job. For more information, see Array Jobs in the AWS Batch User Guide .

  • size (integer) --

    The size of the array job.

type dependsOn

list

param dependsOn

A list of dependencies for the job. A job can depend upon a maximum of 20 jobs. You can specify a SEQUENTIAL type dependency without specifying a job ID for array jobs so that each child array job completes sequentially, starting at index 0. You can also specify an N_TO_N type dependency with a job ID for array jobs. In that case, each index child of this job must wait for the corresponding index child of each dependency to complete before it can begin.

  • (dict) --

    An object representing an AWS Batch job dependency.

    • jobId (string) --

      The job ID of the AWS Batch job associated with this dependency.

    • type (string) --

      The type of the job dependency.
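A minimal sketch of how the arrayProperties and dependsOn values described above might be assembled. The helper names and the job ID are hypothetical; the size bounds, the 20-job limit, and the two dependency types come from the text. Whether a SEQUENTIAL entry without a job ID counts toward the 20-job limit is not stated, so this sketch only counts explicit job IDs.

```python
def array_properties(size):
    """Build arrayProperties; the array size can be between 2 and 10,000."""
    if not 2 <= size <= 10000:
        raise ValueError("array size must be between 2 and 10,000")
    return {"size": size}


def depends_on(job_ids=(), dependency_type=None, sequential=False):
    """Build a dependsOn list.

    sequential=True adds a SEQUENTIAL entry with no job ID, so that the
    children of an array job run one after another starting at index 0.
    """
    job_ids = list(job_ids)
    if len(job_ids) > 20:
        raise ValueError("a job can depend on at most 20 jobs")
    if dependency_type not in (None, "N_TO_N", "SEQUENTIAL"):
        raise ValueError(f"unknown dependency type: {dependency_type!r}")
    deps = [{"type": "SEQUENTIAL"}] if sequential else []
    for job_id in job_ids:
        dep = {"jobId": job_id}
        if dependency_type is not None:
            dep["type"] = dependency_type
        deps.append(dep)
    return deps


# Each child of this array job waits for the matching child of an earlier one.
deps = depends_on(["example-job-id"], dependency_type="N_TO_N")
array = array_properties(10)
```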

type jobDefinition

string

param jobDefinition

[REQUIRED]

The job definition used by this job. This value can be either a name:revision or the Amazon Resource Name (ARN) for the job definition.

type parameters

dict

param parameters

Additional parameters passed to the job that replace parameter substitution placeholders that are set in the job definition. Parameters are specified as a key and value pair mapping. Parameters in a SubmitJob request override any corresponding parameter defaults from the job definition.

  • (string) --

    • (string) --
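The override behavior of parameters can be illustrated locally. This sketch assumes the Ref::name placeholder form used in AWS Batch job definition commands and, for simplicity, only replaces tokens that consist entirely of a placeholder; the function names, parameter names, and command are hypothetical.

```python
def effective_parameters(definition_defaults, submit_parameters):
    """Parameters given at submission override defaults from the definition."""
    merged = dict(definition_defaults)
    merged.update(submit_parameters)
    return merged


def substitute(command, parameters):
    """Replace whole-token 'Ref::name' placeholders with parameter values."""
    resolved = []
    for token in command:
        if token.startswith("Ref::") and token[5:] in parameters:
            resolved.append(parameters[token[5:]])
        else:
            # Unknown placeholders and ordinary tokens are left untouched.
            resolved.append(token)
    return resolved


defaults = {"codec": "mp4", "inputfile": "in.mov"}
params = effective_parameters(defaults, {"codec": "webm"})
command = substitute(["convert", "Ref::inputfile", "--codec", "Ref::codec"], params)
```

Here the submitted codec value replaces the default, while inputfile falls back to the value from the job definition.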

type containerOverrides

dict

param containerOverrides

The container overrides to apply to the container in the specified job definition. You can override the default command for a container (that is specified in the job definition or the Docker image) with a command override. You can also override existing environment variables (that are specified in the job definition or Docker image) on a container or add new environment variables to it with an environment override.

  • vcpus (integer) --

    The number of vCPUs to reserve for the container. This value overrides the value set in the job definition.

  • memory (integer) --

    The number of MiB of memory reserved for the job. This value overrides the value set in the job definition.

  • command (list) --

    The command to send to the container that overrides the default command from the Docker image or the job definition.

    • (string) --

  • instanceType (string) --

    The instance type to use for a multi-node parallel job. This parameter is not valid for single-node container jobs.

  • environment (list) --

    The environment variables to send to the container. You can add new environment variables, which are added to the container at launch, or you can override the existing environment variables from the Docker image or the job definition.

    Note

    Environment variables must not start with AWS_BATCH ; this naming convention is reserved for variables that are set by the AWS Batch service.

    • (dict) --

      A key-value pair object.

      • name (string) --

        The name of the key-value pair. For environment variables, this is the name of the environment variable.

      • value (string) --

        The value of the key-value pair. For environment variables, this is the value of the environment variable.

  • resourceRequirements (list) --

    The type and amount of a resource to assign to a container. This value overrides the value set in the job definition. Currently, the only supported resource is GPU .

    • (dict) --

      The type and amount of a resource to assign to a container. Currently, the only supported resource type is GPU .

      • value (string) -- [REQUIRED]

        The number of physical GPUs to reserve for the container. The number of GPUs reserved for all containers in a job should not exceed the number of available GPUs on the compute resource that the job is launched on.

      • type (string) -- [REQUIRED]

        The type of resource to assign to a container. Currently, the only supported resource type is GPU .
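The following sketch assembles a containerOverrides dict with a GPU override and optional environment variables. The helper name and the variable names are hypothetical; the AWS_BATCH prefix check reflects the note above.

```python
def gpu_overrides(gpus, vcpus=None, memory_mib=None, environment=None):
    """Build containerOverrides that replace the GPU count from the definition."""
    overrides = {
        # 'value' is a string in the API even though it carries a count.
        "resourceRequirements": [{"type": "GPU", "value": str(gpus)}],
    }
    if vcpus is not None:
        overrides["vcpus"] = vcpus
    if memory_mib is not None:
        overrides["memory"] = memory_mib
    if environment:
        for name in environment:
            if name.startswith("AWS_BATCH"):
                raise ValueError(f"{name!r} uses the reserved AWS_BATCH prefix")
        overrides["environment"] = [
            {"name": name, "value": value} for name, value in environment.items()
        ]
    return overrides


overrides = gpu_overrides(2, memory_mib=16384, environment={"EPOCHS": "10"})
```

Only the keys that are set appear in the result, so settings that are not overridden keep the values from the job definition.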

type nodeOverrides

dict

param nodeOverrides

The node overrides for the job, which specify the node range to target and the container overrides for that node range.

  • numNodes (integer) --

    The number of nodes to use with a multi-node parallel job. This value overrides the number of nodes that are specified in the job definition. To use this override:

    • There must be at least one node range in your job definition that has an open upper boundary (such as : or n: ).

    • The lower boundary of the node range specified in the job definition must be less than the number of nodes specified in the override.

    • The main node index specified in the job definition must be less than the number of nodes specified in the override.

  • nodePropertyOverrides (list) --

    The node property overrides for the job.

    • (dict) --

      Object representing any node overrides to a job definition that is used in a SubmitJob API operation.

      • targetNodes (string) -- [REQUIRED]

        The range of nodes, using node index values, with which to override. A range of 0:3 indicates nodes with index values of 0 through 3 . If the starting range value is omitted (:n ), then 0 is used to start the range. If the ending range value is omitted (n: ), then the highest possible node index is used to end the range.

      • containerOverrides (dict) --

        The overrides that should be sent to a node range.

        • vcpus (integer) --

          The number of vCPUs to reserve for the container. This value overrides the value set in the job definition.

        • memory (integer) --

          The number of MiB of memory reserved for the job. This value overrides the value set in the job definition.

        • command (list) --

          The command to send to the container that overrides the default command from the Docker image or the job definition.

          • (string) --

        • instanceType (string) --

          The instance type to use for a multi-node parallel job. This parameter is not valid for single-node container jobs.

        • environment (list) --

          The environment variables to send to the container. You can add new environment variables, which are added to the container at launch, or you can override the existing environment variables from the Docker image or the job definition.

          Note

          Environment variables must not start with AWS_BATCH ; this naming convention is reserved for variables that are set by the AWS Batch service.

          • (dict) --

            A key-value pair object.

            • name (string) --

              The name of the key-value pair. For environment variables, this is the name of the environment variable.

            • value (string) --

              The value of the key-value pair. For environment variables, this is the value of the environment variable.

        • resourceRequirements (list) --

          The type and amount of a resource to assign to a container. This value overrides the value set in the job definition. Currently, the only supported resource is GPU .

          • (dict) --

            The type and amount of a resource to assign to a container. Currently, the only supported resource type is GPU .

            • value (string) -- [REQUIRED]

              The number of physical GPUs to reserve for the container. The number of GPUs reserved for all containers in a job should not exceed the number of available GPUs on the compute resource that the job is launched on.

            • type (string) -- [REQUIRED]

              The type of resource to assign to a container. Currently, the only supported resource type is GPU .
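The three conditions listed for the numNodes override can be checked client-side before submitting. This is a hypothetical helper, not part of the SDK, and it assumes that the lower-boundary condition applies to the ranges with an open upper boundary.

```python
def check_num_nodes_override(num_nodes, main_node, target_ranges):
    """Return a list of problems; an empty list means the override looks valid.

    target_ranges holds the targetNodes strings from the job definition and
    main_node is the definition's mainNode index.
    """
    problems = []
    open_ranges = [r for r in target_ranges if r.endswith(":")]
    if not open_ranges:
        problems.append("no node range has an open upper boundary")
    for node_range in open_ranges:
        start_text = node_range[:-1]
        lower = int(start_text) if start_text else 0  # ':' starts at 0
        if lower >= num_nodes:
            problems.append(
                f"lower boundary {lower} of {node_range!r} is not less than {num_nodes}"
            )
    if main_node >= num_nodes:
        problems.append(f"main node {main_node} is not less than {num_nodes}")
    return problems


ok = check_num_nodes_override(8, main_node=0, target_ranges=["0:3", "4:"])
too_small = check_num_nodes_override(3, main_node=0, target_ranges=["0:3", "4:"])
```

In the second call the open range starts at node 4, which does not exist when only 3 nodes are requested, so one problem is reported.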

type retryStrategy

dict

param retryStrategy

The retry strategy to use for failed jobs from this SubmitJob operation. When a retry strategy is specified here, it overrides the retry strategy defined in the job definition.

  • attempts (integer) --

    The number of times to move a job to the RUNNABLE status. You may specify between 1 and 10 attempts. If the value of attempts is greater than one, a job that fails is retried until it has made that many attempts in total.

type timeout

dict

param timeout

The timeout configuration for this SubmitJob operation. You can specify a timeout duration after which AWS Batch terminates your jobs if they have not finished. If a job is terminated due to a timeout, it is not retried. The minimum value for the timeout is 60 seconds. This configuration overrides any timeout configuration specified in the job definition. For array jobs, child jobs have the same timeout configuration as the parent job. For more information, see Job Timeouts in the AWS Batch User Guide .

  • attemptDurationSeconds (integer) --

    The time duration in seconds (measured from the job attempt's startedAt timestamp) after which AWS Batch terminates your jobs if they have not finished.

rtype

dict

returns

Response Syntax

{
    'jobName': 'string',
    'jobId': 'string'
}

Response Structure

  • (dict) --

    • jobName (string) --

      The name of the job.

    • jobId (string) --

      The unique identifier for the job.
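Putting the request and response together, a submission wrapper might look like the sketch below. It takes the client as an argument so that it can be exercised offline; StubClient, the queue name, the job definition name, and the returned job ID are stand-ins invented for this example, and the regular expression is one reading of the jobName rule above.

```python
import re

# First character alphanumeric; at most 128 letters, numbers, hyphens,
# and underscores in total.
JOB_NAME = re.compile(r"[A-Za-z0-9][A-Za-z0-9_-]{0,127}")


def submit(client, job_name, job_queue, job_definition, **optional):
    """Submit a job and return its jobId from the response."""
    if not JOB_NAME.fullmatch(job_name):
        raise ValueError(f"invalid job name: {job_name!r}")
    response = client.submit_job(
        jobName=job_name,
        jobQueue=job_queue,
        jobDefinition=job_definition,
        **optional,
    )
    return response["jobId"]


class StubClient:
    """Stand-in for a Batch client so the sketch runs without AWS access."""

    def submit_job(self, **kwargs):
        self.last_request = kwargs
        return {"jobName": kwargs["jobName"], "jobId": "example-job-id"}


stub = StubClient()
job_id = submit(
    stub,
    "gpu-train-1",
    "example-queue",
    "example-definition:1",
    containerOverrides={"resourceRequirements": [{"type": "GPU", "value": "1"}]},
)
```

With a real client created by boto3.client('batch'), the same function would return the identifier assigned by the service.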