AWS Database Migration Service

2019/07/17 - AWS Database Migration Service - 6 updated api methods

Changes  S3 endpoint settings update: 1) Option to append operation column to full-load files. 2) Option to add a commit timestamp column to full-load and cdc files. Updated DescribeAccountAttributes to include UniqueAccountIdentifier.

CreateEndpoint (updated) Link ¶
Changes (request, response)
Request
{'S3Settings': {'IncludeOpForFullLoad': 'boolean',
                'TimestampColumnName': 'string'}}
Response
{'Endpoint': {'S3Settings': {'IncludeOpForFullLoad': 'boolean',
                             'TimestampColumnName': 'string'}}}

Creates an endpoint using the provided settings.

See also: AWS API Documentation

Request Syntax

client.create_endpoint(
    EndpointIdentifier='string',
    EndpointType='source'|'target',
    EngineName='string',
    Username='string',
    Password='string',
    ServerName='string',
    Port=123,
    DatabaseName='string',
    ExtraConnectionAttributes='string',
    KmsKeyId='string',
    Tags=[
        {
            'Key': 'string',
            'Value': 'string'
        },
    ],
    CertificateArn='string',
    SslMode='none'|'require'|'verify-ca'|'verify-full',
    ServiceAccessRoleArn='string',
    ExternalTableDefinition='string',
    DynamoDbSettings={
        'ServiceAccessRoleArn': 'string'
    },
    S3Settings={
        'ServiceAccessRoleArn': 'string',
        'ExternalTableDefinition': 'string',
        'CsvRowDelimiter': 'string',
        'CsvDelimiter': 'string',
        'BucketFolder': 'string',
        'BucketName': 'string',
        'CompressionType': 'none'|'gzip',
        'EncryptionMode': 'sse-s3'|'sse-kms',
        'ServerSideEncryptionKmsKeyId': 'string',
        'DataFormat': 'csv'|'parquet',
        'EncodingType': 'plain'|'plain-dictionary'|'rle-dictionary',
        'DictPageSizeLimit': 123,
        'RowGroupLength': 123,
        'DataPageSize': 123,
        'ParquetVersion': 'parquet-1-0'|'parquet-2-0',
        'EnableStatistics': True|False,
        'IncludeOpForFullLoad': True|False,
        'CdcInsertsOnly': True|False,
        'TimestampColumnName': 'string'
    },
    DmsTransferSettings={
        'ServiceAccessRoleArn': 'string',
        'BucketName': 'string'
    },
    MongoDbSettings={
        'Username': 'string',
        'Password': 'string',
        'ServerName': 'string',
        'Port': 123,
        'DatabaseName': 'string',
        'AuthType': 'no'|'password',
        'AuthMechanism': 'default'|'mongodb_cr'|'scram_sha_1',
        'NestingLevel': 'none'|'one',
        'ExtractDocId': 'string',
        'DocsToInvestigate': 'string',
        'AuthSource': 'string',
        'KmsKeyId': 'string'
    },
    KinesisSettings={
        'StreamArn': 'string',
        'MessageFormat': 'json',
        'ServiceAccessRoleArn': 'string'
    },
    ElasticsearchSettings={
        'ServiceAccessRoleArn': 'string',
        'EndpointUri': 'string',
        'FullLoadErrorPercentage': 123,
        'ErrorRetryDuration': 123
    },
    RedshiftSettings={
        'AcceptAnyDate': True|False,
        'AfterConnectScript': 'string',
        'BucketFolder': 'string',
        'BucketName': 'string',
        'ConnectionTimeout': 123,
        'DatabaseName': 'string',
        'DateFormat': 'string',
        'EmptyAsNull': True|False,
        'EncryptionMode': 'sse-s3'|'sse-kms',
        'FileTransferUploadStreams': 123,
        'LoadTimeout': 123,
        'MaxFileSize': 123,
        'Password': 'string',
        'Port': 123,
        'RemoveQuotes': True|False,
        'ReplaceInvalidChars': 'string',
        'ReplaceChars': 'string',
        'ServerName': 'string',
        'ServiceAccessRoleArn': 'string',
        'ServerSideEncryptionKmsKeyId': 'string',
        'TimeFormat': 'string',
        'TrimBlanks': True|False,
        'TruncateColumns': True|False,
        'Username': 'string',
        'WriteBufferSize': 123
    }
)
type EndpointIdentifier

string

param EndpointIdentifier

[REQUIRED]

The database endpoint identifier. Identifiers must begin with a letter; must contain only ASCII letters, digits, and hyphens; and must not end with a hyphen or contain two consecutive hyphens.

type EndpointType

string

param EndpointType

[REQUIRED]

The type of endpoint. Valid values are source and target .

type EngineName

string

param EngineName

[REQUIRED]

The type of engine for the endpoint. Valid values, depending on the EndpointType value, include mysql , oracle , postgres , mariadb , aurora , aurora-postgresql , redshift , s3 , db2 , azuredb , sybase , dynamodb , mongodb , and sqlserver .

type Username

string

param Username

The user name to be used to log in to the endpoint database.

type Password

string

param Password

The password to be used to log in to the endpoint database.

type ServerName

string

param ServerName

The name of the server where the endpoint database resides.

type Port

integer

param Port

The port used by the endpoint database.

type DatabaseName

string

param DatabaseName

The name of the endpoint database.

type ExtraConnectionAttributes

string

param ExtraConnectionAttributes

Additional attributes associated with the connection. Each attribute is specified as a name-value pair associated by an equal sign (=). Multiple attributes are separated by a semicolon (;) with no additional white space. For information on the attributes available for connecting your source or target endpoint, see Working with AWS DMS Endpoints in the AWS Database Migration Service User Guide.

type KmsKeyId

string

param KmsKeyId

An AWS KMS key identifier that is used to encrypt the connection parameters for the endpoint.

If you don't specify a value for the KmsKeyId parameter, then AWS DMS uses your default encryption key.

AWS KMS creates the default encryption key for your AWS account. Your AWS account has a different default encryption key for each AWS Region.

type Tags

list

param Tags

One or more tags to be assigned to the endpoint.

  • (dict) --

    • Key (string) --

      A key is the required name of the tag. The string value can be from 1 to 128 Unicode characters in length and cannot be prefixed with "aws:" or "dms:". The string can only contain only the set of Unicode letters, digits, white-space, '_', '.', '/', '=', '+', '-' (Java regex: "^([\p{L}\p{Z}\p{N}_.:/=+\-]*)$").

    • Value (string) --

      A value is the optional value of the tag. The string value can be from 1 to 256 Unicode characters in length and cannot be prefixed with "aws:" or "dms:". The string can only contain only the set of Unicode letters, digits, white-space, '_', '.', '/', '=', '+', '-' (Java regex: "^([\p{L}\p{Z}\p{N}_.:/=+\-]*)$").

type CertificateArn

string

param CertificateArn

The Amazon Resource Name (ARN) for the certificate.

type SslMode

string

param SslMode

The Secure Sockets Layer (SSL) mode to use for the SSL connection. The default is none

type ServiceAccessRoleArn

string

param ServiceAccessRoleArn

The Amazon Resource Name (ARN) for the service access role that you want to use to create the endpoint.

type ExternalTableDefinition

string

param ExternalTableDefinition

The external table definition.

type DynamoDbSettings

dict

param DynamoDbSettings

Settings in JSON format for the target Amazon DynamoDB endpoint. For more information about the available settings, see Using Object Mapping to Migrate Data to DynamoDB in the AWS Database Migration Service User Guide.

  • ServiceAccessRoleArn (string) -- [REQUIRED]

    The Amazon Resource Name (ARN) used by the service access IAM role.

type S3Settings

dict

param S3Settings

Settings in JSON format for the target Amazon S3 endpoint. For more information about the available settings, see Extra Connection Attributes When Using Amazon S3 as a Target for AWS DMS in the AWS Database Migration Service User Guide.

  • ServiceAccessRoleArn (string) --

    The Amazon Resource Name (ARN) used by the service access IAM role.

  • ExternalTableDefinition (string) --

    The external table definition.

  • CsvRowDelimiter (string) --

    The delimiter used to separate rows in the source files. The default is a carriage return (\n ).

  • CsvDelimiter (string) --

    The delimiter used to separate columns in the source files. The default is a comma.

  • BucketFolder (string) --

    An optional parameter to set a folder name in the S3 bucket. If provided, tables are created in the path `` bucketFolder /schema_name /table_name /`` . If this parameter is not specified, then the path used is `` schema_name /table_name /`` .

  • BucketName (string) --

    The name of the S3 bucket.

  • CompressionType (string) --

    An optional parameter to use GZIP to compress the target files. Set to GZIP to compress the target files. Set to NONE (the default) or do not use to leave the files uncompressed. Applies to both .csv and .parquet file formats.

  • EncryptionMode (string) --

    The type of server-side encryption that you want to use for your data. This encryption type is part of the endpoint settings or the extra connections attributes for Amazon S3. You can choose either SSE_S3 (the default) or SSE_KMS . To use SSE_S3 , you need an AWS Identity and Access Management (IAM) role with permission to allow "arn:aws:s3:::dms-*" to use the following actions:

    • s3:CreateBucket

    • s3:ListBucket

    • s3:DeleteBucket

    • s3:GetBucketLocation

    • s3:GetObject

    • s3:PutObject

    • s3:DeleteObject

    • s3:GetObjectVersion

    • s3:GetBucketPolicy

    • s3:PutBucketPolicy

    • s3:DeleteBucketPolicy

  • ServerSideEncryptionKmsKeyId (string) --

    If you are using SSE_KMS for the EncryptionMode , provide the AWS KMS key ID. The key that you use needs an attached policy that enables AWS Identity and Access Management (IAM) user permissions and allows use of the key.

    Here is a CLI example: ``aws dms create-endpoint --endpoint-identifier value --endpoint-type target --engine-name s3 --s3-settings ServiceAccessRoleArn=*value* ,BucketFolder=*value* ,BucketName=*value* ,EncryptionMode=SSE_KMS,ServerSideEncryptionKmsKeyId=*value* ``

  • DataFormat (string) --

    The format of the data that you want to use for output. You can choose one of the following:

    • csv : This is a row-based file format with comma-separated values (.csv).

    • parquet : Apache Parquet (.parquet) is a columnar storage file format that features efficient compression and provides faster query response.

  • EncodingType (string) --

    The type of encoding you are using:

    • RLE_DICTIONARY uses a combination of bit-packing and run-length encoding to store repeated values more efficiently. This is the default.

    • PLAIN doesn't use encoding at all. Values are stored as they are.

    • PLAIN_DICTIONARY builds a dictionary of the values encountered in a given column. The dictionary is stored in a dictionary page for each column chunk.

  • DictPageSizeLimit (integer) --

    The maximum size of an encoded dictionary page of a column. If the dictionary page exceeds this, this column is stored using an encoding type of PLAIN . This parameter defaults to 1024 * 1024 bytes (1 MiB), the maximum size of a dictionary page before it reverts to PLAIN encoding. This size is used for .parquet file format only.

  • RowGroupLength (integer) --

    The number of rows in a row group. A smaller row group size provides faster reads. But as the number of row groups grows, the slower writes become. This parameter defaults to 10,000 rows. This number is used for .parquet file format only.

    If you choose a value larger than the maximum, RowGroupLength is set to the max row group length in bytes (64 * 1024 * 1024).

  • DataPageSize (integer) --

    The size of one data page in bytes. This parameter defaults to 1024 * 1024 bytes (1 MiB). This number is used for .parquet file format only.

  • ParquetVersion (string) --

    The version of the Apache Parquet format that you want to use: parquet_1_0 (the default) or parquet_2_0 .

  • EnableStatistics (boolean) --

    A value that enables statistics for Parquet pages and row groups. Choose true to enable statistics, false to disable. Statistics include NULL , DISTINCT , MAX , and MIN values. This parameter defaults to true . This value is used for .parquet file format only.

  • IncludeOpForFullLoad (boolean) --

    A value that enables a full load to write INSERT operations to the comma-separated value (.csv) output files only to indicate how the rows were added to the source database.

    Note

    AWS DMS supports IncludeOpForFullLoad in versions 3.1.4 and later.

    For full load, records can only be inserted. By default (the false setting), no information is recorded in these output files for a full load to indicate that the rows were inserted at the source database. If IncludeOpForFullLoad is set to true or y , the INSERT is recorded as an I annotation in the first field of the .csv file. This allows the format of your target records from a full load to be consistent with the target records from a CDC load.

    Note

    This setting works together with CdcInsertsOnly for output to .csv files only. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. .

  • CdcInsertsOnly (boolean) --

    A value that enables a change data capture (CDC) load to write only INSERT operations to .csv or columnar storage (.parquet) output files. By default (the false setting), the first field in a .csv or .parquet record contains the letter I (INSERT), U (UPDATE), or D (DELETE). These values indicate whether the row was inserted, updated, or deleted at the source database for a CDC load to the target.

    If cdcInsertsOnly is set to true or y , only INSERTs from the source database are migrated to the .csv or .parquet file. For .csv format only, how these INSERTs are recorded depends on the value of IncludeOpForFullLoad . If IncludeOpForFullLoad is set to true , the first field of every CDC record is set to I to indicate the INSERT operation at the source. If IncludeOpForFullLoad is set to false , every CDC record is written without a first field to indicate the INSERT operation at the source. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. .

    Note

    AWS DMS supports this interaction between CdcInsertsOnly and IncludeOpForFullLoad in versions 3.1.4 and later.

  • TimestampColumnName (string) --

    A value that includes a timestamp column in the Amazon S3 target endpoint data. AWS DMS includes an additional column in the migrated data when you set timestampColumnName to a non-blank value.

    Note

    AWS DMS supports TimestampColumnName in versions 3.1.4 and later.

    For a full load, each row of the timestamp column contains a timestamp for when the data was transferred from the source to the target by DMS. For a CDC load, each row of the timestamp column contains the timestamp for the commit of that row in the source database. The format for the timestamp column value is yyyy-MM-dd HH:mm:ss.SSSSSS . For CDC, the microsecond precision depends on the commit timestamp supported by DMS for the source database. When the AddColumnName setting is set to true , DMS also includes the name for the timestamp column that you set as the nonblank value of timestampColumnName .

type DmsTransferSettings

dict

param DmsTransferSettings

The settings in JSON format for the DMS transfer type of source endpoint.

Possible attributes include the following:

  • serviceAccessRoleArn - The IAM role that has permission to access the Amazon S3 bucket.

  • bucketName - The name of the S3 bucket to use.

  • compressionType - An optional parameter to use GZIP to compress the target files. To use GZIP, set this value to NONE (the default). To keep the files uncompressed, don't use this value.

Shorthand syntax for these attributes is as follows: ServiceAccessRoleArn=string,BucketName=string,CompressionType=string

JSON syntax for these attributes is as follows: { "ServiceAccessRoleArn": "string", "BucketName": "string", "CompressionType": "none"|"gzip" }

  • ServiceAccessRoleArn (string) --

    The IAM role that has permission to access the Amazon S3 bucket.

  • BucketName (string) --

    The name of the S3 bucket to use.

type MongoDbSettings

dict

param MongoDbSettings

Settings in JSON format for the source MongoDB endpoint. For more information about the available settings, see the configuration properties section in Using MongoDB as a Target for AWS Database Migration Service in the AWS Database Migration Service User Guide.

  • Username (string) --

    The user name you use to access the MongoDB source endpoint.

  • Password (string) --

    The password for the user account you use to access the MongoDB source endpoint.

  • ServerName (string) --

    The name of the server on the MongoDB source endpoint.

  • Port (integer) --

    The port value for the MongoDB source endpoint.

  • DatabaseName (string) --

    The database name on the MongoDB source endpoint.

  • AuthType (string) --

    The authentication type you use to access the MongoDB source endpoint.

    Valid values: NO, PASSWORD

    When NO is selected, user name and password parameters are not used and can be empty.

  • AuthMechanism (string) --

    The authentication mechanism you use to access the MongoDB source endpoint.

    Valid values: DEFAULT, MONGODB_CR, SCRAM_SHA_1

    DEFAULT – For MongoDB version 2.x, use MONGODB_CR. For MongoDB version 3.x, use SCRAM_SHA_1. This attribute is not used when authType=No.

  • NestingLevel (string) --

    Specifies either document or table mode.

    Valid values: NONE, ONE

    Default value is NONE. Specify NONE to use document mode. Specify ONE to use table mode.

  • ExtractDocId (string) --

    Specifies the document ID. Use this attribute when NestingLevel is set to NONE.

    Default value is false.

  • DocsToInvestigate (string) --

    Indicates the number of documents to preview to determine the document organization. Use this attribute when NestingLevel is set to ONE.

    Must be a positive value greater than 0. Default value is 1000.

  • AuthSource (string) --

    The MongoDB database name. This attribute is not used when authType=NO .

    The default is admin.

  • KmsKeyId (string) --

    The AWS KMS key identifier that is used to encrypt the content on the replication instance. If you don't specify a value for the KmsKeyId parameter, then AWS DMS uses your default encryption key. AWS KMS creates the default encryption key for your AWS account. Your AWS account has a different default encryption key for each AWS Region.

type KinesisSettings

dict

param KinesisSettings

Settings in JSON format for the target Amazon Kinesis Data Streams endpoint. For more information about the available settings, see Using Object Mapping to Migrate Data to a Kinesis Data Stream in the AWS Database Migration User Guide.

  • StreamArn (string) --

    The Amazon Resource Name (ARN) for the Amazon Kinesis Data Streams endpoint.

  • MessageFormat (string) --

    The output format for the records created on the endpoint. The message format is JSON .

  • ServiceAccessRoleArn (string) --

    The Amazon Resource Name (ARN) for the IAM role that DMS uses to write to the Amazon Kinesis data stream.

type ElasticsearchSettings

dict

param ElasticsearchSettings

Settings in JSON format for the target Elasticsearch endpoint. For more information about the available settings, see Extra Connection Attributes When Using Elasticsearch as a Target for AWS DMS in the AWS Database Migration User Guide.

  • ServiceAccessRoleArn (string) -- [REQUIRED]

    The Amazon Resource Name (ARN) used by service to access the IAM role.

  • EndpointUri (string) -- [REQUIRED]

    The endpoint for the Elasticsearch cluster.

  • FullLoadErrorPercentage (integer) --

    The maximum percentage of records that can fail to be written before a full load operation stops.

  • ErrorRetryDuration (integer) --

    The maximum number of seconds that DMS retries failed API requests to the Elasticsearch cluster.

type RedshiftSettings

dict

param RedshiftSettings
  • AcceptAnyDate (boolean) --

    A value that indicates to allow any date format, including invalid formats such as 00/00/00 00:00:00, to be loaded without generating an error. You can choose true or false (the default).

    This parameter applies only to TIMESTAMP and DATE columns. Always use ACCEPTANYDATE with the DATEFORMAT parameter. If the date format for the data doesn't match the DATEFORMAT specification, Amazon Redshift inserts a NULL value into that field.

  • AfterConnectScript (string) --

    Code to run after connecting. This parameter should contain the code itself, not the name of a file containing the code.

  • BucketFolder (string) --

    The location where the comma-separated value (.csv) files are stored before being uploaded to the S3 bucket.

  • BucketName (string) --

    The name of the S3 bucket you want to use

  • ConnectionTimeout (integer) --

    A value that sets the amount of time to wait (in milliseconds) before timing out, beginning from when you initially establish a connection.

  • DatabaseName (string) --

    The name of the Amazon Redshift data warehouse (service) that you are working with.

  • DateFormat (string) --

    The date format that you are using. Valid values are auto (case-sensitive), your date format string enclosed in quotes, or NULL. If this parameter is left unset (NULL), it defaults to a format of 'YYYY-MM-DD'. Using auto recognizes most strings, even some that aren't supported when you use a date format string.

    If your date and time values use formats different from each other, set this to auto .

  • EmptyAsNull (boolean) --

    A value that specifies whether AWS DMS should migrate empty CHAR and VARCHAR fields as NULL. A value of true sets empty CHAR and VARCHAR fields to null. The default is false .

  • EncryptionMode (string) --

    The type of server-side encryption that you want to use for your data. This encryption type is part of the endpoint settings or the extra connections attributes for Amazon S3. You can choose either SSE_S3 (the default) or SSE_KMS . To use SSE_S3 , create an AWS Identity and Access Management (IAM) role with a policy that allows "arn:aws:s3:::*" to use the following actions: "s3:PutObject", "s3:ListBucket"

  • FileTransferUploadStreams (integer) --

    The number of threads used to upload a single file. This parameter accepts a value from 1 through 64. It defaults to 10.

  • LoadTimeout (integer) --

    The amount of time to wait (in milliseconds) before timing out, beginning from when you begin loading.

  • MaxFileSize (integer) --

    The maximum size (in KB) of any .csv file used to transfer data to Amazon Redshift. This accepts a value from 1 through 1,048,576. It defaults to 32,768 KB (32 MB).

  • Password (string) --

    The password for the user named in the username property.

  • Port (integer) --

    The port number for Amazon Redshift. The default value is 5439.

  • RemoveQuotes (boolean) --

    A value that specifies to remove surrounding quotation marks from strings in the incoming data. All characters within the quotation marks, including delimiters, are retained. Choose true to remove quotation marks. The default is false .

  • ReplaceInvalidChars (string) --

    A list of characters that you want to replace. Use with ReplaceChars .

  • ReplaceChars (string) --

    A value that specifies to replaces the invalid characters specified in ReplaceInvalidChars , substituting the specified characters instead. The default is "?" .

  • ServerName (string) --

    The name of the Amazon Redshift cluster you are using.

  • ServiceAccessRoleArn (string) --

    The Amazon Resource Name (ARN) of the IAM role that has access to the Amazon Redshift service.

  • ServerSideEncryptionKmsKeyId (string) --

    The AWS KMS key ID. If you are using SSE_KMS for the EncryptionMode , provide this key ID. The key that you use needs an attached policy that enables IAM user permissions and allows use of the key.

  • TimeFormat (string) --

    The time format that you want to use. Valid values are auto (case-sensitive), 'timeformat_string' , 'epochsecs' , or 'epochmillisecs' . It defaults to 10. Using auto recognizes most strings, even some that aren't supported when you use a time format string.

    If your date and time values use formats different from each other, set this parameter to auto .

  • TrimBlanks (boolean) --

    A value that specifies to remove the trailing white space characters from a VARCHAR string. This parameter applies only to columns with a VARCHAR data type. Choose true to remove unneeded white space. The default is false .

  • TruncateColumns (boolean) --

    A value that specifies to truncate data in columns to the appropriate number of characters, so that the data fits in the column. This parameter applies only to columns with a VARCHAR or CHAR data type, and rows with a size of 4 MB or less. Choose true to truncate data. The default is false .

  • Username (string) --

    An Amazon Redshift user name for a registered user.

  • WriteBufferSize (integer) --

    The size of the write buffer to use in rows. Valid values range from 1 through 2,048. The default is 1,024. Use this setting to tune performance.

rtype

dict

returns

Response Syntax

{
    'Endpoint': {
        'EndpointIdentifier': 'string',
        'EndpointType': 'source'|'target',
        'EngineName': 'string',
        'EngineDisplayName': 'string',
        'Username': 'string',
        'ServerName': 'string',
        'Port': 123,
        'DatabaseName': 'string',
        'ExtraConnectionAttributes': 'string',
        'Status': 'string',
        'KmsKeyId': 'string',
        'EndpointArn': 'string',
        'CertificateArn': 'string',
        'SslMode': 'none'|'require'|'verify-ca'|'verify-full',
        'ServiceAccessRoleArn': 'string',
        'ExternalTableDefinition': 'string',
        'ExternalId': 'string',
        'DynamoDbSettings': {
            'ServiceAccessRoleArn': 'string'
        },
        'S3Settings': {
            'ServiceAccessRoleArn': 'string',
            'ExternalTableDefinition': 'string',
            'CsvRowDelimiter': 'string',
            'CsvDelimiter': 'string',
            'BucketFolder': 'string',
            'BucketName': 'string',
            'CompressionType': 'none'|'gzip',
            'EncryptionMode': 'sse-s3'|'sse-kms',
            'ServerSideEncryptionKmsKeyId': 'string',
            'DataFormat': 'csv'|'parquet',
            'EncodingType': 'plain'|'plain-dictionary'|'rle-dictionary',
            'DictPageSizeLimit': 123,
            'RowGroupLength': 123,
            'DataPageSize': 123,
            'ParquetVersion': 'parquet-1-0'|'parquet-2-0',
            'EnableStatistics': True|False,
            'IncludeOpForFullLoad': True|False,
            'CdcInsertsOnly': True|False,
            'TimestampColumnName': 'string'
        },
        'DmsTransferSettings': {
            'ServiceAccessRoleArn': 'string',
            'BucketName': 'string'
        },
        'MongoDbSettings': {
            'Username': 'string',
            'Password': 'string',
            'ServerName': 'string',
            'Port': 123,
            'DatabaseName': 'string',
            'AuthType': 'no'|'password',
            'AuthMechanism': 'default'|'mongodb_cr'|'scram_sha_1',
            'NestingLevel': 'none'|'one',
            'ExtractDocId': 'string',
            'DocsToInvestigate': 'string',
            'AuthSource': 'string',
            'KmsKeyId': 'string'
        },
        'KinesisSettings': {
            'StreamArn': 'string',
            'MessageFormat': 'json',
            'ServiceAccessRoleArn': 'string'
        },
        'ElasticsearchSettings': {
            'ServiceAccessRoleArn': 'string',
            'EndpointUri': 'string',
            'FullLoadErrorPercentage': 123,
            'ErrorRetryDuration': 123
        },
        'RedshiftSettings': {
            'AcceptAnyDate': True|False,
            'AfterConnectScript': 'string',
            'BucketFolder': 'string',
            'BucketName': 'string',
            'ConnectionTimeout': 123,
            'DatabaseName': 'string',
            'DateFormat': 'string',
            'EmptyAsNull': True|False,
            'EncryptionMode': 'sse-s3'|'sse-kms',
            'FileTransferUploadStreams': 123,
            'LoadTimeout': 123,
            'MaxFileSize': 123,
            'Password': 'string',
            'Port': 123,
            'RemoveQuotes': True|False,
            'ReplaceInvalidChars': 'string',
            'ReplaceChars': 'string',
            'ServerName': 'string',
            'ServiceAccessRoleArn': 'string',
            'ServerSideEncryptionKmsKeyId': 'string',
            'TimeFormat': 'string',
            'TrimBlanks': True|False,
            'TruncateColumns': True|False,
            'Username': 'string',
            'WriteBufferSize': 123
        }
    }
}

Response Structure

  • (dict) --

    • Endpoint (dict) --

      The endpoint that was created.

      • EndpointIdentifier (string) --

        The database endpoint identifier. Identifiers must begin with a letter; must contain only ASCII letters, digits, and hyphens; and must not end with a hyphen or contain two consecutive hyphens.

      • EndpointType (string) --

        The type of endpoint. Valid values are source and target .

      • EngineName (string) --

        The database engine name. Valid values, depending on the EndpointType, include mysql, oracle, postgres, mariadb, aurora, aurora-postgresql, redshift, s3, db2, azuredb, sybase, dynamodb, mongodb, and sqlserver.

      • EngineDisplayName (string) --

        The expanded name for the engine name. For example, if the EngineName parameter is "aurora," this value would be "Amazon Aurora MySQL."

      • Username (string) --

        The user name used to connect to the endpoint.

      • ServerName (string) --

        The name of the server at the endpoint.

      • Port (integer) --

        The port value used to access the endpoint.

      • DatabaseName (string) --

        The name of the database at the endpoint.

      • ExtraConnectionAttributes (string) --

        Additional connection attributes used to connect to the endpoint.

      • Status (string) --

        The status of the endpoint.

      • KmsKeyId (string) --

        An AWS KMS key identifier that is used to encrypt the connection parameters for the endpoint.

        If you don't specify a value for the KmsKeyId parameter, then AWS DMS uses your default encryption key.

        AWS KMS creates the default encryption key for your AWS account. Your AWS account has a different default encryption key for each AWS Region.

      • EndpointArn (string) --

        The Amazon Resource Name (ARN) string that uniquely identifies the endpoint.

      • CertificateArn (string) --

        The Amazon Resource Name (ARN) used for SSL connection to the endpoint.

      • SslMode (string) --

        The SSL mode used to connect to the endpoint. The default value is none .

      • ServiceAccessRoleArn (string) --

        The Amazon Resource Name (ARN) used by the service access IAM role.

      • ExternalTableDefinition (string) --

        The external table definition.

      • ExternalId (string) --

        Value returned by a call to CreateEndpoint that can be used for cross-account validation. Use it on a subsequent call to CreateEndpoint to create the endpoint with a cross-account.

      • DynamoDbSettings (dict) --

        The settings for the target DynamoDB database. For more information, see the DynamoDBSettings structure.

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) used by the service access IAM role.

      • S3Settings (dict) --

        The settings for the S3 target endpoint. For more information, see the S3Settings structure.

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) used by the service access IAM role.

        • ExternalTableDefinition (string) --

          The external table definition.

        • CsvRowDelimiter (string) --

          The delimiter used to separate rows in the source files. The default is a carriage return (\n ).

        • CsvDelimiter (string) --

          The delimiter used to separate columns in the source files. The default is a comma.

        • BucketFolder (string) --

          An optional parameter to set a folder name in the S3 bucket. If provided, tables are created in the path `` bucketFolder /schema_name /table_name /`` . If this parameter is not specified, then the path used is `` schema_name /table_name /`` .

        • BucketName (string) --

          The name of the S3 bucket.

        • CompressionType (string) --

          An optional parameter to use GZIP to compress the target files. Set to GZIP to compress the target files. Set to NONE (the default) or do not use to leave the files uncompressed. Applies to both .csv and .parquet file formats.

        • EncryptionMode (string) --

          The type of server-side encryption that you want to use for your data. This encryption type is part of the endpoint settings or the extra connections attributes for Amazon S3. You can choose either SSE_S3 (the default) or SSE_KMS . To use SSE_S3 , you need an AWS Identity and Access Management (IAM) role with permission to allow "arn:aws:s3:::dms-*" to use the following actions:

          • s3:CreateBucket

          • s3:ListBucket

          • s3:DeleteBucket

          • s3:GetBucketLocation

          • s3:GetObject

          • s3:PutObject

          • s3:DeleteObject

          • s3:GetObjectVersion

          • s3:GetBucketPolicy

          • s3:PutBucketPolicy

          • s3:DeleteBucketPolicy

        • ServerSideEncryptionKmsKeyId (string) --

          If you are using SSE_KMS for the EncryptionMode , provide the AWS KMS key ID. The key that you use needs an attached policy that enables AWS Identity and Access Management (IAM) user permissions and allows use of the key.

          Here is a CLI example: ``aws dms create-endpoint --endpoint-identifier value --endpoint-type target --engine-name s3 --s3-settings ServiceAccessRoleArn=*value* ,BucketFolder=*value* ,BucketName=*value* ,EncryptionMode=SSE_KMS,ServerSideEncryptionKmsKeyId=*value* ``

        • DataFormat (string) --

          The format of the data that you want to use for output. You can choose one of the following:

          • csv : This is a row-based file format with comma-separated values (.csv).

          • parquet : Apache Parquet (.parquet) is a columnar storage file format that features efficient compression and provides faster query response.

        • EncodingType (string) --

          The type of encoding you are using:

          • RLE_DICTIONARY uses a combination of bit-packing and run-length encoding to store repeated values more efficiently. This is the default.

          • PLAIN doesn't use encoding at all. Values are stored as they are.

          • PLAIN_DICTIONARY builds a dictionary of the values encountered in a given column. The dictionary is stored in a dictionary page for each column chunk.

        • DictPageSizeLimit (integer) --

          The maximum size of an encoded dictionary page of a column. If the dictionary page exceeds this, this column is stored using an encoding type of PLAIN . This parameter defaults to 1024 * 1024 bytes (1 MiB), the maximum size of a dictionary page before it reverts to PLAIN encoding. This size is used for .parquet file format only.

        • RowGroupLength (integer) --

          The number of rows in a row group. A smaller row group size provides faster reads. But as the number of row groups grows, the slower writes become. This parameter defaults to 10,000 rows. This number is used for .parquet file format only.

          If you choose a value larger than the maximum, RowGroupLength is set to the max row group length in bytes (64 * 1024 * 1024).

        • DataPageSize (integer) --

          The size of one data page in bytes. This parameter defaults to 1024 * 1024 bytes (1 MiB). This number is used for .parquet file format only.

        • ParquetVersion (string) --

          The version of the Apache Parquet format that you want to use: parquet_1_0 (the default) or parquet_2_0 .

        • EnableStatistics (boolean) --

          A value that enables statistics for Parquet pages and row groups. Choose true to enable statistics, false to disable. Statistics include NULL , DISTINCT , MAX , and MIN values. This parameter defaults to true . This value is used for .parquet file format only.

        • IncludeOpForFullLoad (boolean) --

          A value that enables a full load to write INSERT operations to the comma-separated value (.csv) output files only to indicate how the rows were added to the source database.

          Note

          AWS DMS supports IncludeOpForFullLoad in versions 3.1.4 and later.

          For full load, records can only be inserted. By default (the false setting), no information is recorded in these output files for a full load to indicate that the rows were inserted at the source database. If IncludeOpForFullLoad is set to true or y , the INSERT is recorded as an I annotation in the first field of the .csv file. This allows the format of your target records from a full load to be consistent with the target records from a CDC load.

          Note

          This setting works together with CdcInsertsOnly for output to .csv files only. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. .

        • CdcInsertsOnly (boolean) --

          A value that enables a change data capture (CDC) load to write only INSERT operations to .csv or columnar storage (.parquet) output files. By default (the false setting), the first field in a .csv or .parquet record contains the letter I (INSERT), U (UPDATE), or D (DELETE). These values indicate whether the row was inserted, updated, or deleted at the source database for a CDC load to the target.

          If cdcInsertsOnly is set to true or y , only INSERTs from the source database are migrated to the .csv or .parquet file. For .csv format only, how these INSERTs are recorded depends on the value of IncludeOpForFullLoad . If IncludeOpForFullLoad is set to true , the first field of every CDC record is set to I to indicate the INSERT operation at the source. If IncludeOpForFullLoad is set to false , every CDC record is written without a first field to indicate the INSERT operation at the source. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. .

          Note

          AWS DMS supports this interaction between CdcInsertsOnly and IncludeOpForFullLoad in versions 3.1.4 and later.

        • TimestampColumnName (string) --

          A value that includes a timestamp column in the Amazon S3 target endpoint data. AWS DMS includes an additional column in the migrated data when you set timestampColumnName to a non-blank value.

          Note

          AWS DMS supports TimestampColumnName in versions 3.1.4 and later.

          For a full load, each row of the timestamp column contains a timestamp for when the data was transferred from the source to the target by DMS. For a CDC load, each row of the timestamp column contains the timestamp for the commit of that row in the source database. The format for the timestamp column value is yyyy-MM-dd HH:mm:ss.SSSSSS . For CDC, the microsecond precision depends on the commit timestamp supported by DMS for the source database. When the AddColumnName setting is set to true , DMS also includes the name for the timestamp column that you set as the nonblank value of timestampColumnName .

      • DmsTransferSettings (dict) --

        The settings in JSON format for the DMS transfer type of source endpoint.

        Possible attributes include the following:

        • serviceAccessRoleArn - The IAM role that has permission to access the Amazon S3 bucket.

        • bucketName - The name of the S3 bucket to use.

        • compressionType - An optional parameter to use GZIP to compress the target files. To use GZIP, set this value to NONE (the default). To keep the files uncompressed, don't use this value.

        Shorthand syntax for these attributes is as follows: ServiceAccessRoleArn=string,BucketName=string,CompressionType=string

        JSON syntax for these attributes is as follows: { "ServiceAccessRoleArn": "string", "BucketName": "string", "CompressionType": "none"|"gzip" }

        • ServiceAccessRoleArn (string) --

          The IAM role that has permission to access the Amazon S3 bucket.

        • BucketName (string) --

          The name of the S3 bucket to use.

      • MongoDbSettings (dict) --

        The settings for the MongoDB source endpoint. For more information, see the MongoDbSettings structure.

        • Username (string) --

          The user name you use to access the MongoDB source endpoint.

        • Password (string) --

          The password for the user account you use to access the MongoDB source endpoint.

        • ServerName (string) --

          The name of the server on the MongoDB source endpoint.

        • Port (integer) --

          The port value for the MongoDB source endpoint.

        • DatabaseName (string) --

          The database name on the MongoDB source endpoint.

        • AuthType (string) --

          The authentication type you use to access the MongoDB source endpoint.

          Valid values: NO, PASSWORD

          When NO is selected, user name and password parameters are not used and can be empty.

        • AuthMechanism (string) --

          The authentication mechanism you use to access the MongoDB source endpoint.

          Valid values: DEFAULT, MONGODB_CR, SCRAM_SHA_1

          DEFAULT – For MongoDB version 2.x, use MONGODB_CR. For MongoDB version 3.x, use SCRAM_SHA_1. This attribute is not used when authType=No.

        • NestingLevel (string) --

          Specifies either document or table mode.

          Valid values: NONE, ONE

          Default value is NONE. Specify NONE to use document mode. Specify ONE to use table mode.

        • ExtractDocId (string) --

          Specifies the document ID. Use this attribute when NestingLevel is set to NONE.

          Default value is false.

        • DocsToInvestigate (string) --

          Indicates the number of documents to preview to determine the document organization. Use this attribute when NestingLevel is set to ONE.

          Must be a positive value greater than 0. Default value is 1000.

        • AuthSource (string) --

          The MongoDB database name. This attribute is not used when authType=NO .

          The default is admin.

        • KmsKeyId (string) --

          The AWS KMS key identifier that is used to encrypt the content on the replication instance. If you don't specify a value for the KmsKeyId parameter, then AWS DMS uses your default encryption key. AWS KMS creates the default encryption key for your AWS account. Your AWS account has a different default encryption key for each AWS Region.

      • KinesisSettings (dict) --

        The settings for the Amazon Kinesis source endpoint. For more information, see the KinesisSettings structure.

        • StreamArn (string) --

          The Amazon Resource Name (ARN) for the Amazon Kinesis Data Streams endpoint.

        • MessageFormat (string) --

          The output format for the records created on the endpoint. The message format is JSON .

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) for the IAM role that DMS uses to write to the Amazon Kinesis data stream.

      • ElasticsearchSettings (dict) --

        The settings for the Elasticsearch source endpoint. For more information, see the ElasticsearchSettings structure.

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) used by service to access the IAM role.

        • EndpointUri (string) --

          The endpoint for the Elasticsearch cluster.

        • FullLoadErrorPercentage (integer) --

          The maximum percentage of records that can fail to be written before a full load operation stops.

        • ErrorRetryDuration (integer) --

          The maximum number of seconds that DMS retries failed API requests to the Elasticsearch cluster.

      • RedshiftSettings (dict) --

        Settings for the Amazon Redshift endpoint.

        • AcceptAnyDate (boolean) --

          A value that indicates to allow any date format, including invalid formats such as 00/00/00 00:00:00, to be loaded without generating an error. You can choose true or false (the default).

          This parameter applies only to TIMESTAMP and DATE columns. Always use ACCEPTANYDATE with the DATEFORMAT parameter. If the date format for the data doesn't match the DATEFORMAT specification, Amazon Redshift inserts a NULL value into that field.

        • AfterConnectScript (string) --

          Code to run after connecting. This parameter should contain the code itself, not the name of a file containing the code.

        • BucketFolder (string) --

          The location where the comma-separated value (.csv) files are stored before being uploaded to the S3 bucket.

        • BucketName (string) --

          The name of the S3 bucket you want to use

        • ConnectionTimeout (integer) --

          A value that sets the amount of time to wait (in milliseconds) before timing out, beginning from when you initially establish a connection.

        • DatabaseName (string) --

          The name of the Amazon Redshift data warehouse (service) that you are working with.

        • DateFormat (string) --

          The date format that you are using. Valid values are auto (case-sensitive), your date format string enclosed in quotes, or NULL. If this parameter is left unset (NULL), it defaults to a format of 'YYYY-MM-DD'. Using auto recognizes most strings, even some that aren't supported when you use a date format string.

          If your date and time values use formats different from each other, set this to auto .

        • EmptyAsNull (boolean) --

          A value that specifies whether AWS DMS should migrate empty CHAR and VARCHAR fields as NULL. A value of true sets empty CHAR and VARCHAR fields to null. The default is false .

        • EncryptionMode (string) --

          The type of server-side encryption that you want to use for your data. This encryption type is part of the endpoint settings or the extra connections attributes for Amazon S3. You can choose either SSE_S3 (the default) or SSE_KMS . To use SSE_S3 , create an AWS Identity and Access Management (IAM) role with a policy that allows "arn:aws:s3:::*" to use the following actions: "s3:PutObject", "s3:ListBucket"

        • FileTransferUploadStreams (integer) --

          The number of threads used to upload a single file. This parameter accepts a value from 1 through 64. It defaults to 10.

        • LoadTimeout (integer) --

          The amount of time to wait (in milliseconds) before timing out, beginning from when you begin loading.

        • MaxFileSize (integer) --

          The maximum size (in KB) of any .csv file used to transfer data to Amazon Redshift. This accepts a value from 1 through 1,048,576. It defaults to 32,768 KB (32 MB).

        • Password (string) --

          The password for the user named in the username property.

        • Port (integer) --

          The port number for Amazon Redshift. The default value is 5439.

        • RemoveQuotes (boolean) --

          A value that specifies to remove surrounding quotation marks from strings in the incoming data. All characters within the quotation marks, including delimiters, are retained. Choose true to remove quotation marks. The default is false .

        • ReplaceInvalidChars (string) --

          A list of characters that you want to replace. Use with ReplaceChars .

        • ReplaceChars (string) --

          A value that specifies to replaces the invalid characters specified in ReplaceInvalidChars , substituting the specified characters instead. The default is "?" .

        • ServerName (string) --

          The name of the Amazon Redshift cluster you are using.

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) of the IAM role that has access to the Amazon Redshift service.

        • ServerSideEncryptionKmsKeyId (string) --

          The AWS KMS key ID. If you are using SSE_KMS for the EncryptionMode , provide this key ID. The key that you use needs an attached policy that enables IAM user permissions and allows use of the key.

        • TimeFormat (string) --

          The time format that you want to use. Valid values are auto (case-sensitive), 'timeformat_string' , 'epochsecs' , or 'epochmillisecs' . It defaults to 10. Using auto recognizes most strings, even some that aren't supported when you use a time format string.

          If your date and time values use formats different from each other, set this parameter to auto .

        • TrimBlanks (boolean) --

          A value that specifies to remove the trailing white space characters from a VARCHAR string. This parameter applies only to columns with a VARCHAR data type. Choose true to remove unneeded white space. The default is false .

        • TruncateColumns (boolean) --

          A value that specifies to truncate data in columns to the appropriate number of characters, so that the data fits in the column. This parameter applies only to columns with a VARCHAR or CHAR data type, and rows with a size of 4 MB or less. Choose true to truncate data. The default is false .

        • Username (string) --

          An Amazon Redshift user name for a registered user.

        • WriteBufferSize (integer) --

          The size of the write buffer to use in rows. Valid values range from 1 through 2,048. The default is 1,024. Use this setting to tune performance.

DeleteEndpoint (updated) Link ¶
Changes (response)
{'Endpoint': {'S3Settings': {'IncludeOpForFullLoad': 'boolean',
                             'TimestampColumnName': 'string'}}}

Deletes the specified endpoint.

Note

All tasks associated with the endpoint must be deleted before you can delete the endpoint.

See also: AWS API Documentation

Request Syntax

client.delete_endpoint(
    EndpointArn='string'
)
type EndpointArn

string

param EndpointArn

[REQUIRED]

The Amazon Resource Name (ARN) string that uniquely identifies the endpoint.

rtype

dict

returns

Response Syntax

{
    'Endpoint': {
        'EndpointIdentifier': 'string',
        'EndpointType': 'source'|'target',
        'EngineName': 'string',
        'EngineDisplayName': 'string',
        'Username': 'string',
        'ServerName': 'string',
        'Port': 123,
        'DatabaseName': 'string',
        'ExtraConnectionAttributes': 'string',
        'Status': 'string',
        'KmsKeyId': 'string',
        'EndpointArn': 'string',
        'CertificateArn': 'string',
        'SslMode': 'none'|'require'|'verify-ca'|'verify-full',
        'ServiceAccessRoleArn': 'string',
        'ExternalTableDefinition': 'string',
        'ExternalId': 'string',
        'DynamoDbSettings': {
            'ServiceAccessRoleArn': 'string'
        },
        'S3Settings': {
            'ServiceAccessRoleArn': 'string',
            'ExternalTableDefinition': 'string',
            'CsvRowDelimiter': 'string',
            'CsvDelimiter': 'string',
            'BucketFolder': 'string',
            'BucketName': 'string',
            'CompressionType': 'none'|'gzip',
            'EncryptionMode': 'sse-s3'|'sse-kms',
            'ServerSideEncryptionKmsKeyId': 'string',
            'DataFormat': 'csv'|'parquet',
            'EncodingType': 'plain'|'plain-dictionary'|'rle-dictionary',
            'DictPageSizeLimit': 123,
            'RowGroupLength': 123,
            'DataPageSize': 123,
            'ParquetVersion': 'parquet-1-0'|'parquet-2-0',
            'EnableStatistics': True|False,
            'IncludeOpForFullLoad': True|False,
            'CdcInsertsOnly': True|False,
            'TimestampColumnName': 'string'
        },
        'DmsTransferSettings': {
            'ServiceAccessRoleArn': 'string',
            'BucketName': 'string'
        },
        'MongoDbSettings': {
            'Username': 'string',
            'Password': 'string',
            'ServerName': 'string',
            'Port': 123,
            'DatabaseName': 'string',
            'AuthType': 'no'|'password',
            'AuthMechanism': 'default'|'mongodb_cr'|'scram_sha_1',
            'NestingLevel': 'none'|'one',
            'ExtractDocId': 'string',
            'DocsToInvestigate': 'string',
            'AuthSource': 'string',
            'KmsKeyId': 'string'
        },
        'KinesisSettings': {
            'StreamArn': 'string',
            'MessageFormat': 'json',
            'ServiceAccessRoleArn': 'string'
        },
        'ElasticsearchSettings': {
            'ServiceAccessRoleArn': 'string',
            'EndpointUri': 'string',
            'FullLoadErrorPercentage': 123,
            'ErrorRetryDuration': 123
        },
        'RedshiftSettings': {
            'AcceptAnyDate': True|False,
            'AfterConnectScript': 'string',
            'BucketFolder': 'string',
            'BucketName': 'string',
            'ConnectionTimeout': 123,
            'DatabaseName': 'string',
            'DateFormat': 'string',
            'EmptyAsNull': True|False,
            'EncryptionMode': 'sse-s3'|'sse-kms',
            'FileTransferUploadStreams': 123,
            'LoadTimeout': 123,
            'MaxFileSize': 123,
            'Password': 'string',
            'Port': 123,
            'RemoveQuotes': True|False,
            'ReplaceInvalidChars': 'string',
            'ReplaceChars': 'string',
            'ServerName': 'string',
            'ServiceAccessRoleArn': 'string',
            'ServerSideEncryptionKmsKeyId': 'string',
            'TimeFormat': 'string',
            'TrimBlanks': True|False,
            'TruncateColumns': True|False,
            'Username': 'string',
            'WriteBufferSize': 123
        }
    }
}

Response Structure

  • (dict) --

    • Endpoint (dict) --

      The endpoint that was deleted.

      • EndpointIdentifier (string) --

        The database endpoint identifier. Identifiers must begin with a letter; must contain only ASCII letters, digits, and hyphens; and must not end with a hyphen or contain two consecutive hyphens.

      • EndpointType (string) --

        The type of endpoint. Valid values are source and target .

      • EngineName (string) --

        The database engine name. Valid values, depending on the EndpointType, include mysql, oracle, postgres, mariadb, aurora, aurora-postgresql, redshift, s3, db2, azuredb, sybase, dynamodb, mongodb, and sqlserver.

      • EngineDisplayName (string) --

        The expanded name for the engine name. For example, if the EngineName parameter is "aurora," this value would be "Amazon Aurora MySQL."

      • Username (string) --

        The user name used to connect to the endpoint.

      • ServerName (string) --

        The name of the server at the endpoint.

      • Port (integer) --

        The port value used to access the endpoint.

      • DatabaseName (string) --

        The name of the database at the endpoint.

      • ExtraConnectionAttributes (string) --

        Additional connection attributes used to connect to the endpoint.

      • Status (string) --

        The status of the endpoint.

      • KmsKeyId (string) --

        An AWS KMS key identifier that is used to encrypt the connection parameters for the endpoint.

        If you don't specify a value for the KmsKeyId parameter, then AWS DMS uses your default encryption key.

        AWS KMS creates the default encryption key for your AWS account. Your AWS account has a different default encryption key for each AWS Region.

      • EndpointArn (string) --

        The Amazon Resource Name (ARN) string that uniquely identifies the endpoint.

      • CertificateArn (string) --

        The Amazon Resource Name (ARN) used for SSL connection to the endpoint.

      • SslMode (string) --

        The SSL mode used to connect to the endpoint. The default value is none .

      • ServiceAccessRoleArn (string) --

        The Amazon Resource Name (ARN) used by the service access IAM role.

      • ExternalTableDefinition (string) --

        The external table definition.

      • ExternalId (string) --

        Value returned by a call to CreateEndpoint that can be used for cross-account validation. Use it on a subsequent call to CreateEndpoint to create the endpoint with a cross-account.

      • DynamoDbSettings (dict) --

        The settings for the target DynamoDB database. For more information, see the DynamoDBSettings structure.

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) used by the service access IAM role.

      • S3Settings (dict) --

        The settings for the S3 target endpoint. For more information, see the S3Settings structure.

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) used by the service access IAM role.

        • ExternalTableDefinition (string) --

          The external table definition.

        • CsvRowDelimiter (string) --

          The delimiter used to separate rows in the source files. The default is a carriage return (\n ).

        • CsvDelimiter (string) --

          The delimiter used to separate columns in the source files. The default is a comma.

        • BucketFolder (string) --

          An optional parameter to set a folder name in the S3 bucket. If provided, tables are created in the path `` bucketFolder /schema_name /table_name /`` . If this parameter is not specified, then the path used is `` schema_name /table_name /`` .

        • BucketName (string) --

          The name of the S3 bucket.

        • CompressionType (string) --

          An optional parameter to use GZIP to compress the target files. Set to GZIP to compress the target files. Set to NONE (the default) or do not use to leave the files uncompressed. Applies to both .csv and .parquet file formats.

        • EncryptionMode (string) --

          The type of server-side encryption that you want to use for your data. This encryption type is part of the endpoint settings or the extra connections attributes for Amazon S3. You can choose either SSE_S3 (the default) or SSE_KMS . To use SSE_S3 , you need an AWS Identity and Access Management (IAM) role with permission to allow "arn:aws:s3:::dms-*" to use the following actions:

          • s3:CreateBucket

          • s3:ListBucket

          • s3:DeleteBucket

          • s3:GetBucketLocation

          • s3:GetObject

          • s3:PutObject

          • s3:DeleteObject

          • s3:GetObjectVersion

          • s3:GetBucketPolicy

          • s3:PutBucketPolicy

          • s3:DeleteBucketPolicy

        • ServerSideEncryptionKmsKeyId (string) --

          If you are using SSE_KMS for the EncryptionMode , provide the AWS KMS key ID. The key that you use needs an attached policy that enables AWS Identity and Access Management (IAM) user permissions and allows use of the key.

          Here is a CLI example: ``aws dms create-endpoint --endpoint-identifier value --endpoint-type target --engine-name s3 --s3-settings ServiceAccessRoleArn=*value* ,BucketFolder=*value* ,BucketName=*value* ,EncryptionMode=SSE_KMS,ServerSideEncryptionKmsKeyId=*value* ``

        • DataFormat (string) --

          The format of the data that you want to use for output. You can choose one of the following:

          • csv : This is a row-based file format with comma-separated values (.csv).

          • parquet : Apache Parquet (.parquet) is a columnar storage file format that features efficient compression and provides faster query response.

        • EncodingType (string) --

          The type of encoding you are using:

          • RLE_DICTIONARY uses a combination of bit-packing and run-length encoding to store repeated values more efficiently. This is the default.

          • PLAIN doesn't use encoding at all. Values are stored as they are.

          • PLAIN_DICTIONARY builds a dictionary of the values encountered in a given column. The dictionary is stored in a dictionary page for each column chunk.

        • DictPageSizeLimit (integer) --

          The maximum size of an encoded dictionary page of a column. If the dictionary page exceeds this, this column is stored using an encoding type of PLAIN . This parameter defaults to 1024 * 1024 bytes (1 MiB), the maximum size of a dictionary page before it reverts to PLAIN encoding. This size is used for .parquet file format only.

        • RowGroupLength (integer) --

          The number of rows in a row group. A smaller row group size provides faster reads. But as the number of row groups grows, the slower writes become. This parameter defaults to 10,000 rows. This number is used for .parquet file format only.

          If you choose a value larger than the maximum, RowGroupLength is set to the max row group length in bytes (64 * 1024 * 1024).

        • DataPageSize (integer) --

          The size of one data page in bytes. This parameter defaults to 1024 * 1024 bytes (1 MiB). This number is used for .parquet file format only.

        • ParquetVersion (string) --

          The version of the Apache Parquet format that you want to use: parquet_1_0 (the default) or parquet_2_0 .

        • EnableStatistics (boolean) --

          A value that enables statistics for Parquet pages and row groups. Choose true to enable statistics, false to disable. Statistics include NULL , DISTINCT , MAX , and MIN values. This parameter defaults to true . This value is used for .parquet file format only.

        • IncludeOpForFullLoad (boolean) --

          A value that enables a full load to write INSERT operations to the comma-separated value (.csv) output files only to indicate how the rows were added to the source database.

          Note

          AWS DMS supports IncludeOpForFullLoad in versions 3.1.4 and later.

          For full load, records can only be inserted. By default (the false setting), no information is recorded in these output files for a full load to indicate that the rows were inserted at the source database. If IncludeOpForFullLoad is set to true or y , the INSERT is recorded as an I annotation in the first field of the .csv file. This allows the format of your target records from a full load to be consistent with the target records from a CDC load.

          Note

          This setting works together with CdcInsertsOnly for output to .csv files only. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. .

        • CdcInsertsOnly (boolean) --

          A value that enables a change data capture (CDC) load to write only INSERT operations to .csv or columnar storage (.parquet) output files. By default (the false setting), the first field in a .csv or .parquet record contains the letter I (INSERT), U (UPDATE), or D (DELETE). These values indicate whether the row was inserted, updated, or deleted at the source database for a CDC load to the target.

          If cdcInsertsOnly is set to true or y , only INSERTs from the source database are migrated to the .csv or .parquet file. For .csv format only, how these INSERTs are recorded depends on the value of IncludeOpForFullLoad . If IncludeOpForFullLoad is set to true , the first field of every CDC record is set to I to indicate the INSERT operation at the source. If IncludeOpForFullLoad is set to false , every CDC record is written without a first field to indicate the INSERT operation at the source. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. .

          Note

          AWS DMS supports this interaction between CdcInsertsOnly and IncludeOpForFullLoad in versions 3.1.4 and later.

        • TimestampColumnName (string) --

          A value that includes a timestamp column in the Amazon S3 target endpoint data. AWS DMS includes an additional column in the migrated data when you set timestampColumnName to a non-blank value.

          Note

          AWS DMS supports TimestampColumnName in versions 3.1.4 and later.

          For a full load, each row of the timestamp column contains a timestamp for when the data was transferred from the source to the target by DMS. For a CDC load, each row of the timestamp column contains the timestamp for the commit of that row in the source database. The format for the timestamp column value is yyyy-MM-dd HH:mm:ss.SSSSSS . For CDC, the microsecond precision depends on the commit timestamp supported by DMS for the source database. When the AddColumnName setting is set to true , DMS also includes the name for the timestamp column that you set as the nonblank value of timestampColumnName .

      • DmsTransferSettings (dict) --

        The settings in JSON format for the DMS transfer type of source endpoint.

        Possible attributes include the following:

        • serviceAccessRoleArn - The IAM role that has permission to access the Amazon S3 bucket.

        • bucketName - The name of the S3 bucket to use.

        • compressionType - An optional parameter to use GZIP to compress the target files. To use GZIP, set this value to NONE (the default). To keep the files uncompressed, don't use this value.

        Shorthand syntax for these attributes is as follows: ServiceAccessRoleArn=string,BucketName=string,CompressionType=string

        JSON syntax for these attributes is as follows: { "ServiceAccessRoleArn": "string", "BucketName": "string", "CompressionType": "none"|"gzip" }

        • ServiceAccessRoleArn (string) --

          The IAM role that has permission to access the Amazon S3 bucket.

        • BucketName (string) --

          The name of the S3 bucket to use.

      • MongoDbSettings (dict) --

        The settings for the MongoDB source endpoint. For more information, see the MongoDbSettings structure.

        • Username (string) --

          The user name you use to access the MongoDB source endpoint.

        • Password (string) --

          The password for the user account you use to access the MongoDB source endpoint.

        • ServerName (string) --

          The name of the server on the MongoDB source endpoint.

        • Port (integer) --

          The port value for the MongoDB source endpoint.

        • DatabaseName (string) --

          The database name on the MongoDB source endpoint.

        • AuthType (string) --

          The authentication type you use to access the MongoDB source endpoint.

          Valid values: NO, PASSWORD

          When NO is selected, user name and password parameters are not used and can be empty.

        • AuthMechanism (string) --

          The authentication mechanism you use to access the MongoDB source endpoint.

          Valid values: DEFAULT, MONGODB_CR, SCRAM_SHA_1

          DEFAULT – For MongoDB version 2.x, use MONGODB_CR. For MongoDB version 3.x, use SCRAM_SHA_1. This attribute is not used when authType=No.

        • NestingLevel (string) --

          Specifies either document or table mode.

          Valid values: NONE, ONE

          Default value is NONE. Specify NONE to use document mode. Specify ONE to use table mode.

        • ExtractDocId (string) --

          Specifies the document ID. Use this attribute when NestingLevel is set to NONE.

          Default value is false.

        • DocsToInvestigate (string) --

          Indicates the number of documents to preview to determine the document organization. Use this attribute when NestingLevel is set to ONE.

          Must be a positive value greater than 0. Default value is 1000.

        • AuthSource (string) --

          The MongoDB database name. This attribute is not used when authType=NO .

          The default is admin.

        • KmsKeyId (string) --

          The AWS KMS key identifier that is used to encrypt the content on the replication instance. If you don't specify a value for the KmsKeyId parameter, then AWS DMS uses your default encryption key. AWS KMS creates the default encryption key for your AWS account. Your AWS account has a different default encryption key for each AWS Region.

      • KinesisSettings (dict) --

        The settings for the Amazon Kinesis source endpoint. For more information, see the KinesisSettings structure.

        • StreamArn (string) --

          The Amazon Resource Name (ARN) for the Amazon Kinesis Data Streams endpoint.

        • MessageFormat (string) --

          The output format for the records created on the endpoint. The message format is JSON .

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) for the IAM role that DMS uses to write to the Amazon Kinesis data stream.

      • ElasticsearchSettings (dict) --

        The settings for the Elasticsearch source endpoint. For more information, see the ElasticsearchSettings structure.

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) used by service to access the IAM role.

        • EndpointUri (string) --

          The endpoint for the Elasticsearch cluster.

        • FullLoadErrorPercentage (integer) --

          The maximum percentage of records that can fail to be written before a full load operation stops.

        • ErrorRetryDuration (integer) --

          The maximum number of seconds that DMS retries failed API requests to the Elasticsearch cluster.

      • RedshiftSettings (dict) --

        Settings for the Amazon Redshift endpoint.

        • AcceptAnyDate (boolean) --

          A value that indicates to allow any date format, including invalid formats such as 00/00/00 00:00:00, to be loaded without generating an error. You can choose true or false (the default).

          This parameter applies only to TIMESTAMP and DATE columns. Always use ACCEPTANYDATE with the DATEFORMAT parameter. If the date format for the data doesn't match the DATEFORMAT specification, Amazon Redshift inserts a NULL value into that field.

        • AfterConnectScript (string) --

          Code to run after connecting. This parameter should contain the code itself, not the name of a file containing the code.

        • BucketFolder (string) --

          The location where the comma-separated value (.csv) files are stored before being uploaded to the S3 bucket.

        • BucketName (string) --

          The name of the S3 bucket you want to use

        • ConnectionTimeout (integer) --

          A value that sets the amount of time to wait (in milliseconds) before timing out, beginning from when you initially establish a connection.

        • DatabaseName (string) --

          The name of the Amazon Redshift data warehouse (service) that you are working with.

        • DateFormat (string) --

          The date format that you are using. Valid values are auto (case-sensitive), your date format string enclosed in quotes, or NULL. If this parameter is left unset (NULL), it defaults to a format of 'YYYY-MM-DD'. Using auto recognizes most strings, even some that aren't supported when you use a date format string.

          If your date and time values use formats different from each other, set this to auto .

        • EmptyAsNull (boolean) --

          A value that specifies whether AWS DMS should migrate empty CHAR and VARCHAR fields as NULL. A value of true sets empty CHAR and VARCHAR fields to null. The default is false .

        • EncryptionMode (string) --

          The type of server-side encryption that you want to use for your data. This encryption type is part of the endpoint settings or the extra connections attributes for Amazon S3. You can choose either SSE_S3 (the default) or SSE_KMS . To use SSE_S3 , create an AWS Identity and Access Management (IAM) role with a policy that allows "arn:aws:s3:::*" to use the following actions: "s3:PutObject", "s3:ListBucket"

        • FileTransferUploadStreams (integer) --

          The number of threads used to upload a single file. This parameter accepts a value from 1 through 64. It defaults to 10.

        • LoadTimeout (integer) --

          The amount of time to wait (in milliseconds) before timing out, beginning from when you begin loading.

        • MaxFileSize (integer) --

          The maximum size (in KB) of any .csv file used to transfer data to Amazon Redshift. This accepts a value from 1 through 1,048,576. It defaults to 32,768 KB (32 MB).

        • Password (string) --

          The password for the user named in the username property.

        • Port (integer) --

          The port number for Amazon Redshift. The default value is 5439.

        • RemoveQuotes (boolean) --

          A value that specifies to remove surrounding quotation marks from strings in the incoming data. All characters within the quotation marks, including delimiters, are retained. Choose true to remove quotation marks. The default is false .

        • ReplaceInvalidChars (string) --

          A list of characters that you want to replace. Use with ReplaceChars .

        • ReplaceChars (string) --

          A value that specifies to replaces the invalid characters specified in ReplaceInvalidChars , substituting the specified characters instead. The default is "?" .

        • ServerName (string) --

          The name of the Amazon Redshift cluster you are using.

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) of the IAM role that has access to the Amazon Redshift service.

        • ServerSideEncryptionKmsKeyId (string) --

          The AWS KMS key ID. If you are using SSE_KMS for the EncryptionMode , provide this key ID. The key that you use needs an attached policy that enables IAM user permissions and allows use of the key.

        • TimeFormat (string) --

          The time format that you want to use. Valid values are auto (case-sensitive), 'timeformat_string' , 'epochsecs' , or 'epochmillisecs' . It defaults to 10. Using auto recognizes most strings, even some that aren't supported when you use a time format string.

          If your date and time values use formats different from each other, set this parameter to auto .

        • TrimBlanks (boolean) --

          A value that specifies to remove the trailing white space characters from a VARCHAR string. This parameter applies only to columns with a VARCHAR data type. Choose true to remove unneeded white space. The default is false .

        • TruncateColumns (boolean) --

          A value that specifies to truncate data in columns to the appropriate number of characters, so that the data fits in the column. This parameter applies only to columns with a VARCHAR or CHAR data type, and rows with a size of 4 MB or less. Choose true to truncate data. The default is false .

        • Username (string) --

          An Amazon Redshift user name for a registered user.

        • WriteBufferSize (integer) --

          The size of the write buffer to use in rows. Valid values range from 1 through 2,048. The default is 1,024. Use this setting to tune performance.

DescribeAccountAttributes (updated) Link ¶
Changes (response)
{'UniqueAccountIdentifier': 'string'}

Lists all of the AWS DMS attributes for a customer account. These attributes include AWS DMS quotas for the account and a unique account identifier in a particular DMS region. DMS quotas include a list of resource quotas supported by the account, such as the number of replication instances allowed. The description for each resource quota, includes the quota name, current usage toward that quota, and the quota's maximum value. DMS uses the unique account identifier to name each artifact used by DMS in the given region.

This command does not take any parameters.

See also: AWS API Documentation

Request Syntax

client.describe_account_attributes()
rtype

dict

returns

Response Syntax

{
    'AccountQuotas': [
        {
            'AccountQuotaName': 'string',
            'Used': 123,
            'Max': 123
        },
    ],
    'UniqueAccountIdentifier': 'string'
}

Response Structure

  • (dict) --

    • AccountQuotas (list) --

      Account quota information.

      • (dict) --

        Describes a quota for an AWS account, for example, the number of replication instances allowed.

        • AccountQuotaName (string) --

          The name of the AWS DMS quota for this AWS account.

        • Used (integer) --

          The amount currently used toward the quota maximum.

        • Max (integer) --

          The maximum allowed value for the quota.

    • UniqueAccountIdentifier (string) --

      A unique AWS DMS identifier for an account in a particular AWS Region. The value of this identifier has the following format: c99999999999 . DMS uses this identifier to name artifacts. For example, DMS uses this identifier to name the default Amazon S3 bucket for storing task assessment reports in a given AWS Region. The format of this S3 bucket name is the following: dms-*AccountNumber* -*UniqueAccountIdentifier* . Here is an example name for this default S3 bucket: dms-111122223333-c44445555666 .

      Note

      AWS DMS supports UniqueAccountIdentifier in versions 3.1.4 and later.

DescribeEndpoints (updated) Link ¶
Changes (response)
{'Endpoints': {'S3Settings': {'IncludeOpForFullLoad': 'boolean',
                              'TimestampColumnName': 'string'}}}

Returns information about the endpoints for your account in the current region.

See also: AWS API Documentation

Request Syntax

client.describe_endpoints(
    Filters=[
        {
            'Name': 'string',
            'Values': [
                'string',
            ]
        },
    ],
    MaxRecords=123,
    Marker='string'
)
type Filters

list

param Filters

Filters applied to the describe action.

Valid filter names: endpoint-arn | endpoint-type | endpoint-id | engine-name

  • (dict) --

    • Name (string) -- [REQUIRED]

      The name of the filter.

    • Values (list) -- [REQUIRED]

      The filter value.

      • (string) --

type MaxRecords

integer

param MaxRecords

The maximum number of records to include in the response. If more records exist than the specified MaxRecords value, a pagination token called a marker is included in the response so that the remaining results can be retrieved.

Default: 100

Constraints: Minimum 20, maximum 100.

type Marker

string

param Marker

An optional pagination token provided by a previous request. If this parameter is specified, the response includes only records beyond the marker, up to the value specified by MaxRecords .

rtype

dict

returns

Response Syntax

{
    'Marker': 'string',
    'Endpoints': [
        {
            'EndpointIdentifier': 'string',
            'EndpointType': 'source'|'target',
            'EngineName': 'string',
            'EngineDisplayName': 'string',
            'Username': 'string',
            'ServerName': 'string',
            'Port': 123,
            'DatabaseName': 'string',
            'ExtraConnectionAttributes': 'string',
            'Status': 'string',
            'KmsKeyId': 'string',
            'EndpointArn': 'string',
            'CertificateArn': 'string',
            'SslMode': 'none'|'require'|'verify-ca'|'verify-full',
            'ServiceAccessRoleArn': 'string',
            'ExternalTableDefinition': 'string',
            'ExternalId': 'string',
            'DynamoDbSettings': {
                'ServiceAccessRoleArn': 'string'
            },
            'S3Settings': {
                'ServiceAccessRoleArn': 'string',
                'ExternalTableDefinition': 'string',
                'CsvRowDelimiter': 'string',
                'CsvDelimiter': 'string',
                'BucketFolder': 'string',
                'BucketName': 'string',
                'CompressionType': 'none'|'gzip',
                'EncryptionMode': 'sse-s3'|'sse-kms',
                'ServerSideEncryptionKmsKeyId': 'string',
                'DataFormat': 'csv'|'parquet',
                'EncodingType': 'plain'|'plain-dictionary'|'rle-dictionary',
                'DictPageSizeLimit': 123,
                'RowGroupLength': 123,
                'DataPageSize': 123,
                'ParquetVersion': 'parquet-1-0'|'parquet-2-0',
                'EnableStatistics': True|False,
                'IncludeOpForFullLoad': True|False,
                'CdcInsertsOnly': True|False,
                'TimestampColumnName': 'string'
            },
            'DmsTransferSettings': {
                'ServiceAccessRoleArn': 'string',
                'BucketName': 'string'
            },
            'MongoDbSettings': {
                'Username': 'string',
                'Password': 'string',
                'ServerName': 'string',
                'Port': 123,
                'DatabaseName': 'string',
                'AuthType': 'no'|'password',
                'AuthMechanism': 'default'|'mongodb_cr'|'scram_sha_1',
                'NestingLevel': 'none'|'one',
                'ExtractDocId': 'string',
                'DocsToInvestigate': 'string',
                'AuthSource': 'string',
                'KmsKeyId': 'string'
            },
            'KinesisSettings': {
                'StreamArn': 'string',
                'MessageFormat': 'json',
                'ServiceAccessRoleArn': 'string'
            },
            'ElasticsearchSettings': {
                'ServiceAccessRoleArn': 'string',
                'EndpointUri': 'string',
                'FullLoadErrorPercentage': 123,
                'ErrorRetryDuration': 123
            },
            'RedshiftSettings': {
                'AcceptAnyDate': True|False,
                'AfterConnectScript': 'string',
                'BucketFolder': 'string',
                'BucketName': 'string',
                'ConnectionTimeout': 123,
                'DatabaseName': 'string',
                'DateFormat': 'string',
                'EmptyAsNull': True|False,
                'EncryptionMode': 'sse-s3'|'sse-kms',
                'FileTransferUploadStreams': 123,
                'LoadTimeout': 123,
                'MaxFileSize': 123,
                'Password': 'string',
                'Port': 123,
                'RemoveQuotes': True|False,
                'ReplaceInvalidChars': 'string',
                'ReplaceChars': 'string',
                'ServerName': 'string',
                'ServiceAccessRoleArn': 'string',
                'ServerSideEncryptionKmsKeyId': 'string',
                'TimeFormat': 'string',
                'TrimBlanks': True|False,
                'TruncateColumns': True|False,
                'Username': 'string',
                'WriteBufferSize': 123
            }
        },
    ]
}

Response Structure

  • (dict) --

    • Marker (string) --

      An optional pagination token provided by a previous request. If this parameter is specified, the response includes only records beyond the marker, up to the value specified by MaxRecords .

    • Endpoints (list) --

      Endpoint description.

      • (dict) --

        • EndpointIdentifier (string) --

          The database endpoint identifier. Identifiers must begin with a letter; must contain only ASCII letters, digits, and hyphens; and must not end with a hyphen or contain two consecutive hyphens.

        • EndpointType (string) --

          The type of endpoint. Valid values are source and target .

        • EngineName (string) --

          The database engine name. Valid values, depending on the EndpointType, include mysql, oracle, postgres, mariadb, aurora, aurora-postgresql, redshift, s3, db2, azuredb, sybase, dynamodb, mongodb, and sqlserver.

        • EngineDisplayName (string) --

          The expanded name for the engine name. For example, if the EngineName parameter is "aurora," this value would be "Amazon Aurora MySQL."

        • Username (string) --

          The user name used to connect to the endpoint.

        • ServerName (string) --

          The name of the server at the endpoint.

        • Port (integer) --

          The port value used to access the endpoint.

        • DatabaseName (string) --

          The name of the database at the endpoint.

        • ExtraConnectionAttributes (string) --

          Additional connection attributes used to connect to the endpoint.

        • Status (string) --

          The status of the endpoint.

        • KmsKeyId (string) --

          An AWS KMS key identifier that is used to encrypt the connection parameters for the endpoint.

          If you don't specify a value for the KmsKeyId parameter, then AWS DMS uses your default encryption key.

          AWS KMS creates the default encryption key for your AWS account. Your AWS account has a different default encryption key for each AWS Region.

        • EndpointArn (string) --

          The Amazon Resource Name (ARN) string that uniquely identifies the endpoint.

        • CertificateArn (string) --

          The Amazon Resource Name (ARN) used for SSL connection to the endpoint.

        • SslMode (string) --

          The SSL mode used to connect to the endpoint. The default value is none .

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) used by the service access IAM role.

        • ExternalTableDefinition (string) --

          The external table definition.

        • ExternalId (string) --

          Value returned by a call to CreateEndpoint that can be used for cross-account validation. Use it on a subsequent call to CreateEndpoint to create the endpoint with a cross-account.

        • DynamoDbSettings (dict) --

          The settings for the target DynamoDB database. For more information, see the DynamoDBSettings structure.

          • ServiceAccessRoleArn (string) --

            The Amazon Resource Name (ARN) used by the service access IAM role.

        • S3Settings (dict) --

          The settings for the S3 target endpoint. For more information, see the S3Settings structure.

          • ServiceAccessRoleArn (string) --

            The Amazon Resource Name (ARN) used by the service access IAM role.

          • ExternalTableDefinition (string) --

            The external table definition.

          • CsvRowDelimiter (string) --

            The delimiter used to separate rows in the source files. The default is a carriage return (\n ).

          • CsvDelimiter (string) --

            The delimiter used to separate columns in the source files. The default is a comma.

          • BucketFolder (string) --

            An optional parameter to set a folder name in the S3 bucket. If provided, tables are created in the path `` bucketFolder /schema_name /table_name /`` . If this parameter is not specified, then the path used is `` schema_name /table_name /`` .

          • BucketName (string) --

            The name of the S3 bucket.

          • CompressionType (string) --

            An optional parameter to use GZIP to compress the target files. Set to GZIP to compress the target files. Set to NONE (the default) or do not use to leave the files uncompressed. Applies to both .csv and .parquet file formats.

          • EncryptionMode (string) --

            The type of server-side encryption that you want to use for your data. This encryption type is part of the endpoint settings or the extra connections attributes for Amazon S3. You can choose either SSE_S3 (the default) or SSE_KMS . To use SSE_S3 , you need an AWS Identity and Access Management (IAM) role with permission to allow "arn:aws:s3:::dms-*" to use the following actions:

            • s3:CreateBucket

            • s3:ListBucket

            • s3:DeleteBucket

            • s3:GetBucketLocation

            • s3:GetObject

            • s3:PutObject

            • s3:DeleteObject

            • s3:GetObjectVersion

            • s3:GetBucketPolicy

            • s3:PutBucketPolicy

            • s3:DeleteBucketPolicy

          • ServerSideEncryptionKmsKeyId (string) --

            If you are using SSE_KMS for the EncryptionMode , provide the AWS KMS key ID. The key that you use needs an attached policy that enables AWS Identity and Access Management (IAM) user permissions and allows use of the key.

            Here is a CLI example: ``aws dms create-endpoint --endpoint-identifier value --endpoint-type target --engine-name s3 --s3-settings ServiceAccessRoleArn=*value* ,BucketFolder=*value* ,BucketName=*value* ,EncryptionMode=SSE_KMS,ServerSideEncryptionKmsKeyId=*value* ``

          • DataFormat (string) --

            The format of the data that you want to use for output. You can choose one of the following:

            • csv : This is a row-based file format with comma-separated values (.csv).

            • parquet : Apache Parquet (.parquet) is a columnar storage file format that features efficient compression and provides faster query response.

          • EncodingType (string) --

            The type of encoding you are using:

            • RLE_DICTIONARY uses a combination of bit-packing and run-length encoding to store repeated values more efficiently. This is the default.

            • PLAIN doesn't use encoding at all. Values are stored as they are.

            • PLAIN_DICTIONARY builds a dictionary of the values encountered in a given column. The dictionary is stored in a dictionary page for each column chunk.

          • DictPageSizeLimit (integer) --

            The maximum size of an encoded dictionary page of a column. If the dictionary page exceeds this, this column is stored using an encoding type of PLAIN . This parameter defaults to 1024 * 1024 bytes (1 MiB), the maximum size of a dictionary page before it reverts to PLAIN encoding. This size is used for .parquet file format only.

          • RowGroupLength (integer) --

            The number of rows in a row group. A smaller row group size provides faster reads. But as the number of row groups grows, the slower writes become. This parameter defaults to 10,000 rows. This number is used for .parquet file format only.

            If you choose a value larger than the maximum, RowGroupLength is set to the max row group length in bytes (64 * 1024 * 1024).

          • DataPageSize (integer) --

            The size of one data page in bytes. This parameter defaults to 1024 * 1024 bytes (1 MiB). This number is used for .parquet file format only.

          • ParquetVersion (string) --

            The version of the Apache Parquet format that you want to use: parquet_1_0 (the default) or parquet_2_0 .

          • EnableStatistics (boolean) --

            A value that enables statistics for Parquet pages and row groups. Choose true to enable statistics, false to disable. Statistics include NULL , DISTINCT , MAX , and MIN values. This parameter defaults to true . This value is used for .parquet file format only.

          • IncludeOpForFullLoad (boolean) --

            A value that enables a full load to write INSERT operations to the comma-separated value (.csv) output files only to indicate how the rows were added to the source database.

            Note

            AWS DMS supports IncludeOpForFullLoad in versions 3.1.4 and later.

            For full load, records can only be inserted. By default (the false setting), no information is recorded in these output files for a full load to indicate that the rows were inserted at the source database. If IncludeOpForFullLoad is set to true or y , the INSERT is recorded as an I annotation in the first field of the .csv file. This allows the format of your target records from a full load to be consistent with the target records from a CDC load.

            Note

            This setting works together with CdcInsertsOnly for output to .csv files only. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. .

          • CdcInsertsOnly (boolean) --

            A value that enables a change data capture (CDC) load to write only INSERT operations to .csv or columnar storage (.parquet) output files. By default (the false setting), the first field in a .csv or .parquet record contains the letter I (INSERT), U (UPDATE), or D (DELETE). These values indicate whether the row was inserted, updated, or deleted at the source database for a CDC load to the target.

            If cdcInsertsOnly is set to true or y , only INSERTs from the source database are migrated to the .csv or .parquet file. For .csv format only, how these INSERTs are recorded depends on the value of IncludeOpForFullLoad . If IncludeOpForFullLoad is set to true , the first field of every CDC record is set to I to indicate the INSERT operation at the source. If IncludeOpForFullLoad is set to false , every CDC record is written without a first field to indicate the INSERT operation at the source. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. .

            Note

            AWS DMS supports this interaction between CdcInsertsOnly and IncludeOpForFullLoad in versions 3.1.4 and later.

          • TimestampColumnName (string) --

            A value that includes a timestamp column in the Amazon S3 target endpoint data. AWS DMS includes an additional column in the migrated data when you set timestampColumnName to a non-blank value.

            Note

            AWS DMS supports TimestampColumnName in versions 3.1.4 and later.

            For a full load, each row of the timestamp column contains a timestamp for when the data was transferred from the source to the target by DMS. For a CDC load, each row of the timestamp column contains the timestamp for the commit of that row in the source database. The format for the timestamp column value is yyyy-MM-dd HH:mm:ss.SSSSSS . For CDC, the microsecond precision depends on the commit timestamp supported by DMS for the source database. When the AddColumnName setting is set to true , DMS also includes the name for the timestamp column that you set as the nonblank value of timestampColumnName .

        • DmsTransferSettings (dict) --

          The settings in JSON format for the DMS transfer type of source endpoint.

          Possible attributes include the following:

          • serviceAccessRoleArn - The IAM role that has permission to access the Amazon S3 bucket.

          • bucketName - The name of the S3 bucket to use.

          • compressionType - An optional parameter to use GZIP to compress the target files. To use GZIP, set this value to NONE (the default). To keep the files uncompressed, don't use this value.

          Shorthand syntax for these attributes is as follows: ServiceAccessRoleArn=string,BucketName=string,CompressionType=string

          JSON syntax for these attributes is as follows: { "ServiceAccessRoleArn": "string", "BucketName": "string", "CompressionType": "none"|"gzip" }

          • ServiceAccessRoleArn (string) --

            The IAM role that has permission to access the Amazon S3 bucket.

          • BucketName (string) --

            The name of the S3 bucket to use.

        • MongoDbSettings (dict) --

          The settings for the MongoDB source endpoint. For more information, see the MongoDbSettings structure.

          • Username (string) --

            The user name you use to access the MongoDB source endpoint.

          • Password (string) --

            The password for the user account you use to access the MongoDB source endpoint.

          • ServerName (string) --

            The name of the server on the MongoDB source endpoint.

          • Port (integer) --

            The port value for the MongoDB source endpoint.

          • DatabaseName (string) --

            The database name on the MongoDB source endpoint.

          • AuthType (string) --

            The authentication type you use to access the MongoDB source endpoint.

            Valid values: NO, PASSWORD

            When NO is selected, user name and password parameters are not used and can be empty.

          • AuthMechanism (string) --

            The authentication mechanism you use to access the MongoDB source endpoint.

            Valid values: DEFAULT, MONGODB_CR, SCRAM_SHA_1

            DEFAULT – For MongoDB version 2.x, use MONGODB_CR. For MongoDB version 3.x, use SCRAM_SHA_1. This attribute is not used when authType=No.

          • NestingLevel (string) --

            Specifies either document or table mode.

            Valid values: NONE, ONE

            Default value is NONE. Specify NONE to use document mode. Specify ONE to use table mode.

          • ExtractDocId (string) --

            Specifies the document ID. Use this attribute when NestingLevel is set to NONE.

            Default value is false.

          • DocsToInvestigate (string) --

            Indicates the number of documents to preview to determine the document organization. Use this attribute when NestingLevel is set to ONE.

            Must be a positive value greater than 0. Default value is 1000.

          • AuthSource (string) --

            The MongoDB database name. This attribute is not used when authType=NO .

            The default is admin.

          • KmsKeyId (string) --

            The AWS KMS key identifier that is used to encrypt the content on the replication instance. If you don't specify a value for the KmsKeyId parameter, then AWS DMS uses your default encryption key. AWS KMS creates the default encryption key for your AWS account. Your AWS account has a different default encryption key for each AWS Region.

        • KinesisSettings (dict) --

          The settings for the Amazon Kinesis source endpoint. For more information, see the KinesisSettings structure.

          • StreamArn (string) --

            The Amazon Resource Name (ARN) for the Amazon Kinesis Data Streams endpoint.

          • MessageFormat (string) --

            The output format for the records created on the endpoint. The message format is JSON .

          • ServiceAccessRoleArn (string) --

            The Amazon Resource Name (ARN) for the IAM role that DMS uses to write to the Amazon Kinesis data stream.

        • ElasticsearchSettings (dict) --

          The settings for the Elasticsearch source endpoint. For more information, see the ElasticsearchSettings structure.

          • ServiceAccessRoleArn (string) --

            The Amazon Resource Name (ARN) used by service to access the IAM role.

          • EndpointUri (string) --

            The endpoint for the Elasticsearch cluster.

          • FullLoadErrorPercentage (integer) --

            The maximum percentage of records that can fail to be written before a full load operation stops.

          • ErrorRetryDuration (integer) --

            The maximum number of seconds that DMS retries failed API requests to the Elasticsearch cluster.

        • RedshiftSettings (dict) --

          Settings for the Amazon Redshift endpoint.

          • AcceptAnyDate (boolean) --

            A value that indicates to allow any date format, including invalid formats such as 00/00/00 00:00:00, to be loaded without generating an error. You can choose true or false (the default).

            This parameter applies only to TIMESTAMP and DATE columns. Always use ACCEPTANYDATE with the DATEFORMAT parameter. If the date format for the data doesn't match the DATEFORMAT specification, Amazon Redshift inserts a NULL value into that field.

          • AfterConnectScript (string) --

            Code to run after connecting. This parameter should contain the code itself, not the name of a file containing the code.

          • BucketFolder (string) --

            The location where the comma-separated value (.csv) files are stored before being uploaded to the S3 bucket.

          • BucketName (string) --

            The name of the S3 bucket you want to use

          • ConnectionTimeout (integer) --

            A value that sets the amount of time to wait (in milliseconds) before timing out, beginning from when you initially establish a connection.

          • DatabaseName (string) --

            The name of the Amazon Redshift data warehouse (service) that you are working with.

          • DateFormat (string) --

            The date format that you are using. Valid values are auto (case-sensitive), your date format string enclosed in quotes, or NULL. If this parameter is left unset (NULL), it defaults to a format of 'YYYY-MM-DD'. Using auto recognizes most strings, even some that aren't supported when you use a date format string.

            If your date and time values use formats different from each other, set this to auto .

          • EmptyAsNull (boolean) --

            A value that specifies whether AWS DMS should migrate empty CHAR and VARCHAR fields as NULL. A value of true sets empty CHAR and VARCHAR fields to null. The default is false .

          • EncryptionMode (string) --

            The type of server-side encryption that you want to use for your data. This encryption type is part of the endpoint settings or the extra connections attributes for Amazon S3. You can choose either SSE_S3 (the default) or SSE_KMS . To use SSE_S3 , create an AWS Identity and Access Management (IAM) role with a policy that allows "arn:aws:s3:::*" to use the following actions: "s3:PutObject", "s3:ListBucket"

          • FileTransferUploadStreams (integer) --

            The number of threads used to upload a single file. This parameter accepts a value from 1 through 64. It defaults to 10.

          • LoadTimeout (integer) --

            The amount of time to wait (in milliseconds) before timing out, beginning from when you begin loading.

          • MaxFileSize (integer) --

            The maximum size (in KB) of any .csv file used to transfer data to Amazon Redshift. This accepts a value from 1 through 1,048,576. It defaults to 32,768 KB (32 MB).

          • Password (string) --

            The password for the user named in the username property.

          • Port (integer) --

            The port number for Amazon Redshift. The default value is 5439.

          • RemoveQuotes (boolean) --

            A value that specifies to remove surrounding quotation marks from strings in the incoming data. All characters within the quotation marks, including delimiters, are retained. Choose true to remove quotation marks. The default is false .

          • ReplaceInvalidChars (string) --

            A list of characters that you want to replace. Use with ReplaceChars .

          • ReplaceChars (string) --

            A value that specifies to replaces the invalid characters specified in ReplaceInvalidChars , substituting the specified characters instead. The default is "?" .

          • ServerName (string) --

            The name of the Amazon Redshift cluster you are using.

          • ServiceAccessRoleArn (string) --

            The Amazon Resource Name (ARN) of the IAM role that has access to the Amazon Redshift service.

          • ServerSideEncryptionKmsKeyId (string) --

            The AWS KMS key ID. If you are using SSE_KMS for the EncryptionMode , provide this key ID. The key that you use needs an attached policy that enables IAM user permissions and allows use of the key.

          • TimeFormat (string) --

            The time format that you want to use. Valid values are auto (case-sensitive), 'timeformat_string' , 'epochsecs' , or 'epochmillisecs' . It defaults to 10. Using auto recognizes most strings, even some that aren't supported when you use a time format string.

            If your date and time values use formats different from each other, set this parameter to auto .

          • TrimBlanks (boolean) --

            A value that specifies to remove the trailing white space characters from a VARCHAR string. This parameter applies only to columns with a VARCHAR data type. Choose true to remove unneeded white space. The default is false .

          • TruncateColumns (boolean) --

            A value that specifies to truncate data in columns to the appropriate number of characters, so that the data fits in the column. This parameter applies only to columns with a VARCHAR or CHAR data type, and rows with a size of 4 MB or less. Choose true to truncate data. The default is false .

          • Username (string) --

            An Amazon Redshift user name for a registered user.

          • WriteBufferSize (integer) --

            The size of the write buffer to use in rows. Valid values range from 1 through 2,048. The default is 1,024. Use this setting to tune performance.

DescribeOrderableReplicationInstances (updated) Link ¶
Changes (response)
{'OrderableReplicationInstances': {'ReleaseStatus': 'beta'}}

Returns information about the replication instance types that can be created in the specified region.

See also: AWS API Documentation

Request Syntax

client.describe_orderable_replication_instances(
    MaxRecords=123,
    Marker='string'
)
type MaxRecords

integer

param MaxRecords

The maximum number of records to include in the response. If more records exist than the specified MaxRecords value, a pagination token called a marker is included in the response so that the remaining results can be retrieved.

Default: 100

Constraints: Minimum 20, maximum 100.

type Marker

string

param Marker

An optional pagination token provided by a previous request. If this parameter is specified, the response includes only records beyond the marker, up to the value specified by MaxRecords .

rtype

dict

returns

Response Syntax

{
    'OrderableReplicationInstances': [
        {
            'EngineVersion': 'string',
            'ReplicationInstanceClass': 'string',
            'StorageType': 'string',
            'MinAllocatedStorage': 123,
            'MaxAllocatedStorage': 123,
            'DefaultAllocatedStorage': 123,
            'IncludedAllocatedStorage': 123,
            'AvailabilityZones': [
                'string',
            ],
            'ReleaseStatus': 'beta'
        },
    ],
    'Marker': 'string'
}

Response Structure

  • (dict) --

    • OrderableReplicationInstances (list) --

      The order-able replication instances available.

      • (dict) --

        • EngineVersion (string) --

          The version of the replication engine.

        • ReplicationInstanceClass (string) --

          The compute and memory capacity of the replication instance.

          Valid Values: dms.t2.micro | dms.t2.small | dms.t2.medium | dms.t2.large | dms.c4.large | dms.c4.xlarge | dms.c4.2xlarge | dms.c4.4xlarge

        • StorageType (string) --

          The type of storage used by the replication instance.

        • MinAllocatedStorage (integer) --

          The minimum amount of storage (in gigabytes) that can be allocated for the replication instance.

        • MaxAllocatedStorage (integer) --

          The minimum amount of storage (in gigabytes) that can be allocated for the replication instance.

        • DefaultAllocatedStorage (integer) --

          The default amount of storage (in gigabytes) that is allocated for the replication instance.

        • IncludedAllocatedStorage (integer) --

          The amount of storage (in gigabytes) that is allocated for the replication instance.

        • AvailabilityZones (list) --

          List of Availability Zones for this replication instance.

          • (string) --

        • ReleaseStatus (string) --

          The value returned when the specified EngineVersion of the replication instance is in Beta or test mode. This indicates some features might not work as expected.

          Note

          AWS DMS supports ReleaseStatus in versions 3.1.4 and later.

    • Marker (string) --

      An optional pagination token provided by a previous request. If this parameter is specified, the response includes only records beyond the marker, up to the value specified by MaxRecords .

ModifyEndpoint (updated) Link ¶
Changes (request, response)
Request
{'S3Settings': {'IncludeOpForFullLoad': 'boolean',
                'TimestampColumnName': 'string'}}
Response
{'Endpoint': {'S3Settings': {'IncludeOpForFullLoad': 'boolean',
                             'TimestampColumnName': 'string'}}}

Modifies the specified endpoint.

See also: AWS API Documentation

Request Syntax

client.modify_endpoint(
    EndpointArn='string',
    EndpointIdentifier='string',
    EndpointType='source'|'target',
    EngineName='string',
    Username='string',
    Password='string',
    ServerName='string',
    Port=123,
    DatabaseName='string',
    ExtraConnectionAttributes='string',
    CertificateArn='string',
    SslMode='none'|'require'|'verify-ca'|'verify-full',
    ServiceAccessRoleArn='string',
    ExternalTableDefinition='string',
    DynamoDbSettings={
        'ServiceAccessRoleArn': 'string'
    },
    S3Settings={
        'ServiceAccessRoleArn': 'string',
        'ExternalTableDefinition': 'string',
        'CsvRowDelimiter': 'string',
        'CsvDelimiter': 'string',
        'BucketFolder': 'string',
        'BucketName': 'string',
        'CompressionType': 'none'|'gzip',
        'EncryptionMode': 'sse-s3'|'sse-kms',
        'ServerSideEncryptionKmsKeyId': 'string',
        'DataFormat': 'csv'|'parquet',
        'EncodingType': 'plain'|'plain-dictionary'|'rle-dictionary',
        'DictPageSizeLimit': 123,
        'RowGroupLength': 123,
        'DataPageSize': 123,
        'ParquetVersion': 'parquet-1-0'|'parquet-2-0',
        'EnableStatistics': True|False,
        'IncludeOpForFullLoad': True|False,
        'CdcInsertsOnly': True|False,
        'TimestampColumnName': 'string'
    },
    DmsTransferSettings={
        'ServiceAccessRoleArn': 'string',
        'BucketName': 'string'
    },
    MongoDbSettings={
        'Username': 'string',
        'Password': 'string',
        'ServerName': 'string',
        'Port': 123,
        'DatabaseName': 'string',
        'AuthType': 'no'|'password',
        'AuthMechanism': 'default'|'mongodb_cr'|'scram_sha_1',
        'NestingLevel': 'none'|'one',
        'ExtractDocId': 'string',
        'DocsToInvestigate': 'string',
        'AuthSource': 'string',
        'KmsKeyId': 'string'
    },
    KinesisSettings={
        'StreamArn': 'string',
        'MessageFormat': 'json',
        'ServiceAccessRoleArn': 'string'
    },
    ElasticsearchSettings={
        'ServiceAccessRoleArn': 'string',
        'EndpointUri': 'string',
        'FullLoadErrorPercentage': 123,
        'ErrorRetryDuration': 123
    },
    RedshiftSettings={
        'AcceptAnyDate': True|False,
        'AfterConnectScript': 'string',
        'BucketFolder': 'string',
        'BucketName': 'string',
        'ConnectionTimeout': 123,
        'DatabaseName': 'string',
        'DateFormat': 'string',
        'EmptyAsNull': True|False,
        'EncryptionMode': 'sse-s3'|'sse-kms',
        'FileTransferUploadStreams': 123,
        'LoadTimeout': 123,
        'MaxFileSize': 123,
        'Password': 'string',
        'Port': 123,
        'RemoveQuotes': True|False,
        'ReplaceInvalidChars': 'string',
        'ReplaceChars': 'string',
        'ServerName': 'string',
        'ServiceAccessRoleArn': 'string',
        'ServerSideEncryptionKmsKeyId': 'string',
        'TimeFormat': 'string',
        'TrimBlanks': True|False,
        'TruncateColumns': True|False,
        'Username': 'string',
        'WriteBufferSize': 123
    }
)
type EndpointArn

string

param EndpointArn

[REQUIRED]

The Amazon Resource Name (ARN) string that uniquely identifies the endpoint.

type EndpointIdentifier

string

param EndpointIdentifier

The database endpoint identifier. Identifiers must begin with a letter; must contain only ASCII letters, digits, and hyphens; and must not end with a hyphen or contain two consecutive hyphens.

type EndpointType

string

param EndpointType

The type of endpoint. Valid values are source and target .

type EngineName

string

param EngineName

The type of engine for the endpoint. Valid values, depending on the EndpointType, include mysql, oracle, postgres, mariadb, aurora, aurora-postgresql, redshift, s3, db2, azuredb, sybase, dynamodb, mongodb, and sqlserver.

type Username

string

param Username

The user name to be used to login to the endpoint database.

type Password

string

param Password

The password to be used to login to the endpoint database.

type ServerName

string

param ServerName

The name of the server where the endpoint database resides.

type Port

integer

param Port

The port used by the endpoint database.

type DatabaseName

string

param DatabaseName

The name of the endpoint database.

type ExtraConnectionAttributes

string

param ExtraConnectionAttributes

Additional attributes associated with the connection. To reset this parameter, pass the empty string ("") as an argument.

type CertificateArn

string

param CertificateArn

The Amazon Resource Name (ARN) of the certificate used for SSL connection.

type SslMode

string

param SslMode

The SSL mode used to connect to the endpoint. The default value is none .

type ServiceAccessRoleArn

string

param ServiceAccessRoleArn

The Amazon Resource Name (ARN) for the service access role you want to use to modify the endpoint.

type ExternalTableDefinition

string

param ExternalTableDefinition

The external table definition.

type DynamoDbSettings

dict

param DynamoDbSettings

Settings in JSON format for the target Amazon DynamoDB endpoint. For more information about the available settings, see Using Object Mapping to Migrate Data to DynamoDB in the AWS Database Migration Service User Guide.

  • ServiceAccessRoleArn (string) -- [REQUIRED]

    The Amazon Resource Name (ARN) used by the service access IAM role.

type S3Settings

dict

param S3Settings

Settings in JSON format for the target Amazon S3 endpoint. For more information about the available settings, see Extra Connection Attributes When Using Amazon S3 as a Target for AWS DMS in the AWS Database Migration Service User Guide.

  • ServiceAccessRoleArn (string) --

    The Amazon Resource Name (ARN) used by the service access IAM role.

  • ExternalTableDefinition (string) --

    The external table definition.

  • CsvRowDelimiter (string) --

    The delimiter used to separate rows in the source files. The default is a carriage return (\n ).

  • CsvDelimiter (string) --

    The delimiter used to separate columns in the source files. The default is a comma.

  • BucketFolder (string) --

    An optional parameter to set a folder name in the S3 bucket. If provided, tables are created in the path `` bucketFolder /schema_name /table_name /`` . If this parameter is not specified, then the path used is `` schema_name /table_name /`` .

  • BucketName (string) --

    The name of the S3 bucket.

  • CompressionType (string) --

    An optional parameter to use GZIP to compress the target files. Set to GZIP to compress the target files. Set to NONE (the default) or do not use to leave the files uncompressed. Applies to both .csv and .parquet file formats.

  • EncryptionMode (string) --

    The type of server-side encryption that you want to use for your data. This encryption type is part of the endpoint settings or the extra connections attributes for Amazon S3. You can choose either SSE_S3 (the default) or SSE_KMS . To use SSE_S3 , you need an AWS Identity and Access Management (IAM) role with permission to allow "arn:aws:s3:::dms-*" to use the following actions:

    • s3:CreateBucket

    • s3:ListBucket

    • s3:DeleteBucket

    • s3:GetBucketLocation

    • s3:GetObject

    • s3:PutObject

    • s3:DeleteObject

    • s3:GetObjectVersion

    • s3:GetBucketPolicy

    • s3:PutBucketPolicy

    • s3:DeleteBucketPolicy

  • ServerSideEncryptionKmsKeyId (string) --

    If you are using SSE_KMS for the EncryptionMode , provide the AWS KMS key ID. The key that you use needs an attached policy that enables AWS Identity and Access Management (IAM) user permissions and allows use of the key.

    Here is a CLI example: ``aws dms create-endpoint --endpoint-identifier value --endpoint-type target --engine-name s3 --s3-settings ServiceAccessRoleArn=*value* ,BucketFolder=*value* ,BucketName=*value* ,EncryptionMode=SSE_KMS,ServerSideEncryptionKmsKeyId=*value* ``

  • DataFormat (string) --

    The format of the data that you want to use for output. You can choose one of the following:

    • csv : This is a row-based file format with comma-separated values (.csv).

    • parquet : Apache Parquet (.parquet) is a columnar storage file format that features efficient compression and provides faster query response.

  • EncodingType (string) --

    The type of encoding you are using:

    • RLE_DICTIONARY uses a combination of bit-packing and run-length encoding to store repeated values more efficiently. This is the default.

    • PLAIN doesn't use encoding at all. Values are stored as they are.

    • PLAIN_DICTIONARY builds a dictionary of the values encountered in a given column. The dictionary is stored in a dictionary page for each column chunk.

  • DictPageSizeLimit (integer) --

    The maximum size of an encoded dictionary page of a column. If the dictionary page exceeds this, this column is stored using an encoding type of PLAIN . This parameter defaults to 1024 * 1024 bytes (1 MiB), the maximum size of a dictionary page before it reverts to PLAIN encoding. This size is used for .parquet file format only.

  • RowGroupLength (integer) --

    The number of rows in a row group. A smaller row group size provides faster reads. But as the number of row groups grows, the slower writes become. This parameter defaults to 10,000 rows. This number is used for .parquet file format only.

    If you choose a value larger than the maximum, RowGroupLength is set to the max row group length in bytes (64 * 1024 * 1024).

  • DataPageSize (integer) --

    The size of one data page in bytes. This parameter defaults to 1024 * 1024 bytes (1 MiB). This number is used for .parquet file format only.

  • ParquetVersion (string) --

    The version of the Apache Parquet format that you want to use: parquet_1_0 (the default) or parquet_2_0 .

  • EnableStatistics (boolean) --

    A value that enables statistics for Parquet pages and row groups. Choose true to enable statistics, false to disable. Statistics include NULL , DISTINCT , MAX , and MIN values. This parameter defaults to true . This value is used for .parquet file format only.

  • IncludeOpForFullLoad (boolean) --

    A value that enables a full load to write INSERT operations to the comma-separated value (.csv) output files only to indicate how the rows were added to the source database.

    Note

    AWS DMS supports IncludeOpForFullLoad in versions 3.1.4 and later.

    For full load, records can only be inserted. By default (the false setting), no information is recorded in these output files for a full load to indicate that the rows were inserted at the source database. If IncludeOpForFullLoad is set to true or y , the INSERT is recorded as an I annotation in the first field of the .csv file. This allows the format of your target records from a full load to be consistent with the target records from a CDC load.

    Note

    This setting works together with CdcInsertsOnly for output to .csv files only. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. .

  • CdcInsertsOnly (boolean) --

    A value that enables a change data capture (CDC) load to write only INSERT operations to .csv or columnar storage (.parquet) output files. By default (the false setting), the first field in a .csv or .parquet record contains the letter I (INSERT), U (UPDATE), or D (DELETE). These values indicate whether the row was inserted, updated, or deleted at the source database for a CDC load to the target.

    If cdcInsertsOnly is set to true or y , only INSERTs from the source database are migrated to the .csv or .parquet file. For .csv format only, how these INSERTs are recorded depends on the value of IncludeOpForFullLoad . If IncludeOpForFullLoad is set to true , the first field of every CDC record is set to I to indicate the INSERT operation at the source. If IncludeOpForFullLoad is set to false , every CDC record is written without a first field to indicate the INSERT operation at the source. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. .

    Note

    AWS DMS supports this interaction between CdcInsertsOnly and IncludeOpForFullLoad in versions 3.1.4 and later.

  • TimestampColumnName (string) --

    A value that includes a timestamp column in the Amazon S3 target endpoint data. AWS DMS includes an additional column in the migrated data when you set timestampColumnName to a non-blank value.

    Note

    AWS DMS supports TimestampColumnName in versions 3.1.4 and later.

    For a full load, each row of the timestamp column contains a timestamp for when the data was transferred from the source to the target by DMS. For a CDC load, each row of the timestamp column contains the timestamp for the commit of that row in the source database. The format for the timestamp column value is yyyy-MM-dd HH:mm:ss.SSSSSS . For CDC, the microsecond precision depends on the commit timestamp supported by DMS for the source database. When the AddColumnName setting is set to true , DMS also includes the name for the timestamp column that you set as the nonblank value of timestampColumnName .

type DmsTransferSettings

dict

param DmsTransferSettings

The settings in JSON format for the DMS transfer type of source endpoint.

Attributes include the following:

  • serviceAccessRoleArn - The IAM role that has permission to access the Amazon S3 bucket.

  • BucketName - The name of the S3 bucket to use.

  • compressionType - An optional parameter to use GZIP to compress the target files. Set to NONE (the default) or do not use to leave the files uncompressed.

Shorthand syntax: ServiceAccessRoleArn=string ,BucketName=string,CompressionType=string

JSON syntax:

{ "ServiceAccessRoleArn": "string", "BucketName": "string", "CompressionType": "none"|"gzip" }

  • ServiceAccessRoleArn (string) --

    The IAM role that has permission to access the Amazon S3 bucket.

  • BucketName (string) --

    The name of the S3 bucket to use.

type MongoDbSettings

dict

param MongoDbSettings

Settings in JSON format for the source MongoDB endpoint. For more information about the available settings, see the configuration properties section in Using MongoDB as a Target for AWS Database Migration Service in the AWS Database Migration Service User Guide.

  • Username (string) --

    The user name you use to access the MongoDB source endpoint.

  • Password (string) --

    The password for the user account you use to access the MongoDB source endpoint.

  • ServerName (string) --

    The name of the server on the MongoDB source endpoint.

  • Port (integer) --

    The port value for the MongoDB source endpoint.

  • DatabaseName (string) --

    The database name on the MongoDB source endpoint.

  • AuthType (string) --

    The authentication type you use to access the MongoDB source endpoint.

    Valid values: NO, PASSWORD

    When NO is selected, user name and password parameters are not used and can be empty.

  • AuthMechanism (string) --

    The authentication mechanism you use to access the MongoDB source endpoint.

    Valid values: DEFAULT, MONGODB_CR, SCRAM_SHA_1

    DEFAULT – For MongoDB version 2.x, use MONGODB_CR. For MongoDB version 3.x, use SCRAM_SHA_1. This attribute is not used when authType=No.

  • NestingLevel (string) --

    Specifies either document or table mode.

    Valid values: NONE, ONE

    Default value is NONE. Specify NONE to use document mode. Specify ONE to use table mode.

  • ExtractDocId (string) --

    Specifies the document ID. Use this attribute when NestingLevel is set to NONE.

    Default value is false.

  • DocsToInvestigate (string) --

    Indicates the number of documents to preview to determine the document organization. Use this attribute when NestingLevel is set to ONE.

    Must be a positive value greater than 0. Default value is 1000.

  • AuthSource (string) --

    The MongoDB database name. This attribute is not used when authType=NO .

    The default is admin.

  • KmsKeyId (string) --

    The AWS KMS key identifier that is used to encrypt the content on the replication instance. If you don't specify a value for the KmsKeyId parameter, then AWS DMS uses your default encryption key. AWS KMS creates the default encryption key for your AWS account. Your AWS account has a different default encryption key for each AWS Region.

type KinesisSettings

dict

param KinesisSettings

Settings in JSON format for the target Amazon Kinesis Data Streams endpoint. For more information about the available settings, see Using Object Mapping to Migrate Data to a Kinesis Data Stream in the AWS Database Migration User Guide.

  • StreamArn (string) --

    The Amazon Resource Name (ARN) for the Amazon Kinesis Data Streams endpoint.

  • MessageFormat (string) --

    The output format for the records created on the endpoint. The message format is JSON .

  • ServiceAccessRoleArn (string) --

    The Amazon Resource Name (ARN) for the IAM role that DMS uses to write to the Amazon Kinesis data stream.

type ElasticsearchSettings

dict

param ElasticsearchSettings

Settings in JSON format for the target Elasticsearch endpoint. For more information about the available settings, see Extra Connection Attributes When Using Elasticsearch as a Target for AWS DMS in the AWS Database Migration User Guide.

  • ServiceAccessRoleArn (string) -- [REQUIRED]

    The Amazon Resource Name (ARN) used by service to access the IAM role.

  • EndpointUri (string) -- [REQUIRED]

    The endpoint for the Elasticsearch cluster.

  • FullLoadErrorPercentage (integer) --

    The maximum percentage of records that can fail to be written before a full load operation stops.

  • ErrorRetryDuration (integer) --

    The maximum number of seconds that DMS retries failed API requests to the Elasticsearch cluster.

type RedshiftSettings

dict

param RedshiftSettings
  • AcceptAnyDate (boolean) --

    A value that indicates to allow any date format, including invalid formats such as 00/00/00 00:00:00, to be loaded without generating an error. You can choose true or false (the default).

    This parameter applies only to TIMESTAMP and DATE columns. Always use ACCEPTANYDATE with the DATEFORMAT parameter. If the date format for the data doesn't match the DATEFORMAT specification, Amazon Redshift inserts a NULL value into that field.

  • AfterConnectScript (string) --

    Code to run after connecting. This parameter should contain the code itself, not the name of a file containing the code.

  • BucketFolder (string) --

    The location where the comma-separated value (.csv) files are stored before being uploaded to the S3 bucket.

  • BucketName (string) --

    The name of the S3 bucket you want to use

  • ConnectionTimeout (integer) --

    A value that sets the amount of time to wait (in milliseconds) before timing out, beginning from when you initially establish a connection.

  • DatabaseName (string) --

    The name of the Amazon Redshift data warehouse (service) that you are working with.

  • DateFormat (string) --

    The date format that you are using. Valid values are auto (case-sensitive), your date format string enclosed in quotes, or NULL. If this parameter is left unset (NULL), it defaults to a format of 'YYYY-MM-DD'. Using auto recognizes most strings, even some that aren't supported when you use a date format string.

    If your date and time values use formats different from each other, set this to auto .

  • EmptyAsNull (boolean) --

    A value that specifies whether AWS DMS should migrate empty CHAR and VARCHAR fields as NULL. A value of true sets empty CHAR and VARCHAR fields to null. The default is false .

  • EncryptionMode (string) --

    The type of server-side encryption that you want to use for your data. This encryption type is part of the endpoint settings or the extra connections attributes for Amazon S3. You can choose either SSE_S3 (the default) or SSE_KMS . To use SSE_S3 , create an AWS Identity and Access Management (IAM) role with a policy that allows "arn:aws:s3:::*" to use the following actions: "s3:PutObject", "s3:ListBucket"

  • FileTransferUploadStreams (integer) --

    The number of threads used to upload a single file. This parameter accepts a value from 1 through 64. It defaults to 10.

  • LoadTimeout (integer) --

    The amount of time to wait (in milliseconds) before timing out, beginning from when you begin loading.

  • MaxFileSize (integer) --

    The maximum size (in KB) of any .csv file used to transfer data to Amazon Redshift. This accepts a value from 1 through 1,048,576. It defaults to 32,768 KB (32 MB).

  • Password (string) --

    The password for the user named in the username property.

  • Port (integer) --

    The port number for Amazon Redshift. The default value is 5439.

  • RemoveQuotes (boolean) --

    A value that specifies to remove surrounding quotation marks from strings in the incoming data. All characters within the quotation marks, including delimiters, are retained. Choose true to remove quotation marks. The default is false .

  • ReplaceInvalidChars (string) --

    A list of characters that you want to replace. Use with ReplaceChars .

  • ReplaceChars (string) --

    A value that specifies to replaces the invalid characters specified in ReplaceInvalidChars , substituting the specified characters instead. The default is "?" .

  • ServerName (string) --

    The name of the Amazon Redshift cluster you are using.

  • ServiceAccessRoleArn (string) --

    The Amazon Resource Name (ARN) of the IAM role that has access to the Amazon Redshift service.

  • ServerSideEncryptionKmsKeyId (string) --

    The AWS KMS key ID. If you are using SSE_KMS for the EncryptionMode , provide this key ID. The key that you use needs an attached policy that enables IAM user permissions and allows use of the key.

  • TimeFormat (string) --

    The time format that you want to use. Valid values are auto (case-sensitive), 'timeformat_string' , 'epochsecs' , or 'epochmillisecs' . It defaults to 10. Using auto recognizes most strings, even some that aren't supported when you use a time format string.

    If your date and time values use formats different from each other, set this parameter to auto .

  • TrimBlanks (boolean) --

    A value that specifies to remove the trailing white space characters from a VARCHAR string. This parameter applies only to columns with a VARCHAR data type. Choose true to remove unneeded white space. The default is false .

  • TruncateColumns (boolean) --

    A value that specifies to truncate data in columns to the appropriate number of characters, so that the data fits in the column. This parameter applies only to columns with a VARCHAR or CHAR data type, and rows with a size of 4 MB or less. Choose true to truncate data. The default is false .

  • Username (string) --

    An Amazon Redshift user name for a registered user.

  • WriteBufferSize (integer) --

    The size of the write buffer to use in rows. Valid values range from 1 through 2,048. The default is 1,024. Use this setting to tune performance.

rtype

dict

returns

Response Syntax

{
    'Endpoint': {
        'EndpointIdentifier': 'string',
        'EndpointType': 'source'|'target',
        'EngineName': 'string',
        'EngineDisplayName': 'string',
        'Username': 'string',
        'ServerName': 'string',
        'Port': 123,
        'DatabaseName': 'string',
        'ExtraConnectionAttributes': 'string',
        'Status': 'string',
        'KmsKeyId': 'string',
        'EndpointArn': 'string',
        'CertificateArn': 'string',
        'SslMode': 'none'|'require'|'verify-ca'|'verify-full',
        'ServiceAccessRoleArn': 'string',
        'ExternalTableDefinition': 'string',
        'ExternalId': 'string',
        'DynamoDbSettings': {
            'ServiceAccessRoleArn': 'string'
        },
        'S3Settings': {
            'ServiceAccessRoleArn': 'string',
            'ExternalTableDefinition': 'string',
            'CsvRowDelimiter': 'string',
            'CsvDelimiter': 'string',
            'BucketFolder': 'string',
            'BucketName': 'string',
            'CompressionType': 'none'|'gzip',
            'EncryptionMode': 'sse-s3'|'sse-kms',
            'ServerSideEncryptionKmsKeyId': 'string',
            'DataFormat': 'csv'|'parquet',
            'EncodingType': 'plain'|'plain-dictionary'|'rle-dictionary',
            'DictPageSizeLimit': 123,
            'RowGroupLength': 123,
            'DataPageSize': 123,
            'ParquetVersion': 'parquet-1-0'|'parquet-2-0',
            'EnableStatistics': True|False,
            'IncludeOpForFullLoad': True|False,
            'CdcInsertsOnly': True|False,
            'TimestampColumnName': 'string'
        },
        'DmsTransferSettings': {
            'ServiceAccessRoleArn': 'string',
            'BucketName': 'string'
        },
        'MongoDbSettings': {
            'Username': 'string',
            'Password': 'string',
            'ServerName': 'string',
            'Port': 123,
            'DatabaseName': 'string',
            'AuthType': 'no'|'password',
            'AuthMechanism': 'default'|'mongodb_cr'|'scram_sha_1',
            'NestingLevel': 'none'|'one',
            'ExtractDocId': 'string',
            'DocsToInvestigate': 'string',
            'AuthSource': 'string',
            'KmsKeyId': 'string'
        },
        'KinesisSettings': {
            'StreamArn': 'string',
            'MessageFormat': 'json',
            'ServiceAccessRoleArn': 'string'
        },
        'ElasticsearchSettings': {
            'ServiceAccessRoleArn': 'string',
            'EndpointUri': 'string',
            'FullLoadErrorPercentage': 123,
            'ErrorRetryDuration': 123
        },
        'RedshiftSettings': {
            'AcceptAnyDate': True|False,
            'AfterConnectScript': 'string',
            'BucketFolder': 'string',
            'BucketName': 'string',
            'ConnectionTimeout': 123,
            'DatabaseName': 'string',
            'DateFormat': 'string',
            'EmptyAsNull': True|False,
            'EncryptionMode': 'sse-s3'|'sse-kms',
            'FileTransferUploadStreams': 123,
            'LoadTimeout': 123,
            'MaxFileSize': 123,
            'Password': 'string',
            'Port': 123,
            'RemoveQuotes': True|False,
            'ReplaceInvalidChars': 'string',
            'ReplaceChars': 'string',
            'ServerName': 'string',
            'ServiceAccessRoleArn': 'string',
            'ServerSideEncryptionKmsKeyId': 'string',
            'TimeFormat': 'string',
            'TrimBlanks': True|False,
            'TruncateColumns': True|False,
            'Username': 'string',
            'WriteBufferSize': 123
        }
    }
}

Response Structure

  • (dict) --

    • Endpoint (dict) --

      The modified endpoint.

      • EndpointIdentifier (string) --

        The database endpoint identifier. Identifiers must begin with a letter; must contain only ASCII letters, digits, and hyphens; and must not end with a hyphen or contain two consecutive hyphens.

      • EndpointType (string) --

        The type of endpoint. Valid values are source and target .

      • EngineName (string) --

        The database engine name. Valid values, depending on the EndpointType, include mysql, oracle, postgres, mariadb, aurora, aurora-postgresql, redshift, s3, db2, azuredb, sybase, dynamodb, mongodb, and sqlserver.

      • EngineDisplayName (string) --

        The expanded name for the engine name. For example, if the EngineName parameter is "aurora," this value would be "Amazon Aurora MySQL."

      • Username (string) --

        The user name used to connect to the endpoint.

      • ServerName (string) --

        The name of the server at the endpoint.

      • Port (integer) --

        The port value used to access the endpoint.

      • DatabaseName (string) --

        The name of the database at the endpoint.

      • ExtraConnectionAttributes (string) --

        Additional connection attributes used to connect to the endpoint.

      • Status (string) --

        The status of the endpoint.

      • KmsKeyId (string) --

        An AWS KMS key identifier that is used to encrypt the connection parameters for the endpoint.

        If you don't specify a value for the KmsKeyId parameter, then AWS DMS uses your default encryption key.

        AWS KMS creates the default encryption key for your AWS account. Your AWS account has a different default encryption key for each AWS Region.

      • EndpointArn (string) --

        The Amazon Resource Name (ARN) string that uniquely identifies the endpoint.

      • CertificateArn (string) --

        The Amazon Resource Name (ARN) used for SSL connection to the endpoint.

      • SslMode (string) --

        The SSL mode used to connect to the endpoint. The default value is none .

      • ServiceAccessRoleArn (string) --

        The Amazon Resource Name (ARN) used by the service access IAM role.

      • ExternalTableDefinition (string) --

        The external table definition.

      • ExternalId (string) --

        Value returned by a call to CreateEndpoint that can be used for cross-account validation. Use it on a subsequent call to CreateEndpoint to create the endpoint with a cross-account.

      • DynamoDbSettings (dict) --

        The settings for the target DynamoDB database. For more information, see the DynamoDBSettings structure.

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) used by the service access IAM role.

      • S3Settings (dict) --

        The settings for the S3 target endpoint. For more information, see the S3Settings structure.

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) used by the service access IAM role.

        • ExternalTableDefinition (string) --

          The external table definition.

        • CsvRowDelimiter (string) --

          The delimiter used to separate rows in the source files. The default is a carriage return (\n ).

        • CsvDelimiter (string) --

          The delimiter used to separate columns in the source files. The default is a comma.

        • BucketFolder (string) --

          An optional parameter to set a folder name in the S3 bucket. If provided, tables are created in the path `` bucketFolder /schema_name /table_name /`` . If this parameter is not specified, then the path used is `` schema_name /table_name /`` .

        • BucketName (string) --

          The name of the S3 bucket.

        • CompressionType (string) --

          An optional parameter to use GZIP to compress the target files. Set to GZIP to compress the target files. Set to NONE (the default) or do not use to leave the files uncompressed. Applies to both .csv and .parquet file formats.

        • EncryptionMode (string) --

          The type of server-side encryption that you want to use for your data. This encryption type is part of the endpoint settings or the extra connections attributes for Amazon S3. You can choose either SSE_S3 (the default) or SSE_KMS . To use SSE_S3 , you need an AWS Identity and Access Management (IAM) role with permission to allow "arn:aws:s3:::dms-*" to use the following actions:

          • s3:CreateBucket

          • s3:ListBucket

          • s3:DeleteBucket

          • s3:GetBucketLocation

          • s3:GetObject

          • s3:PutObject

          • s3:DeleteObject

          • s3:GetObjectVersion

          • s3:GetBucketPolicy

          • s3:PutBucketPolicy

          • s3:DeleteBucketPolicy

        • ServerSideEncryptionKmsKeyId (string) --

          If you are using SSE_KMS for the EncryptionMode , provide the AWS KMS key ID. The key that you use needs an attached policy that enables AWS Identity and Access Management (IAM) user permissions and allows use of the key.

          Here is a CLI example: ``aws dms create-endpoint --endpoint-identifier value --endpoint-type target --engine-name s3 --s3-settings ServiceAccessRoleArn=*value* ,BucketFolder=*value* ,BucketName=*value* ,EncryptionMode=SSE_KMS,ServerSideEncryptionKmsKeyId=*value* ``

        • DataFormat (string) --

          The format of the data that you want to use for output. You can choose one of the following:

          • csv : This is a row-based file format with comma-separated values (.csv).

          • parquet : Apache Parquet (.parquet) is a columnar storage file format that features efficient compression and provides faster query response.

        • EncodingType (string) --

          The type of encoding you are using:

          • RLE_DICTIONARY uses a combination of bit-packing and run-length encoding to store repeated values more efficiently. This is the default.

          • PLAIN doesn't use encoding at all. Values are stored as they are.

          • PLAIN_DICTIONARY builds a dictionary of the values encountered in a given column. The dictionary is stored in a dictionary page for each column chunk.

        • DictPageSizeLimit (integer) --

          The maximum size of an encoded dictionary page of a column. If the dictionary page exceeds this, this column is stored using an encoding type of PLAIN . This parameter defaults to 1024 * 1024 bytes (1 MiB), the maximum size of a dictionary page before it reverts to PLAIN encoding. This size is used for .parquet file format only.

        • RowGroupLength (integer) --

          The number of rows in a row group. A smaller row group size provides faster reads. But as the number of row groups grows, the slower writes become. This parameter defaults to 10,000 rows. This number is used for .parquet file format only.

          If you choose a value larger than the maximum, RowGroupLength is set to the max row group length in bytes (64 * 1024 * 1024).

        • DataPageSize (integer) --

          The size of one data page in bytes. This parameter defaults to 1024 * 1024 bytes (1 MiB). This number is used for .parquet file format only.

        • ParquetVersion (string) --

          The version of the Apache Parquet format that you want to use: parquet_1_0 (the default) or parquet_2_0 .

        • EnableStatistics (boolean) --

          A value that enables statistics for Parquet pages and row groups. Choose true to enable statistics, false to disable. Statistics include NULL , DISTINCT , MAX , and MIN values. This parameter defaults to true . This value is used for .parquet file format only.

        • IncludeOpForFullLoad (boolean) --

          A value that enables a full load to write INSERT operations to the comma-separated value (.csv) output files only to indicate how the rows were added to the source database.

          Note

          AWS DMS supports IncludeOpForFullLoad in versions 3.1.4 and later.

          For full load, records can only be inserted. By default (the false setting), no information is recorded in these output files for a full load to indicate that the rows were inserted at the source database. If IncludeOpForFullLoad is set to true or y , the INSERT is recorded as an I annotation in the first field of the .csv file. This allows the format of your target records from a full load to be consistent with the target records from a CDC load.

          Note

          This setting works together with CdcInsertsOnly for output to .csv files only. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. .

        • CdcInsertsOnly (boolean) --

          A value that enables a change data capture (CDC) load to write only INSERT operations to .csv or columnar storage (.parquet) output files. By default (the false setting), the first field in a .csv or .parquet record contains the letter I (INSERT), U (UPDATE), or D (DELETE). These values indicate whether the row was inserted, updated, or deleted at the source database for a CDC load to the target.

          If cdcInsertsOnly is set to true or y , only INSERTs from the source database are migrated to the .csv or .parquet file. For .csv format only, how these INSERTs are recorded depends on the value of IncludeOpForFullLoad . If IncludeOpForFullLoad is set to true , the first field of every CDC record is set to I to indicate the INSERT operation at the source. If IncludeOpForFullLoad is set to false , every CDC record is written without a first field to indicate the INSERT operation at the source. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. .

          Note

          AWS DMS supports this interaction between CdcInsertsOnly and IncludeOpForFullLoad in versions 3.1.4 and later.

        • TimestampColumnName (string) --

          A value that includes a timestamp column in the Amazon S3 target endpoint data. AWS DMS includes an additional column in the migrated data when you set timestampColumnName to a non-blank value.

          Note

          AWS DMS supports TimestampColumnName in versions 3.1.4 and later.

          For a full load, each row of the timestamp column contains a timestamp for when the data was transferred from the source to the target by DMS. For a CDC load, each row of the timestamp column contains the timestamp for the commit of that row in the source database. The format for the timestamp column value is yyyy-MM-dd HH:mm:ss.SSSSSS . For CDC, the microsecond precision depends on the commit timestamp supported by DMS for the source database. When the AddColumnName setting is set to true , DMS also includes the name for the timestamp column that you set as the nonblank value of timestampColumnName .

      • DmsTransferSettings (dict) --

        The settings in JSON format for the DMS transfer type of source endpoint.

        Possible attributes include the following:

        • serviceAccessRoleArn - The IAM role that has permission to access the Amazon S3 bucket.

        • bucketName - The name of the S3 bucket to use.

        • compressionType - An optional parameter to use GZIP to compress the target files. To use GZIP, set this value to NONE (the default). To keep the files uncompressed, don't use this value.

        Shorthand syntax for these attributes is as follows: ServiceAccessRoleArn=string,BucketName=string,CompressionType=string

        JSON syntax for these attributes is as follows: { "ServiceAccessRoleArn": "string", "BucketName": "string", "CompressionType": "none"|"gzip" }

        • ServiceAccessRoleArn (string) --

          The IAM role that has permission to access the Amazon S3 bucket.

        • BucketName (string) --

          The name of the S3 bucket to use.

      • MongoDbSettings (dict) --

        The settings for the MongoDB source endpoint. For more information, see the MongoDbSettings structure.

        • Username (string) --

          The user name you use to access the MongoDB source endpoint.

        • Password (string) --

          The password for the user account you use to access the MongoDB source endpoint.

        • ServerName (string) --

          The name of the server on the MongoDB source endpoint.

        • Port (integer) --

          The port value for the MongoDB source endpoint.

        • DatabaseName (string) --

          The database name on the MongoDB source endpoint.

        • AuthType (string) --

          The authentication type you use to access the MongoDB source endpoint.

          Valid values: NO, PASSWORD

          When NO is selected, user name and password parameters are not used and can be empty.

        • AuthMechanism (string) --

          The authentication mechanism you use to access the MongoDB source endpoint.

          Valid values: DEFAULT, MONGODB_CR, SCRAM_SHA_1

          DEFAULT – For MongoDB version 2.x, use MONGODB_CR. For MongoDB version 3.x, use SCRAM_SHA_1. This attribute is not used when authType=No.

        • NestingLevel (string) --

          Specifies either document or table mode.

          Valid values: NONE, ONE

          Default value is NONE. Specify NONE to use document mode. Specify ONE to use table mode.

        • ExtractDocId (string) --

          Specifies the document ID. Use this attribute when NestingLevel is set to NONE.

          Default value is false.

        • DocsToInvestigate (string) --

          Indicates the number of documents to preview to determine the document organization. Use this attribute when NestingLevel is set to ONE.

          Must be a positive value greater than 0. Default value is 1000.

        • AuthSource (string) --

          The MongoDB database name. This attribute is not used when authType=NO .

          The default is admin.

        • KmsKeyId (string) --

          The AWS KMS key identifier that is used to encrypt the content on the replication instance. If you don't specify a value for the KmsKeyId parameter, then AWS DMS uses your default encryption key. AWS KMS creates the default encryption key for your AWS account. Your AWS account has a different default encryption key for each AWS Region.

      • KinesisSettings (dict) --

        The settings for the Amazon Kinesis source endpoint. For more information, see the KinesisSettings structure.

        • StreamArn (string) --

          The Amazon Resource Name (ARN) for the Amazon Kinesis Data Streams endpoint.

        • MessageFormat (string) --

          The output format for the records created on the endpoint. The message format is JSON .

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) for the IAM role that DMS uses to write to the Amazon Kinesis data stream.

      • ElasticsearchSettings (dict) --

        The settings for the Elasticsearch source endpoint. For more information, see the ElasticsearchSettings structure.

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) used by service to access the IAM role.

        • EndpointUri (string) --

          The endpoint for the Elasticsearch cluster.

        • FullLoadErrorPercentage (integer) --

          The maximum percentage of records that can fail to be written before a full load operation stops.

        • ErrorRetryDuration (integer) --

          The maximum number of seconds that DMS retries failed API requests to the Elasticsearch cluster.

      • RedshiftSettings (dict) --

        Settings for the Amazon Redshift endpoint.

        • AcceptAnyDate (boolean) --

          A value that indicates to allow any date format, including invalid formats such as 00/00/00 00:00:00, to be loaded without generating an error. You can choose true or false (the default).

          This parameter applies only to TIMESTAMP and DATE columns. Always use ACCEPTANYDATE with the DATEFORMAT parameter. If the date format for the data doesn't match the DATEFORMAT specification, Amazon Redshift inserts a NULL value into that field.

        • AfterConnectScript (string) --

          Code to run after connecting. This parameter should contain the code itself, not the name of a file containing the code.

        • BucketFolder (string) --

          The location where the comma-separated value (.csv) files are stored before being uploaded to the S3 bucket.

        • BucketName (string) --

          The name of the S3 bucket you want to use

        • ConnectionTimeout (integer) --

          A value that sets the amount of time to wait (in milliseconds) before timing out, beginning from when you initially establish a connection.

        • DatabaseName (string) --

          The name of the Amazon Redshift data warehouse (service) that you are working with.

        • DateFormat (string) --

          The date format that you are using. Valid values are auto (case-sensitive), your date format string enclosed in quotes, or NULL. If this parameter is left unset (NULL), it defaults to a format of 'YYYY-MM-DD'. Using auto recognizes most strings, even some that aren't supported when you use a date format string.

          If your date and time values use formats different from each other, set this to auto .

        • EmptyAsNull (boolean) --

          A value that specifies whether AWS DMS should migrate empty CHAR and VARCHAR fields as NULL. A value of true sets empty CHAR and VARCHAR fields to null. The default is false .

        • EncryptionMode (string) --

          The type of server-side encryption that you want to use for your data. This encryption type is part of the endpoint settings or the extra connections attributes for Amazon S3. You can choose either SSE_S3 (the default) or SSE_KMS . To use SSE_S3 , create an AWS Identity and Access Management (IAM) role with a policy that allows "arn:aws:s3:::*" to use the following actions: "s3:PutObject", "s3:ListBucket"

        • FileTransferUploadStreams (integer) --

          The number of threads used to upload a single file. This parameter accepts a value from 1 through 64. It defaults to 10.

        • LoadTimeout (integer) --

          The amount of time to wait (in milliseconds) before timing out, beginning from when you begin loading.

        • MaxFileSize (integer) --

          The maximum size (in KB) of any .csv file used to transfer data to Amazon Redshift. This accepts a value from 1 through 1,048,576. It defaults to 32,768 KB (32 MB).

        • Password (string) --

          The password for the user named in the username property.

        • Port (integer) --

          The port number for Amazon Redshift. The default value is 5439.

        • RemoveQuotes (boolean) --

          A value that specifies to remove surrounding quotation marks from strings in the incoming data. All characters within the quotation marks, including delimiters, are retained. Choose true to remove quotation marks. The default is false .

        • ReplaceInvalidChars (string) --

          A list of characters that you want to replace. Use with ReplaceChars .

        • ReplaceChars (string) --

          A value that specifies to replaces the invalid characters specified in ReplaceInvalidChars , substituting the specified characters instead. The default is "?" .

        • ServerName (string) --

          The name of the Amazon Redshift cluster you are using.

        • ServiceAccessRoleArn (string) --

          The Amazon Resource Name (ARN) of the IAM role that has access to the Amazon Redshift service.

        • ServerSideEncryptionKmsKeyId (string) --

          The AWS KMS key ID. If you are using SSE_KMS for the EncryptionMode , provide this key ID. The key that you use needs an attached policy that enables IAM user permissions and allows use of the key.

        • TimeFormat (string) --

          The time format that you want to use. Valid values are auto (case-sensitive), 'timeformat_string' , 'epochsecs' , or 'epochmillisecs' . It defaults to 10. Using auto recognizes most strings, even some that aren't supported when you use a time format string.

          If your date and time values use formats different from each other, set this parameter to auto .

        • TrimBlanks (boolean) --

          A value that specifies to remove the trailing white space characters from a VARCHAR string. This parameter applies only to columns with a VARCHAR data type. Choose true to remove unneeded white space. The default is false .

        • TruncateColumns (boolean) --

          A value that specifies to truncate data in columns to the appropriate number of characters, so that the data fits in the column. This parameter applies only to columns with a VARCHAR or CHAR data type, and rows with a size of 4 MB or less. Choose true to truncate data. The default is false .

        • Username (string) --

          An Amazon Redshift user name for a registered user.

        • WriteBufferSize (integer) --

          The size of the write buffer to use in rows. Valid values range from 1 through 2,048. The default is 1,024. Use this setting to tune performance.