2023/11/09 - Amazon Omics - 6 updated api methods
Changes Support UBAM filetype for Omics Storage and make referenceArn optional
{'sourceFileType': {'UBAM'}}
Begins a multipart read set upload.
See also: AWS API Documentation
Request Syntax
client.create_multipart_read_set_upload(
sequenceStoreId='string',
clientToken='string',
sourceFileType='FASTQ'|'BAM'|'CRAM'|'UBAM',
subjectId='string',
sampleId='string',
generatedFrom='string',
referenceArn='string',
name='string',
description='string',
tags={
'string': 'string'
}
)
string
[REQUIRED]
The sequence store ID for the store that is the destination of the multipart uploads.
string
An idempotency token that can be used to avoid triggering multiple multipart uploads.
string
[REQUIRED]
The type of file being uploaded.
string
[REQUIRED]
The source's subject ID.
string
[REQUIRED]
The source's sample ID.
string
Where the source originated.
string
The ARN of the reference.
string
[REQUIRED]
The name of the read set.
string
The description of the read set.
dict
Any tags to add to the read set.
(string) --
(string) --
dict
Response Syntax
{
'sequenceStoreId': 'string',
'uploadId': 'string',
'sourceFileType': 'FASTQ'|'BAM'|'CRAM'|'UBAM',
'subjectId': 'string',
'sampleId': 'string',
'generatedFrom': 'string',
'referenceArn': 'string',
'name': 'string',
'description': 'string',
'tags': {
'string': 'string'
},
'creationTime': datetime(2015, 1, 1)
}
Response Structure
(dict) --
sequenceStoreId (string) --
The sequence store ID for the store that the read set will be created in.
uploadId (string) --
he ID for the initiated multipart upload.
sourceFileType (string) --
The file type of the read set source.
subjectId (string) --
The source's subject ID.
sampleId (string) --
The source's sample ID.
generatedFrom (string) --
The source of the read set.
referenceArn (string) --
The read set source's reference ARN.
name (string) --
The name of the read set.
description (string) --
The description of the read set.
tags (dict) --
The tags to add to the read set.
(string) --
(string) --
creationTime (datetime) --
The creation time of the multipart upload.
{'sources': {'sourceFileType': {'UBAM'}}}
Gets information about a read set import job.
See also: AWS API Documentation
Request Syntax
client.get_read_set_import_job(
id='string',
sequenceStoreId='string'
)
string
[REQUIRED]
The job's ID.
string
[REQUIRED]
The job's sequence store ID.
dict
Response Syntax
{
'id': 'string',
'sequenceStoreId': 'string',
'roleArn': 'string',
'status': 'SUBMITTED'|'IN_PROGRESS'|'CANCELLING'|'CANCELLED'|'FAILED'|'COMPLETED'|'COMPLETED_WITH_FAILURES',
'statusMessage': 'string',
'creationTime': datetime(2015, 1, 1),
'completionTime': datetime(2015, 1, 1),
'sources': [
{
'sourceFiles': {
'source1': 'string',
'source2': 'string'
},
'sourceFileType': 'FASTQ'|'BAM'|'CRAM'|'UBAM',
'status': 'NOT_STARTED'|'IN_PROGRESS'|'FINISHED'|'FAILED',
'statusMessage': 'string',
'subjectId': 'string',
'sampleId': 'string',
'generatedFrom': 'string',
'referenceArn': 'string',
'name': 'string',
'description': 'string',
'tags': {
'string': 'string'
}
},
]
}
Response Structure
(dict) --
id (string) --
The job's ID.
sequenceStoreId (string) --
The job's sequence store ID.
roleArn (string) --
The job's service role ARN.
status (string) --
The job's status.
statusMessage (string) --
The job's status message.
creationTime (datetime) --
When the job was created.
completionTime (datetime) --
When the job completed.
sources (list) --
The job's source files.
(dict) --
A source for an import read set job.
sourceFiles (dict) --
The source files' location in Amazon S3.
source1 (string) --
The location of the first file in Amazon S3.
source2 (string) --
The location of the second file in Amazon S3.
sourceFileType (string) --
The source's file type.
status (string) --
The source's status.
statusMessage (string) --
The source's status message.
subjectId (string) --
The source's subject ID.
sampleId (string) --
The source's sample ID.
generatedFrom (string) --
Where the source originated.
referenceArn (string) --
The source's genome reference ARN.
name (string) --
The source's name.
description (string) --
The source's description.
tags (dict) --
The source's tags.
(string) --
(string) --
{'fileType': {'UBAM'}}
Gets details about a read set.
See also: AWS API Documentation
Request Syntax
client.get_read_set_metadata(
id='string',
sequenceStoreId='string'
)
string
[REQUIRED]
The read set's ID.
string
[REQUIRED]
The read set's sequence store ID.
dict
Response Syntax
{
'id': 'string',
'arn': 'string',
'sequenceStoreId': 'string',
'subjectId': 'string',
'sampleId': 'string',
'status': 'ARCHIVED'|'ACTIVATING'|'ACTIVE'|'DELETING'|'DELETED'|'PROCESSING_UPLOAD'|'UPLOAD_FAILED',
'name': 'string',
'description': 'string',
'fileType': 'FASTQ'|'BAM'|'CRAM'|'UBAM',
'creationTime': datetime(2015, 1, 1),
'sequenceInformation': {
'totalReadCount': 123,
'totalBaseCount': 123,
'generatedFrom': 'string',
'alignment': 'string'
},
'referenceArn': 'string',
'files': {
'source1': {
'totalParts': 123,
'partSize': 123,
'contentLength': 123
},
'source2': {
'totalParts': 123,
'partSize': 123,
'contentLength': 123
},
'index': {
'totalParts': 123,
'partSize': 123,
'contentLength': 123
}
},
'statusMessage': 'string',
'creationType': 'IMPORT'|'UPLOAD',
'etag': {
'algorithm': 'FASTQ_MD5up'|'BAM_MD5up'|'CRAM_MD5up',
'source1': 'string',
'source2': 'string'
}
}
Response Structure
(dict) --
id (string) --
The read set's ID.
arn (string) --
The read set's ARN.
sequenceStoreId (string) --
The read set's sequence store ID.
subjectId (string) --
The read set's subject ID.
sampleId (string) --
The read set's sample ID.
status (string) --
The read set's status.
name (string) --
The read set's name.
description (string) --
The read set's description.
fileType (string) --
The read set's file type.
creationTime (datetime) --
When the read set was created.
sequenceInformation (dict) --
The read set's sequence information.
totalReadCount (integer) --
The sequence's total read count.
totalBaseCount (integer) --
The sequence's total base count.
generatedFrom (string) --
Where the sequence originated.
alignment (string) --
The sequence's alignment setting.
referenceArn (string) --
The read set's genome reference ARN.
files (dict) --
The read set's files.
source1 (dict) --
The location of the first file in Amazon S3.
totalParts (integer) --
The file's total parts.
partSize (integer) --
The file's part size.
contentLength (integer) --
The file's content length.
source2 (dict) --
The location of the second file in Amazon S3.
totalParts (integer) --
The file's total parts.
partSize (integer) --
The file's part size.
contentLength (integer) --
The file's content length.
index (dict) --
The files' index.
totalParts (integer) --
The file's total parts.
partSize (integer) --
The file's part size.
contentLength (integer) --
The file's content length.
statusMessage (string) --
The status message for a read set. It provides more detail as to why the read set has a status.
creationType (string) --
The creation type of the read set.
etag (dict) --
The entity tag (ETag) is a hash of the object meant to represent its semantic content.
algorithm (string) --
The algorithm used to calculate the read set’s ETag(s).
source1 (string) --
The ETag hash calculated on Source1 of the read set.
source2 (string) --
The ETag hash calculated on Source2 of the read set.
{'uploads': {'sourceFileType': {'UBAM'}}}
Lists all multipart read set uploads and their statuses.
See also: AWS API Documentation
Request Syntax
client.list_multipart_read_set_uploads(
sequenceStoreId='string',
maxResults=123,
nextToken='string'
)
string
[REQUIRED]
The Sequence Store ID used for the multipart uploads.
integer
The maximum number of multipart uploads returned in a page.
string
Next token returned in the response of a previous ListMultipartReadSetUploads call. Used to get the next page of results.
dict
Response Syntax
{
'nextToken': 'string',
'uploads': [
{
'sequenceStoreId': 'string',
'uploadId': 'string',
'sourceFileType': 'FASTQ'|'BAM'|'CRAM'|'UBAM',
'subjectId': 'string',
'sampleId': 'string',
'generatedFrom': 'string',
'referenceArn': 'string',
'name': 'string',
'description': 'string',
'tags': {
'string': 'string'
},
'creationTime': datetime(2015, 1, 1)
},
]
}
Response Structure
(dict) --
nextToken (string) --
Next token returned in the response of a previous ListMultipartReadSetUploads call. Used to get the next page of results.
uploads (list) --
An array of multipart uploads.
(dict) --
Part of the response to ListMultipartReadSetUploads, excluding completed and aborted multipart uploads.
sequenceStoreId (string) --
The sequence store ID used for the multipart upload.
uploadId (string) --
The ID for the initiated multipart upload.
sourceFileType (string) --
The type of file the read set originated from.
subjectId (string) --
The read set source's subject ID.
sampleId (string) --
The read set source's sample ID.
generatedFrom (string) --
The source of an uploaded part.
referenceArn (string) --
The source's reference ARN.
name (string) --
The name of a read set.
description (string) --
The description of a read set.
tags (dict) --
Any tags you wish to add to a read set.
(string) --
(string) --
creationTime (datetime) --
The time stamp for when a direct upload was created.
{'readSets': {'fileType': {'UBAM'}}}
Retrieves a list of read sets.
See also: AWS API Documentation
Request Syntax
client.list_read_sets(
sequenceStoreId='string',
maxResults=123,
nextToken='string',
filter={
'name': 'string',
'status': 'ARCHIVED'|'ACTIVATING'|'ACTIVE'|'DELETING'|'DELETED'|'PROCESSING_UPLOAD'|'UPLOAD_FAILED',
'referenceArn': 'string',
'createdAfter': datetime(2015, 1, 1),
'createdBefore': datetime(2015, 1, 1),
'sampleId': 'string',
'subjectId': 'string',
'generatedFrom': 'string',
'creationType': 'IMPORT'|'UPLOAD'
}
)
string
[REQUIRED]
The jobs' sequence store ID.
integer
The maximum number of read sets to return in one page of results.
string
Specify the pagination token from a previous request to retrieve the next page of results.
dict
A filter to apply to the list.
name (string) --
A name to filter on.
status (string) --
A status to filter on.
referenceArn (string) --
A genome reference ARN to filter on.
createdAfter (datetime) --
The filter's start date.
createdBefore (datetime) --
The filter's end date.
sampleId (string) --
The read set source's sample ID.
subjectId (string) --
The read set source's subject ID.
generatedFrom (string) --
Where the source originated.
creationType (string) --
The creation type of the read set.
dict
Response Syntax
{
'nextToken': 'string',
'readSets': [
{
'id': 'string',
'arn': 'string',
'sequenceStoreId': 'string',
'subjectId': 'string',
'sampleId': 'string',
'status': 'ARCHIVED'|'ACTIVATING'|'ACTIVE'|'DELETING'|'DELETED'|'PROCESSING_UPLOAD'|'UPLOAD_FAILED',
'name': 'string',
'description': 'string',
'referenceArn': 'string',
'fileType': 'FASTQ'|'BAM'|'CRAM'|'UBAM',
'sequenceInformation': {
'totalReadCount': 123,
'totalBaseCount': 123,
'generatedFrom': 'string',
'alignment': 'string'
},
'creationTime': datetime(2015, 1, 1),
'statusMessage': 'string',
'creationType': 'IMPORT'|'UPLOAD',
'etag': {
'algorithm': 'FASTQ_MD5up'|'BAM_MD5up'|'CRAM_MD5up',
'source1': 'string',
'source2': 'string'
}
},
]
}
Response Structure
(dict) --
nextToken (string) --
A pagination token that's included if more results are available.
readSets (list) --
A list of read sets.
(dict) --
A read set.
id (string) --
The read set's ID.
arn (string) --
The read set's ARN.
sequenceStoreId (string) --
The read set's sequence store ID.
subjectId (string) --
The read set's subject ID.
sampleId (string) --
The read set's sample ID.
status (string) --
The read set's status.
name (string) --
The read set's name.
description (string) --
The read set's description.
referenceArn (string) --
The read set's genome reference ARN.
fileType (string) --
The read set's file type.
sequenceInformation (dict) --
Details about a sequence.
totalReadCount (integer) --
The sequence's total read count.
totalBaseCount (integer) --
The sequence's total base count.
generatedFrom (string) --
Where the sequence originated.
alignment (string) --
The sequence's alignment setting.
creationTime (datetime) --
When the read set was created.
statusMessage (string) --
The status for a read set. It provides more detail as to why the read set has a status.
creationType (string) --
The creation type of the read set.
etag (dict) --
The entity tag (ETag) is a hash of the object representing its semantic content.
algorithm (string) --
The algorithm used to calculate the read set’s ETag(s).
source1 (string) --
The ETag hash calculated on Source1 of the read set.
source2 (string) --
The ETag hash calculated on Source2 of the read set.
{'sources': {'sourceFileType': {'UBAM'}}}
Starts a read set import job.
See also: AWS API Documentation
Request Syntax
client.start_read_set_import_job(
sequenceStoreId='string',
roleArn='string',
clientToken='string',
sources=[
{
'sourceFiles': {
'source1': 'string',
'source2': 'string'
},
'sourceFileType': 'FASTQ'|'BAM'|'CRAM'|'UBAM',
'subjectId': 'string',
'sampleId': 'string',
'generatedFrom': 'string',
'referenceArn': 'string',
'name': 'string',
'description': 'string',
'tags': {
'string': 'string'
}
},
]
)
string
[REQUIRED]
The read set's sequence store ID.
string
[REQUIRED]
A service role for the job.
string
To ensure that jobs don't run multiple times, specify a unique token for each job.
list
[REQUIRED]
The job's source files.
(dict) --
A source for a read set import job.
sourceFiles (dict) -- [REQUIRED]
The source files' location in Amazon S3.
source1 (string) -- [REQUIRED]
The location of the first file in Amazon S3.
source2 (string) --
The location of the second file in Amazon S3.
sourceFileType (string) -- [REQUIRED]
The source's file type.
subjectId (string) -- [REQUIRED]
The source's subject ID.
sampleId (string) -- [REQUIRED]
The source's sample ID.
generatedFrom (string) --
Where the source originated.
referenceArn (string) --
The source's reference ARN.
name (string) --
The source's name.
description (string) --
The source's description.
tags (dict) --
The source's tags.
(string) --
(string) --
dict
Response Syntax
{
'id': 'string',
'sequenceStoreId': 'string',
'roleArn': 'string',
'status': 'SUBMITTED'|'IN_PROGRESS'|'CANCELLING'|'CANCELLED'|'FAILED'|'COMPLETED'|'COMPLETED_WITH_FAILURES',
'creationTime': datetime(2015, 1, 1)
}
Response Structure
(dict) --
id (string) --
The job's ID.
sequenceStoreId (string) --
The read set's sequence store ID.
roleArn (string) --
The job's service role ARN.
status (string) --
The job's status.
creationTime (datetime) --
When the job was created.