start_ml_model_transform_job

NeptuneData.Client.start_ml_model_transform_job(**kwargs)

Creates a new model transform job. For details, see "Use a trained model to generate new model artifacts".

When invoking this operation in a Neptune cluster that has IAM authentication enabled, the IAM user or role making the request must have a policy attached that allows the neptune-db:StartMLModelTransformJob IAM action in that cluster.
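As an illustration of the permission described above, the following sketch builds a policy document that allows the action on a single cluster. The helper name transform_job_policy, the Region, the account ID, and the cluster resource ID are hypothetical placeholders, and the resource ARN layout is an assumption that should be checked against the Neptune IAM documentation for your cluster.

```python
import json


def transform_job_policy(region, account_id, cluster_resource_id):
    """Build an IAM policy document allowing StartMLModelTransformJob.

    The resource ARN layout below is an assumption; verify it against the
    Neptune IAM documentation before use.
    """
    resource = f"arn:aws:neptune-db:{region}:{account_id}:{cluster_resource_id}/*"
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                "Action": "neptune-db:StartMLModelTransformJob",
                "Resource": resource,
            }
        ],
    }


# Placeholder values for illustration only.
policy = transform_job_policy("us-east-1", "123456789012", "cluster-EXAMPLERESOURCEID")
print(json.dumps(policy, indent=2))
```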

See also: AWS API Documentation

Request Syntax

response = client.start_ml_model_transform_job(
    id='string',
    dataProcessingJobId='string',
    mlModelTrainingJobId='string',
    trainingJobName='string',
    modelTransformOutputS3Location='string',
    sagemakerIamRoleArn='string',
    neptuneIamRoleArn='string',
    customModelTransformParameters={
        'sourceS3DirectoryPath': 'string',
        'transformEntryPointScript': 'string'
    },
    baseProcessingInstanceType='string',
    baseProcessingInstanceVolumeSizeInGB=123,
    subnets=[
        'string',
    ],
    securityGroupIds=[
        'string',
    ],
    volumeEncryptionKMSKey='string',
    s3OutputEncryptionKMSKey='string'
)
Parameters:
  • id (string) – A unique identifier for the new job. The default is an autogenerated UUID.

  • dataProcessingJobId (string) – The job ID of a completed data-processing job. You must include either both dataProcessingJobId and mlModelTrainingJobId, or trainingJobName.

  • mlModelTrainingJobId (string) – The job ID of a completed model-training job. You must include either both dataProcessingJobId and mlModelTrainingJobId, or trainingJobName.

  • trainingJobName (string) – The name of a completed SageMaker training job. You must include either both dataProcessingJobId and mlModelTrainingJobId, or trainingJobName.

  • modelTransformOutputS3Location (string) –

    [REQUIRED]

    The location in Amazon S3 where the model artifacts are to be stored.

  • sagemakerIamRoleArn (string) – The ARN of an IAM role for SageMaker execution. This role must be listed in your DB cluster parameter group or an error will occur.

  • neptuneIamRoleArn (string) – The ARN of an IAM role that provides Neptune access to SageMaker and Amazon S3 resources. This role must be listed in your DB cluster parameter group or an error will occur.

  • customModelTransformParameters (dict) –

    Configuration information for a model transform using a custom model. The customModelTransformParameters object contains the following fields, which must have values compatible with the saved model parameters from the training job:

    • sourceS3DirectoryPath (string) – [REQUIRED]

      The path to the Amazon S3 location where the Python module implementing your model is located. This must point to a valid existing Amazon S3 location that contains, at a minimum, a training script, a transform script, and a model-hpo-configuration.json file.

    • transformEntryPointScript (string) –

      The name of the entry-point script in your module to run after the best model from the hyperparameter search has been identified. The script computes the model artifacts necessary for model deployment and should be able to run with no command-line arguments. The default is transform.py.

  • baseProcessingInstanceType (string) – The type of ML instance used in preparing and managing training of ML models. This is an ML compute instance chosen based on memory requirements for processing the training data and model.

  • baseProcessingInstanceVolumeSizeInGB (integer) – The disk volume size of the training instance in gigabytes. The default is 0. Both input data and the output model are stored on disk, so the volume size must be large enough to hold both data sets. If not specified or 0, Neptune ML selects a disk volume size based on the recommendation generated in the data processing step.

  • subnets (list) –

    The IDs of the subnets in the Neptune VPC. The default is None.

    • (string) –

  • securityGroupIds (list) –

    The VPC security group IDs. The default is None.

    • (string) –

  • volumeEncryptionKMSKey (string) – The Amazon Key Management Service (KMS) key that SageMaker uses to encrypt data on the storage volume attached to the ML compute instances that run the training job. The default is None.

  • s3OutputEncryptionKMSKey (string) – The Amazon Key Management Service (KMS) key that SageMaker uses to encrypt the output of the processing job. The default is None.
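The parameter rules above (a required modelTransformOutputS3Location, plus either both job IDs or a training job name) can be checked before the request is sent. The following is a minimal sketch, not part of the SDK: the helper build_transform_job_kwargs and all example values are hypothetical. The documentation does not say whether the two forms may be combined, so the sketch only rejects the case where neither form is complete.

```python
def build_transform_job_kwargs(
    output_s3_location,
    data_processing_job_id=None,
    ml_model_training_job_id=None,
    training_job_name=None,
    **other_params,
):
    """Assemble keyword arguments for start_ml_model_transform_job.

    Hypothetical helper: checks that either both job IDs or a training job
    name is present, then drops any parameter left as None.
    """
    has_both_job_ids = bool(data_processing_job_id) and bool(ml_model_training_job_id)
    if not (has_both_job_ids or training_job_name):
        raise ValueError(
            "Provide both dataProcessingJobId and mlModelTrainingJobId, "
            "or trainingJobName."
        )
    candidates = {
        "modelTransformOutputS3Location": output_s3_location,
        "dataProcessingJobId": data_processing_job_id,
        "mlModelTrainingJobId": ml_model_training_job_id,
        "trainingJobName": training_job_name,
        **other_params,  # any other request parameters, keyed by their API names
    }
    return {name: value for name, value in candidates.items() if value is not None}


# Placeholder values for illustration only.
example_kwargs = build_transform_job_kwargs(
    "s3://example-bucket/transform-output/",
    data_processing_job_id="example-processing-job",
    ml_model_training_job_id="example-training-job",
    subnets=None,
)
```

The resulting dictionary can then be passed to the operation as client.start_ml_model_transform_job(**example_kwargs), where client is a NeptuneData client created for your cluster endpoint.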

Return type:

dict

Returns:

Response Syntax

{
    'id': 'string',
    'arn': 'string',
    'creationTimeInMillis': 123
}

Response Structure

  • (dict) –

    • id (string) –

      The unique ID of the new model transform job.

    • arn (string) –

      The ARN of the model transform job.

    • creationTimeInMillis (integer) –

      The creation time of the model transform job, in milliseconds.
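As a small sketch of how the response fields might be consumed, the snippet below converts creationTimeInMillis to a timezone-aware datetime. It assumes the value counts milliseconds since the Unix epoch, which the description above does not state explicitly. The helper job_created_at and the sample response values are hypothetical.

```python
from datetime import datetime, timezone


def job_created_at(response):
    """Convert creationTimeInMillis to a UTC datetime.

    Assumes the value is milliseconds since the Unix epoch.
    """
    return datetime.fromtimestamp(response["creationTimeInMillis"] / 1000, tz=timezone.utc)


# Hand-written sample with placeholder values, not real service output.
sample_response = {
    "id": "example-job-id",
    "arn": "example-arn",
    "creationTimeInMillis": 1700000000000,
}
created = job_created_at(sample_response)
print(sample_response["id"], created.isoformat())
```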

Exceptions