java.lang.Object
com.amazonaws.services.machinelearning.model.S3DataSpec
All Implemented Interfaces:
Serializable, Cloneable

public class S3DataSpec extends Object implements Serializable, Cloneable

Describes the data specification of a DataSource.

  • Constructor Details

    • S3DataSpec

      public S3DataSpec()
  • Method Details

    • setDataLocationS3

      public void setDataLocationS3(String dataLocationS3)

      The location of the data file(s) used by a DataSource. The URI specifies a data file or an Amazon Simple Storage Service (Amazon S3) directory or bucket containing data files.

      Parameters:
      dataLocationS3 - The location of the data file(s) used by a DataSource. The URI specifies a data file or an Amazon Simple Storage Service (Amazon S3) directory or bucket containing data files.
    • getDataLocationS3

      public String getDataLocationS3()

      The location of the data file(s) used by a DataSource. The URI specifies a data file or an Amazon Simple Storage Service (Amazon S3) directory or bucket containing data files.

      Returns:
      The location of the data file(s) used by a DataSource. The URI specifies a data file or an Amazon Simple Storage Service (Amazon S3) directory or bucket containing data files.
    • withDataLocationS3

      public S3DataSpec withDataLocationS3(String dataLocationS3)

      The location of the data file(s) used by a DataSource. The URI specifies a data file or an Amazon Simple Storage Service (Amazon S3) directory or bucket containing data files.

      Parameters:
      dataLocationS3 - The location of the data file(s) used by a DataSource. The URI specifies a data file or an Amazon Simple Storage Service (Amazon S3) directory or bucket containing data files.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
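The setter and the fluent with method store the same value; the with variant additionally returns the object so that calls can be chained. The sketch below illustrates the difference using a small stand-in class, S3DataSpecSketch (a hypothetical name, not part of the SDK), that mirrors only the three signatures documented above so that it compiles without the AWS SDK on the classpath. The bucket name and path are placeholders.

```java
public class S3DataSpecSketch {
    // Hypothetical stand-in that mirrors the documented dataLocationS3 accessors.
    private String dataLocationS3;

    public void setDataLocationS3(String dataLocationS3) {
        this.dataLocationS3 = dataLocationS3;
    }

    public String getDataLocationS3() {
        return dataLocationS3;
    }

    // Fluent variant: stores the value and returns this object for chaining.
    public S3DataSpecSketch withDataLocationS3(String dataLocationS3) {
        setDataLocationS3(dataLocationS3);
        return this;
    }

    public static void main(String[] args) {
        // Setter style: construct first, then configure in a second statement.
        S3DataSpecSketch viaSetter = new S3DataSpecSketch();
        viaSetter.setDataLocationS3("s3://example-bucket/data/");

        // Fluent style: construct and configure in one expression.
        S3DataSpecSketch viaWith = new S3DataSpecSketch()
                .withDataLocationS3("s3://example-bucket/data/");

        // Both objects now hold the same location string.
        System.out.println(viaSetter.getDataLocationS3());
        System.out.println(viaWith.getDataLocationS3());
    }
}
```

With the SDK available, the same two call styles would apply to com.amazonaws.services.machinelearning.model.S3DataSpec itself.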
    • setDataRearrangement

      public void setDataRearrangement(String dataRearrangement)

      Describes the splitting requirement of a DataSource.

      Parameters:
      dataRearrangement - Describes the splitting requirement of a DataSource.
    • getDataRearrangement

      public String getDataRearrangement()

      Describes the splitting requirement of a DataSource.

      Returns:
      Describes the splitting requirement of a DataSource.
    • withDataRearrangement

      public S3DataSpec withDataRearrangement(String dataRearrangement)

      Describes the splitting requirement of a DataSource.

      Parameters:
      dataRearrangement - Describes the splitting requirement of a DataSource.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
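This page does not state the format of the dataRearrangement string. The sketch below assumes it is a JSON document with a "splitting" object holding "percentBegin" and "percentEnd" values; treat that shape as an assumption to verify against the Amazon Machine Learning documentation. The class and method names are hypothetical helpers, not part of the SDK.

```java
public class DataRearrangementSketch {
    // Builds a splitting description for the given percentage range.
    // ASSUMPTION: the JSON shape below is not confirmed by this page.
    static String splitting(int percentBegin, int percentEnd) {
        if (percentBegin < 0 || percentEnd > 100 || percentBegin >= percentEnd) {
            throw new IllegalArgumentException(
                    "expected 0 <= percentBegin < percentEnd <= 100");
        }
        return "{\"splitting\":{\"percentBegin\":" + percentBegin
                + ",\"percentEnd\":" + percentEnd + "}}";
    }

    public static void main(String[] args) {
        // First 70% of the data, for example for training.
        String training = splitting(0, 70);
        // Remaining 30%, for example for evaluation.
        String evaluation = splitting(70, 100);
        System.out.println(training);
        System.out.println(evaluation);
        // With the SDK on the classpath, a string like this would be passed as:
        // new S3DataSpec().withDataRearrangement(training);
    }
}
```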
    • setDataSchema

      public void setDataSchema(String dataSchema)

      A JSON string that represents the schema for an Amazon S3 DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.

      Define your DataSchema as a series of key-value pairs. The value of attributes is an array of key-value pair objects, one per field, and the value of excludedVariableNames is an array of field names. Use the following format to define your DataSchema.

      {
        "version": "1.0",
        "recordAnnotationFieldName": "F1",
        "recordWeightFieldName": "F2",
        "targetFieldName": "F3",
        "dataFormat": "CSV",
        "dataFileContainsHeader": true,
        "attributes": [
          { "fieldName": "F1", "fieldType": "TEXT" },
          { "fieldName": "F2", "fieldType": "NUMERIC" },
          { "fieldName": "F3", "fieldType": "CATEGORICAL" },
          { "fieldName": "F4", "fieldType": "NUMERIC" },
          { "fieldName": "F5", "fieldType": "CATEGORICAL" },
          { "fieldName": "F6", "fieldType": "TEXT" },
          { "fieldName": "F7", "fieldType": "WEIGHTED_INT_SEQUENCE" },
          { "fieldName": "F8", "fieldType": "WEIGHTED_STRING_SEQUENCE" }
        ],
        "excludedVariableNames": [ "F6" ]
      }

      Parameters:
      dataSchema - A JSON string that represents the schema for an Amazon S3 DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.


    • getDataSchema

      public String getDataSchema()

      A JSON string that represents the schema for an Amazon S3 DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.

      Define your DataSchema as a series of key-value pairs. The value of attributes is an array of key-value pair objects, one per field, and the value of excludedVariableNames is an array of field names. Use the following format to define your DataSchema.

      {
        "version": "1.0",
        "recordAnnotationFieldName": "F1",
        "recordWeightFieldName": "F2",
        "targetFieldName": "F3",
        "dataFormat": "CSV",
        "dataFileContainsHeader": true,
        "attributes": [
          { "fieldName": "F1", "fieldType": "TEXT" },
          { "fieldName": "F2", "fieldType": "NUMERIC" },
          { "fieldName": "F3", "fieldType": "CATEGORICAL" },
          { "fieldName": "F4", "fieldType": "NUMERIC" },
          { "fieldName": "F5", "fieldType": "CATEGORICAL" },
          { "fieldName": "F6", "fieldType": "TEXT" },
          { "fieldName": "F7", "fieldType": "WEIGHTED_INT_SEQUENCE" },
          { "fieldName": "F8", "fieldType": "WEIGHTED_STRING_SEQUENCE" }
        ],
        "excludedVariableNames": [ "F6" ]
      }

      Returns:
      A JSON string that represents the schema for an Amazon S3 DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.


    • withDataSchema

      public S3DataSpec withDataSchema(String dataSchema)

      A JSON string that represents the schema for an Amazon S3 DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.

      Define your DataSchema as a series of key-value pairs. The value of attributes is an array of key-value pair objects, one per field, and the value of excludedVariableNames is an array of field names. Use the following format to define your DataSchema.

      {
        "version": "1.0",
        "recordAnnotationFieldName": "F1",
        "recordWeightFieldName": "F2",
        "targetFieldName": "F3",
        "dataFormat": "CSV",
        "dataFileContainsHeader": true,
        "attributes": [
          { "fieldName": "F1", "fieldType": "TEXT" },
          { "fieldName": "F2", "fieldType": "NUMERIC" },
          { "fieldName": "F3", "fieldType": "CATEGORICAL" },
          { "fieldName": "F4", "fieldType": "NUMERIC" },
          { "fieldName": "F5", "fieldType": "CATEGORICAL" },
          { "fieldName": "F6", "fieldType": "TEXT" },
          { "fieldName": "F7", "fieldType": "WEIGHTED_INT_SEQUENCE" },
          { "fieldName": "F8", "fieldType": "WEIGHTED_STRING_SEQUENCE" }
        ],
        "excludedVariableNames": [ "F6" ]
      }

      Parameters:
      dataSchema - A JSON string that represents the schema for an Amazon S3 DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.


      Returns:
      Returns a reference to this object so that method calls can be chained together.
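Because the DataSchema is passed as a plain string, it can be assembled in code before calling setDataSchema or withDataSchema. The following is a minimal sketch that produces a string with the same keys as the format shown for these methods. The class name DataSchemaSketch and the method buildSchema are hypothetical helpers, not part of the SDK, and the sketch assumes that field names and types need no JSON escaping and that the data is CSV with a header row.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.StringJoiner;

public class DataSchemaSketch {
    // Builds a DataSchema JSON string from field definitions.
    // ASSUMPTION: names and types contain no characters that need JSON escaping.
    static String buildSchema(Map<String, String> fieldTypes, String annotationField,
            String weightField, String targetField, List<String> excluded) {
        // One { "fieldName": ..., "fieldType": ... } object per field, in map order.
        StringJoiner attributes = new StringJoiner(", ", "[ ", " ]");
        for (Map.Entry<String, String> e : fieldTypes.entrySet()) {
            attributes.add("{ \"fieldName\": \"" + e.getKey()
                    + "\", \"fieldType\": \"" + e.getValue() + "\" }");
        }
        // Excluded fields are listed by name only.
        StringJoiner excludedNames = new StringJoiner(", ", "[ ", " ]");
        for (String name : excluded) {
            excludedNames.add("\"" + name + "\"");
        }
        return "{ \"version\": \"1.0\", "
                + "\"recordAnnotationFieldName\": \"" + annotationField + "\", "
                + "\"recordWeightFieldName\": \"" + weightField + "\", "
                + "\"targetFieldName\": \"" + targetField + "\", "
                + "\"dataFormat\": \"CSV\", "
                + "\"dataFileContainsHeader\": true, "
                + "\"attributes\": " + attributes + ", "
                + "\"excludedVariableNames\": " + excludedNames + " }";
    }

    public static void main(String[] args) {
        // LinkedHashMap keeps the fields in insertion order.
        Map<String, String> fields = new LinkedHashMap<>();
        fields.put("F1", "TEXT");
        fields.put("F2", "NUMERIC");
        fields.put("F3", "CATEGORICAL");
        fields.put("F6", "TEXT");

        String schema = buildSchema(fields, "F1", "F2", "F3", Arrays.asList("F6"));
        System.out.println(schema);
        // With the SDK on the classpath, the string would be passed as:
        // new S3DataSpec().withDataSchema(schema);
    }
}
```

For anything beyond a sketch, a JSON library would be a safer choice than string concatenation, since it handles escaping of field names.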
    • setDataSchemaLocationS3

      public void setDataSchemaLocationS3(String dataSchemaLocationS3)

      Describes the schema location in Amazon S3.

      Parameters:
      dataSchemaLocationS3 - Describes the schema location in Amazon S3.
    • getDataSchemaLocationS3

      public String getDataSchemaLocationS3()

      Describes the schema location in Amazon S3.

      Returns:
      Describes the schema location in Amazon S3.
    • withDataSchemaLocationS3

      public S3DataSpec withDataSchemaLocationS3(String dataSchemaLocationS3)

      Describes the schema location in Amazon S3.

      Parameters:
      dataSchemaLocationS3 - Describes the schema location in Amazon S3.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • toString

      public String toString()
      Returns a string representation of this object; useful for testing and debugging.
      Overrides:
      toString in class Object
      Returns:
      A string representation of this object.
    • equals

      public boolean equals(Object obj)
      Overrides:
      equals in class Object
    • hashCode

      public int hashCode()
      Overrides:
      hashCode in class Object
    • clone

      public S3DataSpec clone()
      Overrides:
      clone in class Object