classSageMakerProtobufFileFormat extends FileFormat with DataSourceRegister
A Spark FileFormat for serializing Dataframes of labeled vectors to the Amazon Record
protobuf file format encoded in RecordIO.
To write records from a DataFrame in this file format, run
dataframe.save
.format("sagemaker")
.option("labelColumnName", "myLabelColumn")
.option("featuresColumnName", "myFeaturesColumn")
.save("my_output_path")
A Spark FileFormat for serializing Dataframes of labeled vectors to the Amazon Record protobuf file format encoded in RecordIO.
To write records from a DataFrame in this file format, run
dataframe.save .format("sagemaker") .option("labelColumnName", "myLabelColumn") .option("featuresColumnName", "myFeaturesColumn") .save("my_output_path")
https://mxnet.incubator.apache.org/architecture/note_data_loading.html for more information on recordIO
https://aws.amazon.com/sagemaker/latest/dg/cdf-training.html/ for more information on the Amazon Record data format.