Introduction
This section provides best practice guidance and tools to migrate data processing applications from self-managed environments to Amazon EMR.
-
EMR Migration Guide : This is a comprehensive technical document that provides guidance for migrating various components including data, application, security configurations etc from self-managed data processing applictions to Amazon EMR
-
Data Migration: We recommend using AWS Datasync for migrating HDFS to S3. Start with this Data Sync support for HDFS blog to review Datasync capabilities and how to get started with Data migrations
-
Data pipelines Migrations: The following tools can be useful in migrating your current data pipelines to AWS
- Oozie to MWAA
- Oozie to stepfunctions
-
Data Governance: The following tools can helpful in migrating your current data catalogs to AWS
For further assistance reach out to aws-bdms-emr@amazon.com