How to Integrate Amazon S3 with Daton
Learn how to seamlessly connect and integrate Amazon S3 with Daton, enabling efficient data transfer and storage, and unlocking the full potential of your data management capabilities.
How to Integrate Amazon S3 with Daton
Learn how to seamlessly connect and integrate Amazon S3 with Daton, enabling efficient data transfer and storage, and unlocking the full potential of your data management capabilities.
Task 1: Create an Access Key and Secret Key
- Navigate to IAM and click on the user who has Read access to the S3 buckets where files to be replicated are located.
- Go to Security credentials and click on Create access key.
Save this Access Key and Secret Key for future use during Daton Integration.
Task 2: Select the File Path
- Navigate to your S3 buckets where files for replication reside
- Click on the Copy S3 URI button.
- S3 URI copied will have the following format -
“s3://amazon-rpa/OutputFiles/developer/Sales and Traffic by ASIN/”
- Remove the
s3://
from the URL and enter the rest of the string in the Daton UI. - Select the row number in the file that has the column names.
Task 3: Select the schema
Daton auto-detects the schema from the folder and extracts the column names and infers the data types.
Override the Daton detected values if necessary to control the schema in the warehouse.
Task 4: Integrate Amazon S3 with Daton
- Sign in to Daton.
- Select Amazon S3 from the list of Integrations.
- Provide Integration Name and Replication Frequency. The integration name would be used in creating tables for the integration and cannot be changed later.
- Provide AccessKey, SecretKey, and the AWS region where files are located.
- Post successful authentication, you will be prompted to enter a folder path.
- Select the file type - Daton supports CSV and XLS formats.
- Enter the row number where the column names (headers) are present.
- Then select the required fields for each table.
- Overwrite the column names and update any data types as required
- Submit the integration.
Workflow
- Integrations would be in the Pending state initially and will be moved to the Active state as soon as the first job loads data successfully into the configured warehouse.
- Users would be able to edit/pause/re-activate/delete integration anytime.
- Users can view job status and process logs from the integration details page by clicking on the integration name from the active list.