An honest review of AWS DataSync
How DataSync works (AWS DataSync Service Website)

An honest review of AWS DataSync

To start off with, its a great service if you want to continuously sync or a one time copy of data from one location to another. Following location types are supported.

1 - Amazon EFS file System

2 - Amazon FSx

3 - Amazon S3

4 - Hadoop Distributed File System (HDFS)

5 - Network File System (NFS)

6 - Object storage

7 - Server Message Block (SMB)

In my scenario, I had to copy data from an on-premises SMB share to a S3 bucket residing in a separate account than the one where DataSync service was commissioned.

I had a DataSync appliance deployed as an EC2 instance in a VPC with no direct internet access. Deploying it in a VPC meant I could use our DirectConnect connection to access on-premises storage. I also had a VPC endpoint configured for DataSync service to avoid sending data through the public internet. Even though in this case it would have been AWS's own network.?

First decision to be made was how to activate the agent on EC2 instance. The choice was either a manual activation by logging into the instance, generating an activation key and registering the agent, or create the agent via DataSync console and point to the instance which automatically activates it. Later option was the way to go, manually generating an activation key doesn't seem to work that smoothly.

Second step was to configure locations, both source and destination. Since I was planning to copy the data to a S3 bucket residing in a different account, I needed to configure it via CLI as the console only lets you choose buckets in the local account. However, this didn't work either. Apparently the only scenario it works for is when you are not using an appliance and configuring DataSync to copy data directly from one S3 bucket to another one in a different account. I gave up on that and decided to use a local S3 bucket and then use S3 sync feature to copy data to the destination bucket in the other account.

Now came the time to configure on-premises SMB share as a location. Location for SMB share was created smoothly but mounting of this share only happens when you go to create a task for data copy and if unsuccessful you are greeted with an error message. Which after some googling I found was either because of the share name, mine had a $ sign in it, or because of SMB version.?I tried different option combinations but none of them worked, including the use of an escape character "\" for $. While going through these trials, I realised how great it is to have AWS CloudShell where you can use AWS CLI to keep repeating these different options without developing a cramp in your hand from repeated ClickOps. Plus there is no need to assume roles or specify AWS profiles like you do from your local machine if you were using AWS CLI.

In the end what worked was to reconfigure on-premises share without $, after that everything was smooth sailing. However, following enhancements/fixes would be very welcomed.

1 - Use of $ in share names. At least allow escape characters when creating locations

2 - Copying data to S3 buckets in different accounts

3 - Bit more logging and ability to test mount shares from within the appliance. I had to test mine by launching a separate instance in the same VPC

4 - Mount SMB shares as soon as the location is created and not when a task is. At least the users wouldn't have to keep re-creating tasks just to see if the share is mounting successfully


Kashif Razzaq

ICT Infrastructure Specialist | Cloud & Virtualization | DevSecOps | Automation & IaC | System Design & Architect

3 年

That’s helpful. Thanks for sharing it Imran!

要查看或添加评论,请登录

Imran Sadiq的更多文章

  • My Shortlist Of re:Invent 2021 Announcements

    My Shortlist Of re:Invent 2021 Announcements

    AWS is in its third wave of services evaluation. First wave was the base services e.

    4 条评论
  • What does it mean to have a local AWS region

    What does it mean to have a local AWS region

    With the latest announcement from AWS on opening of their new Region in Auckland, I thought I should share some…

    6 条评论
  • Web client for AWS SFTP

    Web client for AWS SFTP

    FTP has been around for a long time and still is a strong contender when it comes to transferring data between ad-hoc…

    2 条评论
  • AWS's static stability and recent outage

    AWS's static stability and recent outage

    On January 22nd 2020 between 4:07 PM and 11:20PM PST, you could not create new resources in a VPC for AWS Sydney…

  • Eventful Days in Seattle

    Eventful Days in Seattle

    Sitting here in Starbucks Reserve (one of the best places to visit if you area coffee lover), I am going over the last…

    5 条评论
  • Getting to know AWS Control Tower

    Getting to know AWS Control Tower

    I first attended training on Landing Zones back in August 2018 when it was introduced to APN partners. My immediate…

    2 条评论
  • Recap from AWS Sydney Summit

    Recap from AWS Sydney Summit

    Since 2018, AWS has stepped up its game when it comes to hosting a tech conference in Southern hemisphere. AWS Sydney…

    7 条评论
  • Lancom Tech Talk: How to deploy S3 Static Websites to Test, UAT, or Production AWS Accounts from CodePipeline

    Lancom Tech Talk: How to deploy S3 Static Websites to Test, UAT, or Production AWS Accounts from CodePipeline

    In this blog post, I will demonstrate how to create a continuous deployment pipeline for Static Website deployment into…

    1 条评论
  • Why I abandoned Facebook...

    Why I abandoned Facebook...

    I abandoned Facebook couple of years back. Well, almost abandoned it.

    9 条评论
  • Off to re:Invent

    Off to re:Invent

    Its that time of the year again for me to pack my bags and head off to #re:Invent. It is perhaps the largest global…

    3 条评论

社区洞察

其他会员也浏览了