DataSync
💡 Definition
AWS DataSync is a secure, online service that automates and accelerates moving data between on-premises storage systems and AWS storage services, as well as between AWS storage services.
🔑 Key Concepts
- Online Transfer: Transfers data over the network (Internet or Direct Connect).
- Automation: Handles scheduling, encryption, verification, and network optimization.
- Agent-Based: Uses a software agent deployed in your on-premises environment; transfers between AWS storage services do not need an agent.
- Destinations: Can transfer to/from S3, EFS, FSx, and Object Storage on Snowball.
⚙️ How it Works
- Deploy Agent: Install the DataSync agent on a VM in your data center.
- Connect: Point the agent at your local storage (for example, an NFS or SMB share) as the source location, and configure your AWS storage as the destination location.
- Task: Create a task to define what data to copy and when.
- Run: DataSync copies the data, validating it along the way.
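The steps above can be sketched in Python. This is a minimal sketch, not a definitive implementation: it assumes the agent is already deployed and activated, and that `client` is a DataSync client such as the one returned by `boto3.client("datasync")`. The function name, the task name, the nightly schedule, and all ARNs and hostnames passed in are hypothetical placeholders.

```python
def create_and_run_task(client, agent_arn, nfs_host, nfs_path, bucket_arn, role_arn):
    """Create source and destination locations, define a task, and start one run.

    `client` is assumed to expose the DataSync API operations used below.
    """
    # Connect (source): an on-premises NFS export reached through the agent.
    source = client.create_location_nfs(
        ServerHostname=nfs_host,
        Subdirectory=nfs_path,
        OnPremConfig={"AgentArns": [agent_arn]},
    )

    # Connect (destination): an S3 bucket accessed through an IAM role.
    destination = client.create_location_s3(
        S3BucketArn=bucket_arn,
        S3Config={"BucketAccessRoleArn": role_arn},
    )

    # Task: what to copy (source to destination) and when (assumed 02:00 daily).
    task = client.create_task(
        SourceLocationArn=source["LocationArn"],
        DestinationLocationArn=destination["LocationArn"],
        Name="nightly-nfs-to-s3",  # hypothetical task name
        Schedule={"ScheduleExpression": "cron(0 2 * * ? *)"},
        Options={"VerifyMode": "ONLY_FILES_TRANSFERRED"},
    )

    # Run: start an execution immediately instead of waiting for the schedule.
    execution = client.start_task_execution(TaskArn=task["TaskArn"])
    return execution["TaskExecutionArn"]
```

Taking the client as a parameter keeps the sketch free of credentials and region settings; in practice you would pass in a real client and the ARN of your own activated agent.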
🎯 Use Cases
- Migration: Online migration of active data sets to the cloud.
- Archiving: Moving cold data to cloud storage.
- Replication: Replicating data for business continuity or processing in the cloud.
- Transfer between AWS Services: e.g., moving data from EFS to S3.
💰 Pricing Model
- Data Transfer: Charged a flat fee per GB of data moved.
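A per-GB fee makes cost estimation a single multiplication. The sketch below illustrates the arithmetic only: the rate constant is an assumed placeholder, not a quoted AWS price, and the function name is hypothetical. Charges from other services involved in the transfer (for example, storage or requests on the destination) would be billed separately and are not modeled here.

```python
def estimate_transfer_cost(gigabytes, rate_per_gb):
    """Return the data-transfer charge for a flat per-GB rate."""
    if gigabytes < 0 or rate_per_gb < 0:
        raise ValueError("gigabytes and rate_per_gb must be non-negative")
    return gigabytes * rate_per_gb


# Assumed placeholder rate in USD per GB; check current pricing for your region.
ASSUMED_RATE_USD_PER_GB = 0.0125

cost = estimate_transfer_cost(5000, ASSUMED_RATE_USD_PER_GB)
print(f"Estimated DataSync charge for 5,000 GB: {cost:.2f} USD")
```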
📝 Exam Tips (CLF-C02)
- Online data transfer (vs. Snow Family, which is offline).
- Automated and accelerated (up to 10x faster than open-source tools).
- Uses an agent for on-premises storage (no agent needed between AWS storage services).
See Also
- Snow Family
- Transfer Family
- Storage Gateway