On-Premises Database Replication (DMS)
On-Premises Database Replication uses AWS Database Migration Service (DMS) to continuously replicate data from your on-premises databases into your BEEM cloud warehouse. This is ideal for organizations with data that cannot be accessed through standard API connectors.
Availability
On-Premises Database Replication is available on the Enterprise plan, or as a paid add-on for other tiers.
Step 1: Identify Your Source Database
- Determine which on-premises database you need to replicate:
- SQL Server
- PostgreSQL
- MySQL
- Oracle
- Other supported databases
- Note the database version, size, and network location
Step 2: Set Up Network Connectivity
- A VPN connection between your network and BEEM is typically required (see the VPN article)
- Alternatively, a Database Bridge or Cloud Connect can provide access
- Ensure the source database allows connections from the BEEM environment
- Work with your network team to open the required ports
Step 3: Contact BEEM for Configuration
- Contact support@beemdata.com or your BEEM account manager
- Provide:
- Source database type and version
- Tables or schemas to replicate
- Network connectivity method (VPN, Database Bridge, etc.)
- Desired replication frequency
- The BEEM team configures the DMS replication task
Step 4: Validate the Replication
- Monitor the initial full load in the Monitor section
- Verify data in the Datalake once the initial sync completes
- Check that ongoing change data capture (CDC) is working for near-real-time updates
How It Works
- Full Load: Initial complete copy of selected tables
- Change Data Capture (CDC): Ongoing replication of inserts, updates, and deletes
- Data lands in your dedicated BEEM warehouse, ready for transformation in the Warehouse
- Automated monitoring detects and alerts on replication issues
Key Details
- Requires network connectivity (VPN, Database Bridge, or Cloud Connect)
- Supports most major relational databases
- DMS consumption is tracked as part of your DPU usage
- The BEEM team manages the replication infrastructure