Data Integration

Use Redis Data Integration with Redis Cloud.

Redis Cloud

Redis Cloud now supports Redis Data Integration (RDI), a fast and simple way to bring your data into Redis from other types of primary databases.

A relational database usually handles queries much more slowly than a Redis database. If your application uses a relational database and makes many more reads than writes (which is the typical case) then you can improve performance by using Redis as a cache to handle the read queries quickly. Redis Cloud uses ingest to help you offload all read queries from the application database to Redis automatically.

Using a data pipeline lets you have a cache that is always ready for queries. RDI Data pipelines ensure that any changes made to your primary database are captured in your Redis cache within a few seconds, preventing cache misses and stale data within the cache.

RDI helps Redis customers sync Redis Cloud with live data from their primary databases to:

  • Meet the required speed and scale of read queries and provide an excellent and predictable user experience.
  • Save resources and time when building pipelines and coding data transformations.
  • Reduce the total cost of ownership by saving money on expensive database read replicas.

Using RDI with Redis Cloud simplifies managing your data integration pipeline. No need to worry about hardware or underlying infrastructure, as Redis Cloud manages that for you. Creating the data flow from source to target is much easier, and there are validations in place to reduce errors.

Data pipeline architecture

An RDI data pipeline sits between your source database and your target Redis database. Initially, the pipeline reads all of the data and imports it into the target database during the initial sync phase. After this initial sync is complete, the data pipeline enters the streaming phase, where changes are captured as they happen. Changes in the source database are added to the target within a few seconds of capture. The data pipeline translates relational database rows to Redis hashes or JSON documents.

For more info on how RDI works, see RDI Architecture.

Pipeline security

Data pipelines are set up to ensure a high level of data security. Source database credentials and TLS secrets are stored in AWS secret manager and shared using the Kubernetes CSI driver for secrets. See Share source database credentials to learn how to share your source database credentials and TLS certificates with Redis Cloud.

Connections to the source database use Java Database Connectivity (JDBC) through AWS PrivateLink, ensuring that the data pipeline is only exposed to the specific database endpoint. See Set up connectivity to learn how to connect your PrivateLink to the Redis Cloud VPC.

RDI encrypts all network connections with TLS. The pipeline will process data from the source database in-memory and write it to the target database using a TLS connection. There are no external connections to your data pipeline except from Redis Cloud management services.

Prerequisites

Before you can create a data pipeline, you must have:

  • A Redis Cloud Pro database hosted on Amazon Web Services (AWS). This will be the target database.
  • One supported source database, hosted on an AWS EC2 instance, AWS RDS, or AWS Aurora:
Database Versions AWS RDS Versions
Oracle 19c, 21c 19c, 21c
MariaDB 10.5, 11.4.3 10.4 to 10.11, 11.4.3
MySQL 5.7, 8.0.x, 8.2 8.0.x
PostgreSQL 10, 11, 12, 13, 14, 15, 16 11, 12, 13, 14, 15, 16
AWS Aurora PostgreSQL 15 15
SQL Server 2017, 2019, 2022 2016, 2017, 2019, 2022
Note:

Please be aware of the following limitations:

  • The target database must be a Redis Cloud Pro database hosted on Amazon Web Services (AWS). Redis Cloud Essentials databases and databases hosted on Google Cloud do not support Data Integration.
  • The target database must use multi-zone high availability.
  • The target database can use TLS, but can not use mutual TLS.
  • The target database cannot be in the same subscription as another database that has a data pipeline.
  • Source databases must also be hosted on AWS.
  • You must use a custom encryption key on AWS to create the instance hosting the database.
  • One source database can only be synced to one target database.
  • You must be able to set up AWS PrivateLink to connect your source database to your target database. RDI only works with AWS PrivateLink and not VPC Peering or other private connectivity options.
  • Mutual TLS is not supported for AWS RDS and AWS Aurora source databases.

Get started

To create a new data pipeline, you need to:

  1. Prepare your source database and any associated credentials.
  2. Define the source connection and data pipeline by selecting which tables to sync.

Once your data pipeline is defined, you can view and edit it.

RATE THIS PAGE
Back to top ↑