View and edit data pipeline

Observe and change your data pipeline.

Redis Cloud

To view or manage your data pipeline, select it from your workspace from the Data Integration page or from the Data Integration tab in your subscription or database.

The pipeline page has the following tabs:

The following sections describe each of these tabs, as well as any actions that you can perform.

Dashboard

The Dashboard tab shows an overview and high-level statistics for the data pipeline.

The Dashboard tab for a pipeline.
  • Pipeline health: The status of the data pipeline. Possible statuses include:
    Status Description
    Streaming The data pipeline is capturing new changes from the source database as they happen. Changes in the source database are added to the target database within a few seconds.
    Stopped The data pipeline has been stopped.
    Error There is an error in the data pipeline. Reset the pipeline and contact support if the issue persists.
  • Total ingested: Total number of records ingested from the source database.
  • Pending: Total number of records that are being processed and have not been inserted into the target database.
  • Record health: The number of records that were rejected from the database, and the number of records that were filtered from being inserted into the database.
  • Data latency: How long it takes for a new record to be ingested from the source database.

Change target database

You can change the target database for your pipeline if the new target database is in the same subscription of the current target database. To do this:

  1. From the Dashboard tab of your pipeline, select More Actions > Change pipeline target next to the target database of the pipeline.
The More Actions menu for the target database in the Pipeline's dashboard tab.
  1. Select the new target database from the list.
The Change pipeline target database menu.
  1. Select Change target to continue.

Changing the target database restarts the pipeline and re-ingests all data into the new target database.

Metrics

The Metrics tab shows the following metrics for each data stream:

The Metrics tab for a pipeline.
Metric Description
Name Name of the data stream. Each stream corresponds to a table from the source database.
Total Total number of records that arrived from the source table.
Pending Number of records from the source table that are waiting to be processed.
Inserted Number of new records from the source table that have been written to the target database.
Updated Number of updated records from the source table that have been updated in the target database.
Deleted Number of deleted records from the source table that have been deleted in the target database.
Filtered Number of records from the source table that were filtered from being inserted into the target database.
Rejected Number of records from the source table that could not be parsed or inserted into the target database.

Configuration

The Configuration tab shows your source connectivity, secrets, and collector configuration properties.

The Configuration tab for a pipeline.

If you accidentally overwrite the access for your load balancer or secrets, you can find the required ARNs in the Source connectivity or Secrets section of this tab.

Edit collector properties

From the Configuration tab of your pipeline, select Edit collector properties to change your collector properties.

The advanced properties section.

You can add any Debezium source property for your source database type in the Collector source properties section and any Redis server Debezium sink property in the Collector sink properties section. Select Save properties to save the collector properties.

Dataset

The Dataset tab shows the data from your source database that is ingested into your target database.

The Dataset tab for a pipeline.

Edit dataset

To change the dataset for your data pipeline:

  1. From the Dataset tab of your pipeline, select Edit.

    The Edit button.
  2. In the Schemas section, select the schema(s) you want to migrate to the target database from the list.

    The dataset step with a schema selected.
  3. When you select a schema, you will see its tables in the Tables section. Redis Cloud will automatically select all tables for import. You can de-select any columns you do not wish to import to your Redis database.

  4. Select a table to view its columns in the Columns section. You can de-select any columns you do not wish to import.

    The columns section, with a few columns selected from one table

    If any tables are missing a unique key, a warning will appear in the Data modeling section. Select Show affected to filter the Tables section to the tables without a unique key.

    The dataset step filtered to show tables that are missing a unique key.

    For these tables, select the key icon next to the column that defines a unique key.

    The unique key icon.
  5. Select Save changes to save your changes.

  6. Redis Cloud will display a warning that the data pipeline will restart. Select Apply and restart to proceed.

At this point, the data pipeline will apply the changes. The data pipeline will re-ingest data from the source database to the target database. After this initial sync is complete, the data pipeline enters the change streaming phase, where changes are captured as they happen.

Transformations

The Transformations tab shows the default data structure for records in your target database and any transformation jobs or processor properties you've set for your pipeline.

The Transformations tab for a pipeline.

Edit transformations

To edit any of the information in the Transformations tab:

  1. From the Transformations tab of your pipeline, select Edit.

  2. Select how your records will be stored in Redis. You can choose Hash or JSON.

  3. Under Transformation jobs, you can supply one or more transformation job files that specify how you want to transform the captured data before writing it to the target. Select Upload jobs to upload your job files. When you upload job files, Redis Cloud will validate the job files to check for errors.

  4. Select Edit advanced properties to add any processor properties to control how the data is processed.

    The Advanced Processor properties.

    See the RDI configuration file reference for all available processor properties.

  5. Select Save changes.

  6. Redis Cloud will display a warning that the data pipeline will restart. Select Apply and restart to proceed.

At this point, the data pipeline will apply the changes. The data pipeline will re-ingest data from the source database to the target database. After this initial sync is complete, the data pipeline enters the change streaming phase, where changes are captured as they happen.

Reset data pipeline

Resetting the data pipeline creates a new baseline snapshot from the current state of your source database, and re-processes the data from the source database to the target Redis database. You may want to reset the pipeline if the source and target databases were disconnected or you made large changes to the data pipeline.

To reset the data pipeline and restart the ingest process:

  1. From your pipeline, select More actions, and then Reset pipeline.

  2. If you want to flush the database, check Flush target database.

  3. Select Reset data pipeline.

At this point, the data pipeline will re-ingest data from the source database to your target Redis database.

Stop and restart data pipeline

To stop the data pipeline from synchronizing new data:

  1. From your pipeline, select More actions, and then Stop pipeline.

  2. Select Stop data pipeline to confirm.

Stopping the data pipeline will suspend data processing. To restart the pipeline from the Data pipeline tab, select More actions, and then Start pipeline.

Delete pipeline

To delete the data pipeline:

  1. From your pipeline, select More actions, and then Delete pipeline.

  2. Select Delete data pipeline to confirm.

Deleted data pipelines cannot be recovered. You may also want to delete your workspace.

RATE THIS PAGE
Back to top ↑