Prometheus metrics v2 preview
V2 metrics are available to Prometheus as of Redis Enterprise Software version 7.8.2.
You can integrate Redis Enterprise Software with Prometheus and Grafana to create dashboards for important metrics.
The v2 metrics in the following tables are available as of Redis Enterprise Software version 7.8.0. For help transitioning from v1 metrics to v2 PromQL, see Prometheus v1 metrics and equivalent v2 PromQL.
Database metrics
Metric | Type | Description |
---|---|---|
endpoint_client_connections | counter | Number of client connection establishment events |
endpoint_client_disconnections | counter | Number of client disconnections initiated by the client |
endpoint_client_connection_expired | counter | Total number of client connections with expired TTL (Time To Live) |
endpoint_client_establishment_failures | counter | Number of client connections that failed to establish properly |
endpoint_client_expiration_refresh | counter | Number of expiration time changes of clients |
endpoint_client_tracking_off_requests | counter | Total number of CLIENT TRACKING OFF requests |
endpoint_client_tracking_on_requests | counter | Total number of CLIENT TRACKING ON requests |
endpoint_disconnected_cba_client | counter | Number of certificate-based clients disconnected |
endpoint_disconnected_ldap_client | counter | Number of LDAP clients disconnected |
endpoint_disconnected_user_password_client | counter | Number of user and password clients disconnected |
endpoint_disposed_commands_after_client_caching | counter | Total number of client caching commands that were disposed due to misuse |
endpoint_egress | counter | Number of egress bytes |
endpoint_egress_pending | counter | Number of send-pending bytes |
endpoint_egress_pending_discarded | counter | Number of send-pending bytes that were discarded due to disconnection |
endpoint_failed_cba_authentication | counter | Number of clients that failed certificate-based authentication |
endpoint_failed_ldap_authentication | counter | Number of clients that failed LDAP authentication |
endpoint_failed_user_password_authentication | counter | Number of clients that failed user password authentication |
endpoint_ingress | counter | Number of ingress bytes |
endpoint_longest_pipeline_histogram | counter | Client connections with the longest pipeline lengths |
endpoint_other_requests | counter | Number of other requests |
endpoint_other_requests_latency_histogram | histogram | Latency (in µs) histogram of other commands |
endpoint_other_requests_latency_histogram_bucket | histogram | Latency histograms for commands other than read or write commands. Can be used to represent different latency percentiles. p99.9 example: histogram_quantile(0.999, sum(rate(endpoint_other_requests_latency_histogram_bucket{cluster="$cluster", db="$db"}[$__rate_interval]) ) by (le, db)) |
endpoint_other_responses | counter | Number of other responses |
endpoint_proxy_disconnections | counter | Number of client disconnections initiated by the proxy |
endpoint_read_requests | counter | Number of read requests |
endpoint_read_requests_latency_histogram | histogram | Latency (in µs) histogram of read commands |
endpoint_read_requests_latency_histogram_bucket | histogram | Latency histograms for read commands. Can be used to represent different latency percentiles. p99.9 example: histogram_quantile(0.999, sum(rate(endpoint_read_requests_latency_histogram_bucket{cluster="$cluster", db="$db"}[$__rate_interval]) ) by (le, db)) |
endpoint_read_responses | counter | Number of read responses |
endpoint_successful_cba_authentication | counter | Number of clients that successfully authenticated with certificate-based authentication |
endpoint_successful_ldap_authentication | counter | Number of clients that successfully authenticated with LDAP |
endpoint_successful_user_password_authentication | counter | Number of clients that successfully authenticated with user and password |
endpoint_write_requests | counter | Number of write requests |
endpoint_write_requests_latency_histogram | histogram | Latency (in µs) histogram of write commands |
endpoint_write_requests_latency_histogram_bucket | histogram | Latency histograms for write commands. Can be used to represent different latency percentiles. p99.9 example: histogram_quantile(0.999, sum(rate(endpoint_write_requests_latency_histogram_bucket{cluster="$cluster", db="$db"}[$__rate_interval]) ) by (le, db)) |
endpoint_write_responses | counter | Number of write responses |
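The `histogram_quantile` examples in the table estimate a percentile from cumulative `_bucket` counters. The following Python function is a minimal sketch of that idea, assuming linear interpolation inside the bucket that contains the target rank and a lower bound of 0 for the first bucket. It is a simplified illustration, not Prometheus's implementation, and the bucket boundaries in the usage note are made-up values.

```python
import math


def histogram_quantile(q, buckets):
    """Estimate the q-quantile (0 < q <= 1) from cumulative histogram buckets.

    `buckets` is a list of (upper_bound, cumulative_count) pairs and must
    include a pair whose upper_bound is math.inf (the le="+Inf" bucket).
    """
    if not 0 < q <= 1:
        raise ValueError("q must be in (0, 1]")
    buckets = sorted(buckets)
    if not buckets or not math.isinf(buckets[-1][0]):
        raise ValueError("buckets must include a +Inf bucket")

    total = buckets[-1][1]
    if total == 0:
        return math.nan  # no observations, so no meaningful quantile

    rank = q * total  # number of observations at or below the quantile
    lower_bound, lower_count = 0.0, 0.0
    for upper_bound, count in buckets:
        if count >= rank:
            if math.isinf(upper_bound):
                # The rank falls in the open-ended bucket: report the
                # highest finite boundary instead of extrapolating.
                return lower_bound
            # Interpolate linearly between the bucket's two boundaries.
            fraction = (rank - lower_count) / (count - lower_count)
            return lower_bound + (upper_bound - lower_bound) * fraction
        lower_bound, lower_count = upper_bound, count
    return math.nan
```

For example, with hypothetical buckets `[(100, 50), (200, 90), (400, 99), (math.inf, 100)]` (latency in µs), the median falls exactly on the first boundary, while the p99.9 lands in the `+Inf` bucket and is reported as the highest finite boundary. This is why bucket boundaries limit how precise a high percentile can be.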
Node metrics
Metric | Type | Description |
---|---|---|
node_available_flash_bytes | gauge | Available flash in the node (bytes) |
node_available_flash_no_overbooking_bytes | gauge | Available flash in the node (bytes), without taking into account overbooking |
node_available_memory_bytes | gauge | Amount of free memory in the node (bytes) that is available for database provisioning |
node_available_memory_no_overbooking_bytes | gauge | Available RAM in the node (bytes) without taking into account overbooking |
node_bigstore_free_bytes | gauge | Sum of free space of back-end flash (used by flash databases [BigRedis]) on all cluster nodes (bytes); returned only when BigRedis is enabled |
node_cert_expires_in_seconds | gauge | Certificate expiration (in seconds) per given node; read more about certificates in Redis Enterprise and monitoring certificates |
node_ephemeral_storage_avail_bytes | gauge | Disk space available to RLEC processes on configured ephemeral disk (bytes) |
node_ephemeral_storage_free_bytes | gauge | Free disk space on configured ephemeral disk (bytes) |
node_memory_MemFree_bytes | gauge | Free memory in the node (bytes) |
node_persistent_storage_avail_bytes | gauge | Disk space available to RLEC processes on configured persistent disk (bytes) |
node_persistent_storage_free_bytes | gauge | Free disk space on configured persistent disk (bytes) |
node_provisional_flash_bytes | gauge | Amount of flash available for new shards on this node, taking into account overbooking, max Redis servers, reserved flash, and provision and migration thresholds (bytes) |
node_provisional_flash_no_overbooking_bytes | gauge | Amount of flash available for new shards on this node, without taking into account overbooking, max Redis servers, reserved flash, and provision and migration thresholds (bytes) |
node_provisional_memory_bytes | gauge | Amount of RAM that is available for provisioning to databases out of the total RAM allocated for databases |
node_provisional_memory_no_overbooking_bytes | gauge | Amount of RAM that is available for provisioning to databases out of the total RAM allocated for databases, without taking into account overbooking |
node_metrics_up | gauge | Node is part of the cluster and is connected |
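Because `node_cert_expires_in_seconds` reports a duration in seconds, a monitoring script usually converts it to days before comparing it with a threshold. The sketch below assumes the samples have already been fetched into a dictionary. The dictionary keys and the 30-day threshold are hypothetical choices for illustration.

```python
SECONDS_PER_DAY = 86_400


def cert_days_remaining(expires_in_seconds):
    """Convert a node_cert_expires_in_seconds sample to days."""
    return expires_in_seconds / SECONDS_PER_DAY


def certs_expiring_soon(samples, warn_days=30):
    """Return the sorted keys of certificates that expire within warn_days.

    `samples` maps an identifying string (hypothetical, for example a
    node and certificate name) to a node_cert_expires_in_seconds value.
    """
    return sorted(
        name
        for name, seconds in samples.items()
        if cert_days_remaining(seconds) < warn_days
    )
```

A negative value is treated like any other value below the threshold, so a certificate that reports as already expired is also flagged.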
Cluster metrics
Metric | Type | Description |
---|---|---|
generation{cluster_wd=<node_uid>} | gauge | Generation number of the specific cluster_wd |
has_qourum{cluster_wd=<node_uid>, has_witness_disk=BOOL} | gauge | Has quorum = 1; no quorum = 0 |
is_primary{cluster_wd=<node_uid>} | gauge | Primary = 1; secondary = 0 |
license_shards_limit | | Total shard limit by the license by shard type (ram / flash) |
total_live_nodes_count{cluster_wd=<node_uid>} | gauge | Number of live nodes |
total_node_count{cluster_wd=<node_uid>} | gauge | Number of nodes |
total_primary_selection_ended{cluster_wd=<node_uid>} | counter | Monotonic counter for each selection process that ended |
total_primary_selections{cluster_wd=<node_uid>} | counter | Monotonic counter for each selection process that started |
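The two selection counters and the two node counts in the table can be combined into simple derived values. The sketch below assumes the four samples for one `cluster_wd` have already been read into plain numbers. The function name and the returned dictionary keys are hypothetical.

```python
def cluster_summary(total_node_count, total_live_nodes_count,
                    total_primary_selections, total_primary_selection_ended):
    """Derive two health indicators from cluster metric samples.

    - nodes_down: nodes that are counted but not reported as live.
    - selections_in_progress: selection processes that started but have
      not yet been counted as ended.
    Both values are clamped at 0 in case samples were scraped at
    slightly different moments.
    """
    return {
        "nodes_down": max(total_node_count - total_live_nodes_count, 0),
        "selections_in_progress": max(
            total_primary_selections - total_primary_selection_ended, 0
        ),
    }
```

A non-zero `selections_in_progress` that persists across several scrapes may be worth investigating, although what counts as too long depends on the deployment.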
Replication metrics
Metric | Description |
---|---|
database_syncer_config | Used as a placeholder for configuration labels |
database_syncer_current_status | Syncer status for traffic; 0 = in-sync, 2 = out of sync |
database_syncer_dst_connectivity_state | Destination connectivity state |
database_syncer_dst_connectivity_state_ms | Destination connectivity state duration |
database_syncer_dst_lag | Lag in milliseconds between the syncer and the destination |
database_syncer_dst_repl_offset | Offset of the last command acknowledged |
database_syncer_flush_counter | Number of destination flushes |
database_syncer_ingress_bytes | Number of bytes read from source shard |
database_syncer_ingress_bytes_decompressed | Number of bytes read from source shard, after decompression |
database_syncer_internal_state | Internal state of the syncer |
database_syncer_lag_ms | Lag time between the source and the destination for traffic in milliseconds |
database_syncer_rdb_size | The source's RDB size in bytes to be transferred during the syncing phase |
database_syncer_rdb_transferred | Number of bytes transferred from the source's RDB during the syncing phase |
database_syncer_src_connectivity_state | Source connectivity state |
database_syncer_src_connectivity_state_ms | Source connectivity state duration |
database_syncer_src_repl_offset | Last known source offset |
database_syncer_state | Internal state of the shard syncer |
database_syncer_syncer_repl_offset | Offset of the last command handled by the syncer |
database_syncer_total_requests | Number of destination writes |
database_syncer_total_responses | Number of destination writes acknowledged |
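During the syncing phase, `database_syncer_rdb_transferred` and `database_syncer_rdb_size` together indicate how far the transfer has progressed. The following sketch computes that fraction from two sample values. The function name is hypothetical, and the handling of a zero size is an assumption about how to treat a syncer that has no RDB transfer to report.

```python
def rdb_sync_progress(rdb_transferred, rdb_size):
    """Return the fraction (0.0 to 1.0) of the source RDB transferred.

    Returns None when rdb_size is not positive, because no progress
    ratio can be computed in that case.
    """
    if rdb_size <= 0:
        return None
    # Clamp to 1.0 in case the two samples were scraped at different times.
    return min(rdb_transferred / rdb_size, 1.0)
```

For example, `rdb_sync_progress(250, 1000)` gives `0.25`, which a dashboard could display as 25% complete.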
Shard metrics
Metric | Description |
---|---|
redis_server_active_defrag_running | Automatic memory defragmentation current aggressiveness (% cpu) |
redis_server_allocator_active | Total used memory, including external fragmentation |
redis_server_allocator_allocated | Total allocated memory |
redis_server_allocator_resident | Total resident memory (RSS) |
redis_server_aof_last_cow_size | Copy-on-write memory used by the last AOF rewrite |
redis_server_aof_rewrite_in_progress | The number of simultaneous AOF rewrites that are in progress |
redis_server_aof_rewrites | Number of AOF rewrites this process executed |
redis_server_aof_delayed_fsync | Number of times an AOF fsync caused delays in the main Redis thread (inducing latency); this can indicate that the disk is slow or overloaded |
redis_server_blocked_clients | Number of clients waiting on a blocking call |
redis_server_connected_clients | Number of client connections to the specific shard |
redis_server_connected_slaves | Number of connected replicas |
redis_server_db0_avg_ttl | Average TTL of all volatile keys |
redis_server_db0_expires | Total count of volatile keys |
redis_server_db0_keys | Total key count |
redis_server_evicted_keys | Keys evicted so far (since restart) |
redis_server_expire_cycle_cpu_milliseconds | The cumulative amount of time spent on active expiry cycles |
redis_server_expired_keys | Keys expired so far (since restart) |
redis_server_forwarding_state | Shard forwarding state (on or off) |
redis_server_keys_trimmed | The number of keys that were trimmed in the current or last resharding process |
redis_server_keyspace_read_hits | Number of read operations accessing an existing key in the keyspace |
redis_server_keyspace_read_misses | Number of read operations accessing a non-existing key in the keyspace |
redis_server_keyspace_write_hits | Number of write operations accessing an existing key in the keyspace |
redis_server_keyspace_write_misses | Number of write operations accessing a non-existing key in the keyspace |
redis_server_master_link_status | Indicates if the replica is connected to its master |
redis_server_master_repl_offset | Number of bytes sent to replicas by the shard; calculate the throughput for a time period by comparing the value at different times |
redis_server_master_sync_in_progress | The master shard is synchronizing (1 = true, 0 = false) |
redis_server_max_process_mem | Current memory limit configured by redis_mgr according to node free memory |
redis_server_maxmemory | Current memory limit configured by redis_mgr according to database memory limits |
redis_server_mem_aof_buffer | Current size of AOF buffer |
redis_server_mem_clients_normal | Current memory used for input and output buffers of non-replica clients |
redis_server_mem_clients_slaves | Current memory used for input and output buffers of replica clients |
redis_server_mem_fragmentation_ratio | Memory fragmentation ratio (1.3 means 30% overhead) |
redis_server_mem_not_counted_for_evict | Portion of used_memory (in bytes) that's not counted for eviction and OOM error |
redis_server_mem_replication_backlog | Size of replication backlog |
redis_server_module_fork_in_progress | A binary value that indicates if there is an active fork spawned by a module (1) or not (0) |
namedprocess_namegroup_cpu_seconds_total | Shard process CPU usage percentage |
namedprocess_namegroup_thread_cpu_seconds_total | Shard main thread CPU time spent in seconds |
namedprocess_namegroup_open_filedesc | Shard number of open file descriptors |
namedprocess_namegroup_memory_bytes | Shard memory size in bytes |
namedprocess_namegroup_oldest_start_time_seconds | Shard start time of the process since unix epoch in seconds |
redis_server_rdb_bgsave_in_progress | Indication if bgsave is currently in progress |
redis_server_rdb_last_cow_size | Copy-on-write memory used by the last bgsave (or SYNC fork) |
redis_server_rdb_saves | Total count of bgsaves since the process was restarted (including replica fullsync and persistence) |
redis_server_repl_touch_bytes | Number of bytes sent to replicas as TOUCH commands by the shard as a result of a READ command that was processed; calculate the throughput for a time period by comparing the value at different times |
redis_server_total_commands_processed | Number of commands processed by the shard; calculate the number of commands for a time period by comparing the value at different times |
redis_server_total_connections_received | Number of connections received by the shard; calculate the number of connections for a time period by comparing the value at different times |
redis_server_total_net_input_bytes | Number of bytes received by the shard; calculate the throughput for a time period by comparing the value at different times |
redis_server_total_net_output_bytes | Number of bytes sent by the shard; calculate the throughput for a time period by comparing the value at different times |
redis_server_up | Shard is up and running |
redis_server_used_memory | Memory used by shard (in BigRedis this includes flash) (bytes) |
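Several shard metrics above are cumulative values, and their descriptions suggest comparing the value at different times. The sketch below shows one way to turn two samples into a per-second rate and to read `redis_server_mem_fragmentation_ratio` as an overhead percentage. The function names are hypothetical, and treating a decrease as a counter reset (for example after a shard restart) is an assumption modeled on how Prometheus's `rate()` handles resets.

```python
def counter_rate(value_start, time_start, value_end, time_end):
    """Return the average per-second increase between two counter samples.

    Times are in seconds. If the counter decreased, assume it was reset
    to 0 between the samples and count only the growth since the reset.
    """
    elapsed = time_end - time_start
    if elapsed <= 0:
        raise ValueError("time_end must be later than time_start")
    if value_end >= value_start:
        delta = value_end - value_start
    else:
        delta = value_end  # assumed reset: counter restarted from 0
    return delta / elapsed


def fragmentation_overhead_pct(mem_fragmentation_ratio):
    """Convert a fragmentation ratio to overhead percent (1.3 -> about 30)."""
    return (mem_fragmentation_ratio - 1.0) * 100.0
```

For example, two `redis_server_total_net_input_bytes` samples of 1000 and 4000 taken 60 seconds apart give `counter_rate(1000, 0, 4000, 60) == 50.0` bytes per second.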