# Redis cache-aside with redis-rb

```json metadata
{
  "title": "Redis cache-aside with redis-rb",
  "description": "Implement a Redis cache-aside layer in Ruby with redis-rb",
  "categories": ["docs","develop","stack","oss","rs","rc"],
  "tableOfContents": {"sections":[{"id":"overview","title":"Overview"},{"id":"how-it-works","title":"How it works"},{"children":[{"id":"data-model","title":"Data model"}],"id":"the-cache-aside-helper","title":"The cache-aside helper"},{"id":"cache-aside-reads","title":"Cache-aside reads"},{"id":"stampede-protection-with-a-lua-lock","title":"Stampede protection with a Lua lock"},{"id":"invalidation-on-write","title":"Invalidation on write"},{"id":"field-level-updates","title":"Field-level updates"},{"id":"hitmiss-accounting","title":"Hit/miss accounting"},{"id":"prerequisites","title":"Prerequisites"},{"id":"running-the-demo","title":"Running the demo"},{"id":"the-mock-primary-store","title":"The mock primary store"},{"children":[{"id":"choose-a-ttl-that-matches-your-staleness-tolerance","title":"Choose a TTL that matches your staleness tolerance"},{"id":"invalidate-dont-try-to-keep-the-cache-in-sync","title":"Invalidate, don't try to keep the cache in sync"},{"id":"handle-missing-records-explicitly","title":"Handle missing records explicitly"},{"id":"tune-the-single-flight-lock-ttl","title":"Tune the single-flight lock TTL"},{"id":"use-a-connection-pool-under-high-concurrency","title":"Use a connection pool under high concurrency"},{"id":"namespace-cache-keys-in-shared-redis-deployments","title":"Namespace cache keys in shared Redis deployments"},{"id":"inspect-cached-entries-directly-in-redis","title":"Inspect cached entries directly in Redis"}],"id":"production-usage","title":"Production usage"},{"id":"learn-more","title":"Learn more"}]}

,
  "codeExamples": []
}
```
This guide shows you how to implement a Redis cache-aside layer in Ruby with the [`redis-rb`](https://redis.io/docs/latest/develop/clients/ruby) gem. It includes a small local web server built on Ruby's standard `webrick` library so you can see cache hits, misses, invalidation on write, and stampede protection in action.

## Overview

Cache-aside is one of the most common Redis use cases for read-heavy applications. Instead of querying the primary database on every request, the application checks Redis first and only falls back to the primary on a miss. The result is written back to Redis with a TTL so the next read is served from memory.

That gives you:

* Sub-millisecond reads for the hot working set
* Bounded staleness — every entry expires within a known window
* Reduced primary database load proportional to hit rate
* Field-level updates without re-serializing the full record
* Protection against cache stampedes when popular keys expire under load

In this example, each cached product is stored as a Redis hash under a key like `cache:product:{id}`. The hash holds the product fields (`id`, `name`, `price_cents`, `stock`) and the key has a TTL so stale data is bounded automatically.

## How it works

The flow on every read looks like this:

1. The application calls `cache.get(product_id) { |id| primary.read(id) }`
2. The helper runs `HGETALL` against `cache:product:{id}`
3. On a hit, the cached hash is returned directly
4. On a miss, the helper acquires a Lua-backed single-flight lock and yields the loader to fetch from the primary
5. The helper writes the result back to Redis with `HSET` plus `EXPIRE` and releases the lock
6. Concurrent threads that fail to acquire the lock wait briefly for the cache to populate, then return that value instead of issuing their own primary read

On a write, the application updates the primary and then deletes the cache key, so the next read repopulates from the new source value.

## The cache-aside helper

The `RedisCache` class wraps the cache-aside operations
([source](cache.rb)):

```ruby
require "redis"
require_relative "cache"
require_relative "primary"

redis = Redis.new(host: "localhost", port: 6379)
primary = MockPrimaryStore.new(read_latency_ms: 150)
cache = RedisCache.new(redis: redis, ttl: 30)

# Read through the cache.
result = cache.get("p-001") { |id| primary.read(id) }
puts "hit=#{result[:hit]} latency=#{result[:redis_latency_ms]} ms record=#{result[:record]}"

# Update a single field without rewriting the whole record.
cache.update_field("p-001", "stock", "41")

# Invalidate the cache key on a write to the primary.
primary.update_field("p-001", "price_cents", "699")
cache.invalidate("p-001")
```

### Data model

Each cached product is stored in a Redis hash:

```text
cache:product:p-001
  id          = p-001
  name        = Sourdough Loaf
  price_cents = 650
  stock       = 42
```

The implementation uses:

* [`HGETALL`](https://redis.io/docs/latest/commands/hgetall) to read the cached record
* [`HSET`](https://redis.io/docs/latest/commands/hset) plus [`EXPIRE`](https://redis.io/docs/latest/commands/expire) to repopulate after a miss
* [`DEL`](https://redis.io/docs/latest/commands/del) to invalidate on writes
* [`TTL`](https://redis.io/docs/latest/commands/ttl) to surface remaining staleness in the demo UI
* [`EVAL`](https://redis.io/docs/latest/commands/eval) for the Lua single-flight lock that prevents stampedes
* [`WATCH`](https://redis.io/docs/latest/commands/watch)/[`MULTI`](https://redis.io/docs/latest/commands/multi)/[`EXEC`](https://redis.io/docs/latest/commands/exec) for the conditional field update path

## Cache-aside reads

The `get` method runs `HGETALL` on the cache key first. On a hit it returns the cached hash and records the hit in an in-process counter. On a miss, it delegates to a single-flight loader:

```ruby
def get(entity_id, &loader)
  cache_key = cache_key(entity_id)

  started = monotonic_ms
  cached = @redis.hgetall(cache_key)
  redis_latency_ms = monotonic_ms - started

  if cached && !cached.empty?
    record_hit
    return { record: cached, hit: true, redis_latency_ms: redis_latency_ms }
  end

  record_miss
  record = load_with_single_flight(entity_id, &loader)
  { record: record, hit: false, redis_latency_ms: redis_latency_ms }
end
```

The returned hash includes the measured Redis round-trip time so the demo UI can show the latency difference between a hit and a miss.

## Stampede protection with a Lua lock

When a popular key expires, every concurrent reader observes the miss at the same instant. Without coordination, all of them would query the primary and overwrite the cache redundantly — a *cache stampede*.

The helper uses a tiny Lua script to acquire a short-lived lock atomically. Only the thread that wins the `SET NX` becomes the primary loader; the rest poll the cache briefly and return the value the lock holder writes:

```lua
-- Acquire a short-lived lock with SET NX PX. Returns 1 on acquire, 0 otherwise.
if redis.call('SET', KEYS[1], ARGV[1], 'NX', 'PX', ARGV[2]) then
    return 1
end
return 0
```

A second script releases the lock only if the caller still owns it, so a lock that timed out and was re-acquired by someone else cannot be released by mistake:

```lua
if redis.call('GET', KEYS[1]) == ARGV[1] then
    return redis.call('DEL', KEYS[1])
end
return 0
```

The Ruby side runs both scripts via `EVAL` on every miss:

```ruby
def load_with_single_flight(entity_id, &loader)
  cache_key = cache_key(entity_id)
  lock_key = lock_key(entity_id)
  token = SecureRandom.hex(8)

  acquired = @redis.eval(ACQUIRE_LOCK_SCRIPT, keys: [lock_key], argv: [token, @lock_ttl_ms.to_s])

  if acquired == 1
    begin
      record = loader.call(entity_id)
      return nil if record.nil?
      @redis.multi do |pipe|
        pipe.del(cache_key)
        pipe.hset(cache_key, record)
        pipe.expire(cache_key, @ttl)
      end
      return record
    ensure
      @redis.eval(RELEASE_LOCK_SCRIPT, keys: [lock_key], argv: [token])
    end
  end

  record_stampede_suppressed
  deadline = monotonic_ms + @lock_ttl_ms
  while monotonic_ms < deadline
    sleep(@wait_poll_ms / 1000.0)
    cached = @redis.hgetall(cache_key)
    return cached if cached && !cached.empty?
  end
  loader.call(entity_id)
end
```

The unique `token` per caller is what makes the release script safe — only the thread that actually holds the lock can release it.

## Invalidation on write

When a write hits the primary, the application invalidates the cache key. The next read pulls fresh data from the primary:

```ruby
def invalidate(entity_id)
  @redis.del(cache_key(entity_id)) == 1
end
```

This is the simplest and safest pattern: never try to keep the cache and primary in sync directly, just delete the cache entry and let the next read repopulate it.

## Field-level updates

Because each record is stored as a hash, the cache helper can also update a single field in place without re-serializing the full record. The update only writes if the entry is already cached, so a partial record can never appear in Redis:

```ruby
def update_field(entity_id, field, value)
  cache_key = cache_key(entity_id)
  loop do
    @redis.watch(cache_key)
    unless @redis.exists?(cache_key)
      @redis.unwatch
      return false
    end
    result = @redis.multi do |pipe|
      pipe.hset(cache_key, field, value.to_s)
      pipe.expire(cache_key, @ttl)
    end
    return true if result
    # nil result means WATCH detected a change — retry.
  end
end
```

This is useful for hot fields that change more often than the rest of the record (a stock counter, a view count) and would otherwise force a full reload.

## Hit/miss accounting

The helper keeps in-process counters for hits, misses, and stampedes that were suppressed by the single-flight lock. The demo UI surfaces these so you can see the cache absorbing load:

```ruby
def stats
  @stats_mutex.synchronize do
    total = @stats[:hits] + @stats[:misses]
    hit_rate = total.zero? ? 0.0 : (1000 * @stats[:hits] / total).to_f / 10
    {
      "hits" => @stats[:hits],
      "misses" => @stats[:misses],
      "stampedes_suppressed" => @stats[:stampedes_suppressed],
      "hit_rate_pct" => hit_rate,
    }
  end
end
```

In production you would emit these as Prometheus counters or push them into your metrics pipeline rather than holding them in process memory.

## Prerequisites

Before running the demo, make sure that:

* Redis is running and accessible. By default, the demo connects to `localhost:6379`.
* The `redis` gem is installed:

```bash
gem install redis webrick
```

If your Redis server is running elsewhere, start the demo with `--redis-host` and `--redis-port`.

## Running the demo

A local demo server is included to show the cache-aside layer in action
([source](demo_server.rb)):

```bash
ruby demo_server.rb
```

The demo server uses Ruby's standard library for HTTP handling and concurrency:

* [`webrick`](https://github.com/ruby/webrick) for the multi-threaded web server
* [`uri`](https://docs.ruby-lang.org/en/master/URI.html) for query and form decoding
* [`securerandom`](https://docs.ruby-lang.org/en/master/SecureRandom.html) for the per-caller lock token

It exposes a small interactive page where you can:

* Read a product through the cache and see whether it was a hit or a miss
* Compare the measured Redis round-trip against the simulated primary read latency
* Watch the cache TTL count down between requests
* Update a field on the primary and see the cache invalidate automatically
* Run a stampede test that fires many concurrent reads at a freshly-invalidated key and confirms only one of them reaches the primary
* Reset the hit/miss counters at any time

After starting the server, visit `http://localhost:8080`.

## The mock primary store

To make the demo self-contained, the example includes a `MockPrimaryStore` that stands in for a slow disk-backed database
([source](primary.rb)):

```ruby
class MockPrimaryStore
  def initialize(read_latency_ms: 150)
    @read_latency_ms = read_latency_ms
    # ...
  end

  def read(id)
    sleep(@read_latency_ms / 1000.0)
    # ...
  end
end
```

Every call to `read` sleeps for `read_latency_ms` so the difference between a cache hit and a miss is obvious in the UI. The store also tracks the total number of primary reads, which the stampede test uses to confirm that single-flight is working — for N concurrent readers against a cold key, you should see exactly one primary read.

In a real application this would be replaced by an ActiveRecord query, an HTTP call to a downstream service, or any other slow-but-authoritative source.

## Production usage

This guide uses a deliberately small local demo so you can focus on the cache-aside pattern. In production, you will usually want to harden several aspects of it.

### Choose a TTL that matches your staleness tolerance

The TTL is the upper bound on how long a stale value can be served. Shorter TTLs mean lower hit rates and more primary load; longer TTLs mean higher hit rates and more stale reads between writes. Pick the value that matches your business tolerance for stale data, and combine it with explicit invalidation on writes for the cases where you cannot tolerate any staleness.

### Invalidate, don't try to keep the cache in sync

When the underlying record changes, delete the cache key rather than rewriting it. Cache-aside is robust precisely because it never assumes the cache holds the latest value — the next read always re-fetches from the primary on a miss.

### Handle missing records explicitly

In this demo, a missing record returns `nil` and nothing is cached. In a real system you may want to cache "not found" sentinels with a short TTL to absorb load from probing for non-existent IDs, while making sure the sentinel TTL is shorter than the positive cache entry so a newly-created record becomes visible quickly.

### Tune the single-flight lock TTL

The lock TTL needs to be longer than the worst-case primary read latency so a slow loader does not lose the lock midway. The unique token in `RELEASE_LOCK_SCRIPT` ensures the original caller does not delete someone else's lock if its lock has expired.

### Use a connection pool under high concurrency

The demo shares a single `Redis` client across WEBrick request threads. For high-concurrency production workloads, use a connection pool such as [`connection_pool`](https://github.com/mperham/connection_pool) so that concurrent threads do not contend on a single TCP connection.

### Namespace cache keys in shared Redis deployments

If multiple applications share a Redis deployment, prefix cache keys with the application name (`cache:billing:product:{id}`) so different services cannot clobber each other's entries.

### Inspect cached entries directly in Redis

When testing or troubleshooting, inspect the stored cache key directly to confirm the application is writing the fields and TTL you expect:

```bash
redis-cli HGETALL cache:product:p-001
redis-cli TTL cache:product:p-001
```

## Learn more

* [redis-rb guide](https://redis.io/docs/latest/develop/clients/ruby) - Install and use the Ruby Redis client
* [SET command](https://redis.io/docs/latest/commands/set) - Set a string with TTL options (`EX`, `PX`, `NX`)
* [HSET command](https://redis.io/docs/latest/commands/hset) - Write hash fields
* [HGETALL command](https://redis.io/docs/latest/commands/hgetall) - Read every field of a hash
* [EXPIRE command](https://redis.io/docs/latest/commands/expire) - Set key expiration in seconds
* [DEL command](https://redis.io/docs/latest/commands/del) - Delete a key on invalidation
* [Lua scripting](https://redis.io/docs/latest/develop/programmability/eval-intro) - Atomic single-flight locks and stampede mitigation

