# Redis feature store with Jedis

```json metadata
{
  "title": "Redis feature store with Jedis",
  "description": "Build a Redis-backed online feature store in Java with Jedis",
  "categories": ["docs","develop","stack","oss","rs","rc"],
  "tableOfContents": {"sections":[{"id":"overview","title":"Overview"},{"children":[{"id":"batch-path-per-materialization-cycle","title":"Batch path (per materialization cycle)"},{"id":"streaming-path-per-event","title":"Streaming path (per event)"},{"id":"inference-path-per-request","title":"Inference path (per request)"}],"id":"how-it-works","title":"How it works"},{"children":[{"id":"project-layout","title":"Project layout"},{"id":"data-model","title":"Data model"},{"id":"bulk-loading-batch-features","title":"Bulk-loading batch features"},{"id":"streaming-writes-with-per-field-ttl","title":"Streaming writes with per-field TTL"},{"id":"inference-reads-with-hmget","title":"Inference reads with HMGET"},{"id":"batch-scoring-with-pipelined-hmget","title":"Batch scoring with pipelined HMGET"}],"id":"the-feature-store-helper","title":"The feature-store helper"},{"id":"the-streaming-worker","title":"The streaming worker"},{"id":"the-batch-builder","title":"The batch builder"},{"id":"the-interactive-demo","title":"The interactive demo"},{"id":"prerequisites","title":"Prerequisites"},{"children":[{"id":"get-the-source-files","title":"Get the source files"},{"id":"start-the-demo-server","title":"Start the demo server"}],"id":"running-the-demo","title":"Running the demo"},{"children":[{"id":"pick-the-batch-ttl-to-outlast-a-failed-refresher","title":"Pick the batch TTL to outlast a failed refresher"},{"id":"co-locate-the-online-store-with-serving-not-with-training","title":"Co-locate the online store with serving, not with training"},{"id":"pipeline-batch-reads-across-shards","title":"Pipeline batch reads across shards"},{"id":"make-hexpire-part-of-every-streaming-write","title":"Make HEXPIRE part of every streaming write"},{"id":"avoid-hgetall-on-the-request-path","title":"Avoid HGETALL on the request path"},{"id":"size-the-jedispool-for-the-request-shape","title":"Size the JedisPool for the request shape"},{"id":"inspect-the-store-directly-with-redis-cli","title":"Inspect the store directly with redis-cli"}],"id":"production-usage","title":"Production usage"},{"id":"learn-more","title":"Learn more"}]},
  "codeExamples": []
}
```
This guide shows you how to build a small Redis-backed online feature store in
Java with [Jedis](https://redis.io/docs/latest/develop/clients/jedis). It includes a
local web server built with the JDK's `com.sun.net.httpserver.HttpServer` so
you can bulk-load a batch of users with a key-level TTL, run a streaming
worker that overwrites real-time features with per-field TTL, retrieve any
subset of features for one user under 2 ms, and pipeline `HMGET` across a
hundred users for batch scoring.

## Overview

Each entity (here, a user) is one Redis
[Hash](https://redis.io/docs/latest/develop/data-types/hashes) at a deterministic key —
`fs:user:{id}`. The hash holds every feature for that entity as one field per
feature: batch-materialized aggregates (refreshed once a day) alongside
streaming-updated signals (refreshed every few seconds). One
[`HMGET`](https://redis.io/docs/latest/commands/hmget) returns whichever subset the model
needs in one network round trip.

Two TTL layers solve the *mixed staleness* problem without an application-side
cleaner:

* A **key-level** [`EXPIRE`](https://redis.io/docs/latest/commands/expire) aligned with the
  batch materialization cycle (24 hours in the demo). If the batch refresher
  fails, the whole entity disappears at the next cycle and inference sees a
  missing entity — which the model handler can detect and fall back on —
  rather than silently outdated values.
* A **per-field** [`HEXPIRE`](https://redis.io/docs/latest/commands/hexpire) (Redis 7.4+) on
  each streaming feature gives that field its own shorter expiry, independent
  of the rest of the hash. If the streaming pipeline stops updating a feature,
  the field self-cleans while the batch fields stay populated.

In this example, the batch features describe a user's longer-term shape
(`country_iso`, `risk_segment`, `account_age_days`, `tx_count_7d`,
`avg_amount_30d`, `chargeback_count_180d`) and are bulk-loaded by
`BuildFeatures.java` — the demo's stand-in for a nightly Spark / Feast
materialization job. The streaming features describe what the user is doing
right now (`last_login_ts`, `last_device_id`, `tx_count_5m`,
`failed_logins_15m`, `session_country`) and are written by
`StreamingWorker.java` — the demo's stand-in for a Flink / Kafka Streams job.
The inference handlers of the demo server read any subset of those features
through `FeatureStore.java`'s helper class.

That gives you:

* A single round trip for retrieval — any subset of features for one entity
  in one [`HMGET`](https://redis.io/docs/latest/commands/hmget).
* Sub-millisecond hot path. The Redis-side work is microseconds; in practice
  the bottleneck is the network round trip plus the model's own feature-prep.
* Pipelined batch scoring — one round trip for `N` users at once.
* Independent freshness per feature, expressed as a server-side TTL rather
  than as application logic.
* Self-cleanup on pipeline failure: a stalled batch refresher lets entities
  expire on schedule, and a stalled streaming worker lets each affected field
  expire on its own timer.

## How it works

There are three paths: a **batch path** that bulk-loads features once per
materialization cycle, a **streaming path** that updates real-time features
as events arrive, and an **inference path** that reads features on the
request side.

### Batch path (per materialization cycle)

1. The batch job calls `BuildFeatures.synthesizeUsers(N, seed)` (in
   production, the equivalent computation lives in an offline pipeline against
   the warehouse). The result is `Map<String, Map<String, Object>>` keyed by
   user ID.
2. `store.bulkLoad(rows, ttlSeconds)` queues one
   [`HSET`](https://redis.io/docs/latest/commands/hset) plus one
   [`EXPIRE`](https://redis.io/docs/latest/commands/expire) per user through Jedis's
   [`Pipeline`](https://redis.io/docs/latest/develop/clients/jedis/transpipe), then
   `pipe.sync()` ships the whole batch in a single round trip. The `HSET`
   writes every batch field; the `EXPIRE` is what makes the entity disappear
   if the next batch run fails, so inference reads a missing entity rather
   than silently outdated values.

### Streaming path (per event)

When a user does something (login, transaction, page view) the streaming
layer computes whatever real-time signals fall out of that event and calls
`store.updateStreaming(userId, fields, ttlSeconds)`. That call pipelines two commands:

1. An [`HSET`](https://redis.io/docs/latest/commands/hset) writing the new field values.
   Redis is single-threaded per shard, so this is atomic against any
   concurrent batch write on the same hash — no version columns, no locks.
2. An [`HEXPIRE`](https://redis.io/docs/latest/commands/hexpire) over exactly the fields
   that were written, with the streaming TTL. Each streaming field carries
   its own per-field expiry independent of the rest of the hash. Stop the
   worker and these fields drop out one by one as their TTLs elapse, while
   the batch fields remain populated under the longer key-level TTL.

### Inference path (per request)

1. The model server picks the feature subset it needs (the schema is owned by
   the model, not the store).
2. It calls `store.getFeatures(userId, names)`, which is one
   [`HMGET`](https://redis.io/docs/latest/commands/hmget). Redis returns the values in
   the same order as the requested fields, with `null` for any field that
   doesn't exist (or has expired).
3. For batch inference, the model server calls
   `store.batchGetFeatures(userIds, names)`, which pipelines one
   [`HMGET`](https://redis.io/docs/latest/commands/hmget) per user across all `N` users
   in a single network round trip via Jedis's `Pipeline.sync()`.

## The feature-store helper

The `FeatureStore` class wraps the read/write paths
([source](https://github.com/redis/docs/blob/main/content/develop/use-cases/feature-store/java-jedis/FeatureStore.java)):

```java
import java.util.List;
import java.util.Map;

import redis.clients.jedis.JedisPool;
import redis.clients.jedis.JedisPoolConfig;

JedisPoolConfig cfg = new JedisPoolConfig();
cfg.setMaxTotal(64);
JedisPool pool = new JedisPool(cfg, "localhost", 6379);
FeatureStore store = new FeatureStore(pool,
    "fs:user:",
    24L * 60L * 60L,    // whole-entity TTL aligned with the daily batch cycle
    5L * 60L            // per-field TTL on each streaming feature
);

// Batch materialization: one HSET + EXPIRE per user, all pipelined.
Map<String, Map<String, Object>> rows = Map.of(
    "u0001", Map.of(
        "country_iso", "US", "risk_segment", "low",
        "tx_count_7d", 14, "avg_amount_30d", 92.40,
        "account_age_days", 612, "chargeback_count_180d", 0),
    "u0002", Map.of(
        "country_iso", "GB", "risk_segment", "medium",
        "tx_count_7d", 47, "avg_amount_30d", 220.10,
        "account_age_days", 1840, "chargeback_count_180d", 1));
store.bulkLoad(rows);

// Streaming write: HSET + HEXPIRE on just the fields that changed.
store.updateStreaming("u0001", Map.of(
    "last_login_ts", System.currentTimeMillis(),
    "last_device_id", "ios-9f02",
    "tx_count_5m", 3,
    "failed_logins_15m", 0,
    "session_country", "US"));

// Inference read: HMGET of whatever the model needs.
Map<String, String> features = store.getFeatures("u0001", List.of(
    "risk_segment", "tx_count_7d", "avg_amount_30d",
    "tx_count_5m", "failed_logins_15m"));

// Batch scoring: pipelined HMGET across many users.
Map<String, Map<String, String>> batch = store.batchGetFeatures(
    List.of("u0001", "u0002", "u0003"),
    List.of("risk_segment", "tx_count_5m", "failed_logins_15m"));
```

### Project layout

The four `.java` files and the `pom.xml` live in the same directory — the
`build-helper-maven-plugin` adds the project root as the source directory so
the Java sources sit alongside the build descriptor. Run with:

```bash
mvn package
mvn exec:java -Dexec.mainClass=DemoServer
```

### Data model

Each user is one Redis Hash. Every value is stored as a string — Redis hash
fields are bytes on the wire, so the helper encodes booleans as `"true"` /
`"false"` (`encodeValue(Object)` in `FeatureStore.java`) and renders
everything else with `Object.toString()`. The model server is responsible for
parsing back to the right type, the same way it would when reading any
serialized feature store.

```text
fs:user:u0001                                   TTL = 86400 s (key-level)
  country_iso=US                                <no field TTL>
  risk_segment=low                              <no field TTL>
  account_age_days=612                          <no field TTL>
  tx_count_7d=14                                <no field TTL>
  avg_amount_30d=92.40                          <no field TTL>
  chargeback_count_180d=0                       <no field TTL>
  last_login_ts=1716998413541                   TTL = 300 s (per field, HEXPIRE)
  last_device_id=ios-9f02                       TTL = 300 s (per field, HEXPIRE)
  tx_count_5m=3                                 TTL = 300 s (per field, HEXPIRE)
  failed_logins_15m=0                           TTL = 300 s (per field, HEXPIRE)
  session_country=US                            TTL = 300 s (per field, HEXPIRE)
```

The batch fields sit under the key-level `EXPIRE`. The streaming fields each
carry their own [`HEXPIRE`](https://redis.io/docs/latest/commands/hexpire). If the
streaming pipeline stops, the streaming fields drop one by one as their
per-field TTLs elapse; the batch fields stay until the daily key-level
`EXPIRE` fires (or the next batch cycle re-pins them).
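The encoding rule described at the start of this section can be sketched in a few lines. This is an illustration under assumptions, not the demo's actual `FeatureStore.java` code: the class name `FeatureEncodingSketch`, the null check, and the `is_vip` feature are hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical stand-in for the demo's encoding helper; not the actual source. */
final class FeatureEncodingSketch {

    /** Booleans become "true"/"false"; everything else is rendered with toString(). */
    static String encodeValue(Object value) {
        if (value == null) {
            // Assumption: null features are rejected rather than written as "null".
            throw new IllegalArgumentException("feature values must not be null");
        }
        if (value instanceof Boolean b) {
            return b ? "true" : "false";
        }
        return value.toString();
    }

    /** Encodes every field of one row, preserving insertion order. */
    static Map<String, String> encode(Map<String, Object> fields) {
        Map<String, String> out = new LinkedHashMap<>();
        for (Map.Entry<String, Object> e : fields.entrySet()) {
            out.put(e.getKey(), encodeValue(e.getValue()));
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, Object> row = new LinkedHashMap<>();
        row.put("risk_segment", "low");
        row.put("tx_count_7d", 14);
        row.put("avg_amount_30d", 92.40);
        row.put("is_vip", true); // hypothetical boolean feature
        System.out.println(encode(row));
    }
}
```

One consequence of relying on `toString()` is that a `double` such as `92.40` is stored as `92.4`: trailing zeros are not preserved, so the reader should parse numerically rather than compare strings.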

### Bulk-loading batch features

`bulkLoad` queues one `HSET` and one `EXPIRE` per user into a single
`Pipeline` and calls `sync()` to ship the lot. With 500 users that's 1000
commands in one network call — Redis processes them sequentially on the
server side but the client only pays one RTT.

```java
public int bulkLoad(Map<String, Map<String, Object>> rows, long ttlSeconds) {
    if (rows.isEmpty()) return 0;
    try (Jedis jedis = pool.getResource()) {
        Pipeline pipe = jedis.pipelined();
        for (Map.Entry<String, Map<String, Object>> e : rows.entrySet()) {
            String key = keyFor(e.getKey());
            Map<String, String> encoded = encode(e.getValue());
            pipe.hset(key, encoded);
            pipe.expire(key, ttlSeconds);
        }
        pipe.sync();
    }
    ...
}
```

Jedis's `pipelined()` is a non-transactional batch: commands queue up and
ship in one round trip, but they don't run inside a `MULTI/EXEC` block.
That's the right choice here because each user's `HSET` + `EXPIRE` pair is
independent of every other user's, and an all-or-nothing transaction would
block the server for the duration of the batch. For the rare case where the
pair has to be inseparable (a server crash between the two would leave the
entity without a key-level TTL) you'd wrap each user in a `Transaction` or a
[Lua script](https://redis.io/docs/latest/develop/programmability/eval-intro); for a
daily ingestion job that runs end-to-end every cycle, the next run re-pins
the TTL — no extra machinery needed.

In production, the equivalent of this script runs as an offline pipeline (a
Spark or Feast `materialize` job) that reads from the warehouse and writes
into Redis. The
[Feast `RedisOnlineStore`](https://docs.feast.dev/reference/online-stores/redis)
provider does exactly this under the hood; the in-house
[Redis Feature Form](https://redis.io/docs/latest/develop/ai/featureform) integration
covers the materialize + serve path end-to-end.

### Streaming writes with per-field TTL

`updateStreaming` is the linchpin of the mixed-staleness story:

```java
public void updateStreaming(String entityId, Map<String, Object> fields, long ttlSeconds) {
    if (fields.isEmpty()) return;
    String key = keyFor(entityId);
    Map<String, String> encoded = encode(fields);
    String[] names = encoded.keySet().toArray(new String[0]);

    List<Long> expireCodes;
    try (Jedis jedis = pool.getResource()) {
        Pipeline pipe = jedis.pipelined();
        pipe.hset(key, encoded);
        Response<List<Long>> expireResp = pipe.hexpire(key, ttlSeconds, names);
        pipe.sync();
        expireCodes = expireResp.get();
    }
    for (Long code : expireCodes) {
        if (code == null || code != 1L) {
            throw new IllegalStateException(
                "HEXPIRE did not set every field TTL for " + key + ": " + expireCodes);
        }
    }
    ...
}
```

[`HEXPIRE`](https://redis.io/docs/latest/commands/hexpire) sets the TTL on *individual*
hash fields, not on the whole key. The two commands are queued in one
`Pipeline` and Redis runs them in order: the `HSET` first creates or
overwrites the fields, then `HEXPIRE` attaches a TTL to each of those same
fields. `HEXPIRE` returns one status code per field — `1` if the TTL was
set, `2` if the expiry was 0 or in the past (so Redis deleted the field
instead), `0` if an `NX | XX | GT | LT` conditional flag was set and not met
(we never use one here), `-2` if the field doesn't exist on the key. The
helper throws if any code is anything other than `1`, so the "every
streaming write renews its TTL" invariant fails loudly rather than silently
leaving a streaming field with no expiry attached.
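To make those status codes easier to act on, a caller could translate them into named states before deciding whether to throw. The sketch below is one possible shape, assuming the codes arrive in the same order as the field names passed to `HEXPIRE`; the class, enum, and method names are hypothetical and not part of Jedis or the demo.

```java
import java.util.List;
import java.util.Optional;

/** Illustrative helper for reading HEXPIRE's per-field reply; names are hypothetical. */
final class HexpireCodesSketch {
    enum Status { TTL_SET, FIELD_DELETED, CONDITION_NOT_MET, NO_SUCH_FIELD, UNKNOWN }

    /** Maps one HEXPIRE status code to a readable status. */
    static Status describe(Long code) {
        if (code == null) return Status.UNKNOWN;
        return switch (code.intValue()) {
            case 1 -> Status.TTL_SET;            // TTL was applied
            case 2 -> Status.FIELD_DELETED;      // expiry was 0 or in the past
            case 0 -> Status.CONDITION_NOT_MET;  // NX | XX | GT | LT not satisfied
            case -2 -> Status.NO_SUCH_FIELD;     // field is not on the key
            default -> Status.UNKNOWN;
        };
    }

    /** Returns a message for the first field whose TTL was not set, if any. */
    static Optional<String> firstFailure(List<String> names, List<Long> codes) {
        if (names.size() != codes.size()) {
            return Optional.of("expected " + names.size() + " codes, got " + codes.size());
        }
        for (int i = 0; i < names.size(); i++) {
            Status status = describe(codes.get(i));
            if (status != Status.TTL_SET) {
                return Optional.of(names.get(i) + ": " + status);
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        List<String> names = List.of("tx_count_5m", "session_country");
        System.out.println(firstFailure(names, List.of(1L, 1L)));
        System.out.println(firstFailure(names, List.of(1L, -2L)));
    }
}
```

Naming the failing field in the error message makes it quicker to tell a missing field apart from an unmet condition when the invariant trips.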

`Response<List<Long>>` is Jedis's deferred-result wrapper for pipelined
commands: queue the command, call `pipe.sync()` to ship the batch, then read
each result via `.get()`. The `Response` for `hexpire` returns the per-field
codes; that list is what the helper validates above.

If a streaming pipeline stops, the streaming fields drop out one by one as
their per-field TTLs elapse — there is no application-side cleaner involved.
[`HTTL`](https://redis.io/docs/latest/commands/httl) lets the model side inspect the
remaining TTL on any field, which is useful both for debugging ("why is this
feature missing?" → "it expired three seconds ago") and as a freshness signal
in the model itself.
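As a rough illustration of using `HTTL` as a freshness signal, the sketch below interprets a single `HTTL` reply value: `-2` means the field does not exist, `-1` means it exists without a field-level TTL, and any other value is the remaining lifetime in seconds. The class and method names are hypothetical, and the age estimate assumes every streaming write resets the field to the same configured TTL.

```java
import java.util.OptionalLong;

/** Hypothetical helper that interprets one HTTL reply value for one field. */
final class FieldFreshnessSketch {

    /** Turns an HTTL value into a short human-readable description. */
    static String describe(long httl) {
        if (httl == -2) return "missing (never written, or already expired)";
        if (httl == -1) return "present, no field TTL (covered by the key-level TTL only)";
        return "present, expires in " + httl + " s";
    }

    /**
     * Approximate seconds since the last write, assuming each write sets the
     * field TTL to configuredTtlSeconds. Empty when the field has no TTL or is missing.
     */
    static OptionalLong approxAgeSeconds(long httl, long configuredTtlSeconds) {
        if (httl < 0) return OptionalLong.empty();
        return OptionalLong.of(Math.max(0, configuredTtlSeconds - httl));
    }

    public static void main(String[] args) {
        long streamingTtl = 300; // matches the demo's default streaming TTL
        System.out.println(describe(297) + ", age " + approxAgeSeconds(297, streamingTtl));
        System.out.println(describe(-1));
        System.out.println(describe(-2));
    }
}
```

Because `HTTL` reports whole seconds, the derived age is only approximate; `HPTTL` gives millisecond resolution if the model needs a finer signal.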

> **HEXPIRE requires Redis 7.4 or later.** `HEXPIRE` and the field-level TTL
> commands (`HTTL`, `HPERSIST`, `HEXPIREAT`, `HPEXPIRE`, `HPEXPIREAT`,
> `HPTTL`, `HEXPIRETIME`, `HPEXPIRETIME`) were added in Redis 7.4. Jedis 5.2
> was the first release with the bindings; the demo's `pom.xml` pins 6.2.
> On older Redis builds you would have to put streaming features on their
> own keys (one key per feature, or one key per feature group) and set a
> key-level `EXPIRE` instead — at the cost of giving up the single-`HMGET`
> retrieval.

### Inference reads with HMGET

`getFeatures` is one `HMGET`:

```java
public Map<String, String> getFeatures(String entityId, List<String> fieldNames) {
    String key = keyFor(entityId);
    Map<String, String> out = new LinkedHashMap<>();
    if (fieldNames == null) {
        try (Jedis jedis = pool.getResource()) {
            Map<String, String> all = jedis.hgetAll(key);
            if (all != null) out.putAll(all);
        }
        return out;
    }
    if (fieldNames.isEmpty()) return out;
    List<String> values;
    try (Jedis jedis = pool.getResource()) {
        values = jedis.hmget(key, fieldNames.toArray(new String[0]));
    }
    for (int i = 0; i < fieldNames.size(); i++) {
        String v = values.get(i);
        if (v != null) out.put(fieldNames.get(i), v);
    }
    return out;
}
```

The model knows exactly which features it consumes, so the request path
always takes the `HMGET` branch with an explicit field list — that's the
sub-millisecond path. `HGETALL` is the right call for debugging (which is
what the demo's "Inspect" panel does) but not for serving: it forces Redis
to serialize every field, including ones the model doesn't need.

Fields that don't exist (because they were never written, or because they
expired) come back as `null` in the `List<String>` Jedis returns. The helper
drops them from the result `Map` so the caller sees only the features that
are actually available. A real model server would either treat missing
values as a feature ("this user has no streaming signal yet") or fall back
to a default from the model's training data.
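On the model-server side, parsing the string values back to typed features with a per-feature fallback might look like the sketch below. It is a minimal illustration, not part of the demo: the class name, the method names, and the fallback values are all assumptions, and a malformed value is allowed to throw rather than being silently replaced.

```java
import java.util.Map;

/** Hypothetical typed accessors over the Map<String, String> returned by a feature read. */
final class FeatureDecodeSketch {

    /** Parses a long feature, or returns the fallback when the field is absent. */
    static long longOr(Map<String, String> features, String name, long fallback) {
        String raw = features.get(name);
        return raw == null ? fallback : Long.parseLong(raw);
    }

    /** Parses a double feature, or returns the fallback when the field is absent. */
    static double doubleOr(Map<String, String> features, String name, double fallback) {
        String raw = features.get(name);
        return raw == null ? fallback : Double.parseDouble(raw);
    }

    /** Returns 1 when the field is present and 0 otherwise, for use as a "has signal" feature. */
    static int presence(Map<String, String> features, String name) {
        return features.containsKey(name) ? 1 : 0;
    }

    public static void main(String[] args) {
        // Streaming fields have expired here, so only batch features came back.
        Map<String, String> features = Map.of("tx_count_7d", "14", "avg_amount_30d", "92.4");
        System.out.println(longOr(features, "tx_count_7d", 0L));
        System.out.println(doubleOr(features, "avg_amount_30d", 0.0));
        System.out.println(longOr(features, "tx_count_5m", 0L));   // falls back
        System.out.println(presence(features, "tx_count_5m"));     // missing-signal indicator
    }
}
```

Exposing presence as its own feature lets the model distinguish "zero transactions in five minutes" from "no streaming signal at all", which a default value alone would blur.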

### Batch scoring with pipelined HMGET

For batch inference, the same `HMGET` shape pipelines across users:

```java
public Map<String, Map<String, String>> batchGetFeatures(
        List<String> entityIds, List<String> fieldNames) {
    if (entityIds.isEmpty() || fieldNames.isEmpty()) {
        return Collections.emptyMap();
    }
    String[] names = fieldNames.toArray(new String[0]);
    List<Response<List<String>>> responses = new ArrayList<>(entityIds.size());
    try (Jedis jedis = pool.getResource()) {
        Pipeline pipe = jedis.pipelined();
        for (String id : entityIds) {
            responses.add(pipe.hmget(keyFor(id), names));
        }
        pipe.sync();
    }
    Map<String, Map<String, String>> out = new LinkedHashMap<>();
    for (int i = 0; i < entityIds.size(); i++) {
        List<String> values = responses.get(i).get();
        Map<String, String> row = new LinkedHashMap<>();
        for (int j = 0; j < fieldNames.size(); j++) {
            String v = values.get(j);
            if (v != null) row.put(fieldNames.get(j), v);
        }
        out.put(entityIds.get(i), row);
    }
    return out;
}
```

One round trip for the whole batch — the demo regularly returns 30 users in
~1 ms against a local Redis. On a real network the round trip dominates;
pipelining is what keeps batch scoring practical.

A Redis Cluster is different: a single `Pipeline.sync()` is bound to one
shard, because a pipeline travels over one connection to one node while a
cluster spreads keys across nodes by hash slot.
For batch reads on a cluster, use
[`JedisCluster`](https://redis.io/docs/latest/develop/clients/jedis) and either fan out
parallel `hmget` calls (the cluster client routes per-shard for you) or, for
tighter control, group the IDs by hash slot ahead of time and issue one
`Pipeline.sync()` against each shard's connection in parallel. A hash tag
like `fs:user:{vip}:u0001` forces a known set of keys onto the same shard so
one pipeline can cover all of them in a single round trip.
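Grouping keys by hash slot ahead of time can be sketched without any client library. The code below reimplements the slot calculation from the Redis Cluster specification (CRC16/XMODEM of the key, or of the hash tag when one is present, modulo 16384) purely for illustration; the class and method names are hypothetical, and a cluster-aware client computes the same value internally.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Illustrative slot calculator and grouper; not part of Jedis or the demo. */
final class HashSlotSketch {

    /** CRC16/XMODEM: polynomial 0x1021, initial value 0, no reflection. */
    static int crc16(byte[] data) {
        int crc = 0;
        for (byte b : data) {
            crc ^= (b & 0xFF) << 8;
            for (int i = 0; i < 8; i++) {
                crc = (crc & 0x8000) != 0 ? (crc << 1) ^ 0x1021 : crc << 1;
                crc &= 0xFFFF;
            }
        }
        return crc;
    }

    /** Hash slot for a key, honoring a non-empty {hash tag} if the key has one. */
    static int slot(String key) {
        int open = key.indexOf('{');
        if (open >= 0) {
            int close = key.indexOf('}', open + 1);
            if (close > open + 1) {
                key = key.substring(open + 1, close); // hash only the tag contents
            }
        }
        return crc16(key.getBytes(StandardCharsets.UTF_8)) % 16384;
    }

    /** Buckets keys by slot so each bucket can be pipelined to the node that owns it. */
    static Map<Integer, List<String>> groupBySlot(List<String> keys) {
        Map<Integer, List<String>> groups = new TreeMap<>();
        for (String key : keys) {
            groups.computeIfAbsent(slot(key), s -> new ArrayList<>()).add(key);
        }
        return groups;
    }

    public static void main(String[] args) {
        List<String> keys = List.of(
            "fs:user:u0001", "fs:user:u0002",
            "fs:user:{vip}:u0001", "fs:user:{vip}:u0002");
        groupBySlot(keys).forEach((slot, ks) -> System.out.println(slot + " -> " + ks));
    }
}
```

Slots are finer-grained than shards: each node owns a range of slots, so a real implementation would map each slot to its owning node (for example from the cluster topology the client already caches) and merge the buckets per node before pipelining.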

## The streaming worker

`StreamingWorker.java` is the demo's stand-in for whatever Flink, Kafka
Streams, or bespoke service computes the real-time features
([source](https://github.com/redis/docs/blob/main/content/develop/use-cases/feature-store/java-jedis/StreamingWorker.java)).
It runs as a daemon `Thread` next to the demo server so the UI can start,
pause, and resume it; in production this code would live in the streaming
layer.

Every tick the worker picks a few random users, generates a new value for
each streaming feature, and calls `store.updateStreaming(userId, fields)`.
The demo defaults to 5 users per tick at 1-second intervals, so a 200-user
store sees roughly three quarters of its users refreshed in the first minute
(1 − (1 − 5/200)^60 ≈ 0.78, assuming each tick samples uniformly), and
nearly all of them after a few minutes. Raise `--users-per-tick` or drop
`--seed-users` if you'd rather touch every user quickly.

```java
private void doTick() {
    List<String> ids = store.listEntityIds(500);
    if (ids.isEmpty()) return;
    List<String> picks = sample(ids, usersPerTick);
    long nowMs = System.currentTimeMillis();
    for (String id : picks) {
        Map<String, Object> fields = new LinkedHashMap<>();
        fields.put("last_login_ts", nowMs);
        fields.put("last_device_id", choice(DEVICE_IDS));
        fields.put("tx_count_5m", intn(13));
        fields.put("failed_logins_15m", weightedInt(FAILED_LOGIN_BUCKETS, FAILED_LOGIN_WEIGHTS));
        fields.put("session_country", choice(SESSION_COUNTRIES));
        store.updateStreaming(id, fields);
    }
    ...
}
```

Pausing the worker is what shows off the mixed-staleness behavior: leave it
paused for longer than `streamingTtlSeconds` and the streaming fields
disappear from every user's hash one by one, while the batch fields remain
under the longer key-level `EXPIRE`. The demo's `Pause / resume` button lets
you see this happen in real time.

`pause()` only blocks *future* ticks from running — the thread checks the
flag at the top of the loop and skips its turn. A reset that's about to
`DEL` every key needs to wait out an already-running tick too, which is
what `waitForIdle()` is for: the demo's `Reset` handler calls
`worker.pause()` *and* `worker.waitForIdle()` before it issues the `DEL`
sweep, so a mid-flight tick can't recreate a user under a streaming-only
hash with no key-level TTL.
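The pause-then-wait handshake can be illustrated with a plain monitor. This is a minimal sketch of the idea, not the demo's `StreamingWorker` implementation: the class name and `runTick` method are hypothetical, and the real worker may use different synchronization.

```java
/** Hypothetical sketch of a worker that can be paused and drained before a reset. */
final class PausableWorkerSketch {
    private final Object tickLock = new Object();
    private volatile boolean paused;

    /** Runs one tick unless paused; returns whether the tick actually ran. */
    boolean runTick(Runnable tick) {
        synchronized (tickLock) {
            // The flag is checked while holding the lock, so a tick that starts
            // after pause() + waitForIdle() can never slip through.
            if (paused) return false;
            tick.run();
            return true;
        }
    }

    /** Stops future ticks; does not interrupt one that is already running. */
    void pause() { paused = true; }

    void resume() { paused = false; }

    /** Blocks until any in-flight tick has finished and released the lock. */
    void waitForIdle() {
        synchronized (tickLock) {
            // Acquiring the lock is the wait; there is nothing to do once it is held.
        }
    }

    public static void main(String[] args) {
        PausableWorkerSketch worker = new PausableWorkerSketch();
        System.out.println(worker.runTick(() -> System.out.println("tick")));
        worker.pause();
        worker.waitForIdle(); // safe point: a reset could issue its DEL sweep here
        System.out.println(worker.runTick(() -> System.out.println("tick")));
    }
}
```

The ordering matters: calling `pause()` first guarantees no new tick starts, and `waitForIdle()` then only has to outlast the one tick that may already hold the lock.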

## The batch builder

`BuildFeatures.java` is the demo's nightly materializer
([source](https://github.com/redis/docs/blob/main/content/develop/use-cases/feature-store/java-jedis/BuildFeatures.java)).
It generates synthetic feature rows and calls `store.bulkLoad` once. The
synthesis itself is not the point — in a real deployment the equivalent code
reads from the offline store (Snowflake, BigQuery, Iceberg) and writes the
resulting hashes into Redis.

```java
public static Map<String, Map<String, Object>> synthesizeUsers(int count, long seed) {
    Random rng = new Random(seed);
    Map<String, Map<String, Object>> users = new LinkedHashMap<>(count);
    for (int i = 1; i <= count; i++) {
        String uid = String.format("u%04d", i);
        Map<String, Object> row = new LinkedHashMap<>();
        row.put("country_iso", COUNTRY_CHOICES.get(rng.nextInt(COUNTRY_CHOICES.size())));
        row.put("risk_segment", weightedChoice(rng, RISK_SEGMENTS, RISK_WEIGHTS));
        row.put("account_age_days", 7 + rng.nextInt(2394));
        row.put("tx_count_7d", rng.nextInt(81));
        row.put("avg_amount_30d", Math.round((5.0 + rng.nextDouble() * 345.0) * 100.0) / 100.0);
        row.put("chargeback_count_180d", weightedChoiceInt(rng, CHARGEBACK_BUCKETS, CHARGEBACK_WEIGHTS));
        users.put(uid, row);
    }
    return users;
}
```

You can run the builder on its own (independently of the demo server) to
populate Redis from the command line:

```bash
mvn exec:java -Dexec.mainClass=BuildFeatures -Dexec.args="--count 500 --ttl-seconds 3600"
```

That writes 500 users at `fs:user:*` with a one-hour key-level TTL, which is
how a typical operator would pre-seed a feature store from the command line
when debugging.

## The interactive demo

`DemoServer.java` runs the JDK `HttpServer` on port 8088 with a fixed thread
pool. The HTML page lets you:

* **Bulk-load** any number of users (default 200) with a configurable
  key-level TTL. Drop the TTL to 30 s and watch the entire store expire on
  schedule — the same thing that happens if a daily refresher fails.
* See the **store state** at a glance: user count, batch / streaming TTLs,
  cumulative read/write counters.
* See the **streaming worker** status (running / paused, ticks completed,
  writes performed) and **pause or resume** it. Leave it paused for longer
  than the streaming TTL to watch streaming fields drop out.
* Run an **inference read** for any user with a chosen feature subset, and
  see the value, the per-field TTL, and the read latency.
* Run **batch scoring** with a pipelined `HMGET` across `N` users and see
  the total elapsed time plus the per-user breakdown.
* **Inspect** any user's full hash with field-level TTLs and the key-level
  TTL — the right view for debugging "why is this feature missing?" at
  read time.

The server holds one `FeatureStore` and one `StreamingWorker` for the
lifetime of the process, plus a `JedisPool` that all handlers borrow
connections from. Endpoints:

| Endpoint                  | What it does                                                                        |
|---------------------------|-------------------------------------------------------------------------------------|
| `GET  /state`             | User count, TTL config, stats counters, worker status.                              |
| `POST /bulk-load`         | Pipelined `HSET` + `EXPIRE` over N synthetic users with a chosen TTL.               |
| `POST /worker/toggle`     | Pause / resume the streaming worker.                                                |
| `POST /read`              | `HMGET` a chosen feature subset for one user; report latency and per-field TTLs.    |
| `POST /batch-read`        | Pipeline `HMGET` across N users; report total latency and per-entity field counts.  |
| `GET  /inspect`           | `HGETALL` + `HTTL` for one user; full hash view with per-field TTLs.                |
| `POST /reset`             | Drop every user under the key prefix (used by the demo's reset button).             |

## Prerequisites

* **Redis 7.4 or later.** [`HEXPIRE`](https://redis.io/docs/latest/commands/hexpire) and
  [`HTTL`](https://redis.io/docs/latest/commands/httl) were added in Redis 7.4; the
  demo relies on per-field TTL for the mixed-staleness story.
* **Java 17 or later.** The demo uses switch expressions with arrow
  labels (`case "..." -> ...`), records, and text blocks.
* **Jedis 5.2 or later.** The demo's `pom.xml` pins
  `redis.clients:jedis:6.2.0`. Field-level TTL bindings (`hexpire`, `httl`,
  `hpersist`) ship from Jedis 5.2.

If your Redis server is running elsewhere, start the demo with `--redis-host`
and `--redis-port`.

## Running the demo

### Get the source files

The demo lives in a small Maven project under
[`feature-store/java-jedis`](https://github.com/redis/docs/tree/main/content/develop/use-cases/feature-store/java-jedis).
Clone the repo or copy the directory:

```bash
git clone https://github.com/redis/docs.git
cd docs/content/develop/use-cases/feature-store/java-jedis
mvn package
```

### Start the demo server

From the project directory:

```bash
mvn exec:java -Dexec.mainClass=DemoServer
```

You should see:

```text
Dropping any existing users under 'fs:user:*' for a clean demo run (pass --no-reset to keep them).
Redis feature-store demo server listening on http://127.0.0.1:8088
Using Redis at localhost:6379 with key prefix 'fs:user:' (batch TTL 86400s, streaming TTL 300s)
Materialized 200 user(s); streaming worker running.
```

By default the demo wipes the configured key prefix on startup so each run
starts from a clean state. Pass `--no-reset` to keep any existing data, or
`--key-prefix <prefix>` to point the demo at a different prefix entirely.
Maven exec passes CLI args via `-Dexec.args`:

```bash
mvn exec:java -Dexec.mainClass=DemoServer \
    -Dexec.args="--port 9000 --streaming-ttl-seconds 30"
```

Open [http://127.0.0.1:8088](http://127.0.0.1:8088) in a browser. Useful
things to try:

* Pick a user and click **Read features** with a mixed batch/streaming
  subset — you'll see batch fields with no per-field TTL (covered by the
  key-level TTL) and streaming fields with a positive per-field TTL.
* Click **Pipeline HMGET** with `count=100` to see the latency of a
  100-user batch read.
* Click **Pause / resume** on the streaming worker and leave it paused for
  ~5 minutes (or restart the server with `--streaming-ttl-seconds 30` to
  make it visible in seconds). Re-run **Read features** on any user and
  watch the streaming fields disappear while the batch fields stay.
* Click **Inspect** on a user to see the full hash with field-level TTLs.
* Click **Bulk-load** with a short TTL (say 30 seconds) and watch the user
  count fall to zero once that TTL elapses, which is the same thing that
  happens if a daily batch run fails to land.
* Click **Reset** to drop every user and start over.

The server is read/write against your local Redis. The default key prefix
is `fs:user:`. Pass `--no-reset` to keep existing data across restarts, or
`--redis-host` / `--redis-port` to point at a different Redis.

## Production usage

The guidance below focuses on the production concerns that are specific to
running a feature store on Redis. For the generic Jedis production checklist
— `JedisPool` sizing, AUTH/ACL, retry policy, sentinel/cluster failover —
see the
[Jedis production usage guide](https://redis.io/docs/latest/develop/clients/jedis/produsage).
For TLS specifically, follow the
[connect-with-TLS recipe](https://redis.io/docs/latest/develop/clients/jedis/connect#connect-to-your-production-redis-with-tls).
The feature-store demo runs against `localhost` with the defaults; a real
deployment should harden the client first.

### Pick the batch TTL to outlast a failed refresher

The whole-entity `EXPIRE` is your safety net against silent staleness from a
broken batch pipeline. Set it longer than the batch delay you are prepared to
tolerate, so a single missed run doesn't take the feature store offline, but
short enough that a sustained outage causes loud failures (missing entities)
rather than quiet ones (yesterday's features being scored as today's). A
common choice is twice the expected refresh interval: 48 hours for a daily
batch, 12 hours for a 6-hour batch.

The same logic applies to the per-field streaming TTL: a few times the
expected update interval so a slow-but-alive streaming worker doesn't churn
features needlessly, but short enough that a stalled worker causes visible
freshness failures.
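The sizing rule is simple arithmetic, sketched below with `java.time.Duration`. The class and method names are hypothetical, and the streaming multiplier is left as a parameter because "a few times" depends on how bursty the update stream is.

```java
import java.time.Duration;

/** Hypothetical helper that derives TTLs from refresh intervals. */
final class TtlSizingSketch {

    /** Key-level TTL: twice the expected batch refresh interval. */
    static Duration batchTtl(Duration refreshInterval) {
        return refreshInterval.multipliedBy(2);
    }

    /** Per-field TTL: a small multiple of the expected streaming update interval. */
    static Duration streamingTtl(Duration updateInterval, int multiplier) {
        if (multiplier < 1) {
            throw new IllegalArgumentException("multiplier must be >= 1");
        }
        return updateInterval.multipliedBy(multiplier);
    }

    public static void main(String[] args) {
        System.out.println("daily batch   -> " + batchTtl(Duration.ofHours(24)).getSeconds() + " s");
        System.out.println("6-hour batch  -> " + batchTtl(Duration.ofHours(6)).getSeconds() + " s");
        // Assumption for illustration: updates roughly once a minute, TTL at 5x that.
        System.out.println("streaming     -> " + streamingTtl(Duration.ofSeconds(60), 5).getSeconds() + " s");
    }
}
```

The resulting second counts are what would be passed as the key-level and per-field TTL arguments when constructing the store.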

### Co-locate the online store with serving, not with training

The online store's hash representation does *not* have to match the schema
in your offline store. The batch materialization step is your chance to
flatten joins, encode categoricals, and project to whatever shape the model
server wants — so the request path is exactly one `HMGET` and zero
transforms.

The training pipeline reads from the offline store with its own schema; the
serving pipeline reads from Redis with the flattened serving schema.
Deriving both from the same feature definitions and transformation code is
what prevents training-serving skew.

### Pipeline batch reads across shards

On a single Redis instance, `Pipeline.sync()` across `N` `hmget` calls is
one round trip. A Redis Cluster is different: a single `Pipeline.sync()` is
bound to one shard, because a pipeline travels over one connection to one
node, and the keys for a typical user batch will land on
multiple shards. For batch reads on a cluster, use
[`JedisCluster`](https://redis.io/docs/latest/develop/clients/jedis) — its
implementation routes per-shard for you. For tighter control, group the IDs
by hash slot ahead of time and issue one `Pipeline.sync()` per shard's
connection in parallel. For a small number of frequently-queried users (a
top-N customer list, for example), a hash tag like `fs:user:{vip}:u0001`
forces a known set of keys onto the same shard so one pipeline can cover
all of them in a single round trip.
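
To make the "group the IDs by hash slot" step concrete, here is a minimal, dependency-free sketch of the slot calculation (CRC16/XMODEM of the key, or of its hash tag if present, modulo 16384) and a grouping helper. `SlotGrouping` and its methods are hypothetical names. The sketch groups by slot only; mapping slots to nodes requires the cluster topology, which it does not fetch. Jedis also ships its own slot utility, which you would normally prefer over reimplementing the checksum.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Illustrative only: groups keys by Redis Cluster hash slot. */
final class SlotGrouping {
    private static final int SLOT_COUNT = 16384;

    private SlotGrouping() {}

    /** CRC16/XMODEM (polynomial 0x1021, initial value 0), as used for cluster key slots. */
    static int crc16(byte[] data) {
        int crc = 0;
        for (byte b : data) {
            crc ^= (b & 0xFF) << 8;
            for (int bit = 0; bit < 8; bit++) {
                crc = ((crc & 0x8000) != 0) ? (crc << 1) ^ 0x1021 : crc << 1;
                crc &= 0xFFFF;
            }
        }
        return crc;
    }

    /** Returns the hash tag (text between the first '{' and the next '}') if non-empty, else the key. */
    static String hashTagOrKey(String key) {
        int open = key.indexOf('{');
        if (open >= 0) {
            int close = key.indexOf('}', open + 1);
            if (close > open + 1) {
                return key.substring(open + 1, close);
            }
        }
        return key;
    }

    static int slot(String key) {
        return crc16(hashTagOrKey(key).getBytes(StandardCharsets.UTF_8)) % SLOT_COUNT;
    }

    /** Buckets keys by slot; each bucket can then be sent to the node that owns that slot. */
    static Map<Integer, List<String>> groupBySlot(List<String> keys) {
        Map<Integer, List<String>> bySlot = new TreeMap<>();
        for (String key : keys) {
            bySlot.computeIfAbsent(slot(key), s -> new ArrayList<>()).add(key);
        }
        return bySlot;
    }

    public static void main(String[] args) {
        List<String> keys = List.of(
                "fs:user:u0001", "fs:user:u0002",
                "fs:user:{vip}:u0001", "fs:user:{vip}:u0002");
        groupBySlot(keys).forEach((slot, group) -> System.out.println(slot + " -> " + group));
    }
}
```

The two `{vip}` keys always fall into the same bucket because only the tag `vip` is hashed, which is why a hash tag lets one pipeline cover a known set of keys.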

### Make HEXPIRE part of every streaming write

The single biggest correctness lever in this design is that the streaming
write applies `HEXPIRE` *every time*. If a streaming worker writes a field
without renewing its TTL, the field carries whatever expiry was there before
— possibly none, possibly stale — and the mixed-staleness invariant breaks.
Keep the `HSET` and `HEXPIRE` in the same pipeline or, safer still, in the
same [Lua script](https://redis.io/docs/latest/develop/programmability/eval-intro),
which runs both commands atomically so no call site can issue one without
the other.

### Avoid HGETALL on the request path

`HGETALL` reads every field on the hash, including ones the model doesn't
need. With dozens of features per entity, that is wasted serialization work
on the server and wasted bandwidth on the wire. Always specify the field
list explicitly with `hmget` in the model server.

The exception is debugging and feature-set discovery, where you genuinely
want the full hash. The demo's "Inspect" button uses `hgetAll` for exactly
this reason.

### Size the JedisPool for the request shape

Every `FeatureStore` helper method borrows a connection from the
`JedisPool` for the duration of one Redis call (or one `Pipeline.sync()`)
and returns it via the try-with-resources block. One HTTP handler can
therefore borrow several connections sequentially — `/read`, for example,
makes one `hmget` call, one `httl` call, and one `ttl` call, each of
which is its own borrow.

The demo uses `maxTotal=64`. In production, size `maxTotal` to comfortably
exceed your peak concurrent borrow count, which is roughly
`(concurrent HTTP handlers × Redis calls per handler in flight at once) +
(concurrent background worker borrows)`. Setting it too low forces some
borrows to block waiting for a returned connection: a read-latency cliff
that doesn't show up under load tests with very few clients.
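
The sizing rule above can be written down as a small calculation. This is only a sketch: `PoolSizing` and `estimateMaxTotal` are hypothetical names, and the headroom factor is an assumption standing in for "comfortably exceed". Because the demo's handlers borrow sequentially, calls in flight per handler would be 1 there.

```java
/** Hypothetical helper: estimates a JedisPool maxTotal from expected concurrency. */
final class PoolSizing {
    private PoolSizing() {}

    /**
     * @param concurrentHandlers          peak number of HTTP handlers running at once
     * @param callsInFlightPerHandler     Redis calls one handler has outstanding at the same time
     * @param concurrentBackgroundBorrows connections held at once by background workers
     * @param headroom                    safety factor, at least 1.0 (an assumption, tune to taste)
     */
    static int estimateMaxTotal(int concurrentHandlers,
                                int callsInFlightPerHandler,
                                int concurrentBackgroundBorrows,
                                double headroom) {
        if (concurrentHandlers < 0 || callsInFlightPerHandler < 0 || concurrentBackgroundBorrows < 0) {
            throw new IllegalArgumentException("counts must be non-negative");
        }
        if (headroom < 1.0) {
            throw new IllegalArgumentException("headroom must be at least 1.0");
        }
        int peakBorrows = concurrentHandlers * callsInFlightPerHandler + concurrentBackgroundBorrows;
        return (int) Math.ceil(peakBorrows * headroom);
    }

    public static void main(String[] args) {
        // 40 handlers, sequential borrows, 4 background borrows, 25% headroom.
        System.out.println(estimateMaxTotal(40, 1, 4, 1.25));
    }
}
```

The result is the value you would pass to `setMaxTotal` on the pool configuration; validate it under a load test with a realistic number of concurrent clients.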

### Inspect the store directly with redis-cli

When testing or troubleshooting, `redis-cli` shows the store's state directly:

```bash
# How many users currently in the store
redis-cli --scan --pattern 'fs:user:*' | wc -l

# One user's full hash and key-level TTL
redis-cli HGETALL fs:user:u0001
redis-cli TTL    fs:user:u0001

# Per-field TTL on the streaming fields
redis-cli HTTL fs:user:u0001 FIELDS 5 \
  last_login_ts last_device_id tx_count_5m failed_logins_15m session_country

# Sample HMGET as the model would issue it
redis-cli HMGET fs:user:u0001 risk_segment tx_count_7d avg_amount_30d tx_count_5m
```

A streaming field that returns `-2` from `HTTL` doesn't exist on the hash
(either it was never written, or it expired); `-1` means the field has no
TTL set (and is therefore covered only by the key-level `EXPIRE`); any
positive value is the remaining TTL in seconds.
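
If you check these replies from Java rather than from `redis-cli`, the same three cases can be classified with a small helper. This sketch assumes the per-field replies arrive as `long` values, one per requested field; `FieldTtlStatus` and its members are hypothetical names, not part of the demo.

```java
/** Hypothetical helper: interprets one per-field HTTL reply. */
final class FieldTtlStatus {
    enum State { MISSING, NO_TTL, EXPIRING }

    private FieldTtlStatus() {}

    /** Maps an HTTL reply to a state: -2 is missing, -1 is no field TTL, otherwise seconds remaining. */
    static State classify(long httlReply) {
        if (httlReply == -2) {
            return State.MISSING;   // never written, or already expired
        }
        if (httlReply == -1) {
            return State.NO_TTL;    // covered only by the key-level EXPIRE
        }
        if (httlReply >= 0) {
            return State.EXPIRING;  // remaining TTL in seconds
        }
        throw new IllegalArgumentException("unexpected HTTL reply: " + httlReply);
    }

    /** Human-readable line for logs or a debug endpoint. */
    static String describe(String field, long httlReply) {
        switch (classify(httlReply)) {
            case MISSING:
                return field + ": missing or expired";
            case NO_TTL:
                return field + ": no per-field TTL";
            default:
                return field + ": expires in " + httlReply + "s";
        }
    }

    public static void main(String[] args) {
        System.out.println(describe("tx_count_5m", 240));
        System.out.println(describe("risk_segment", -1));
        System.out.println(describe("last_device_id", -2));
    }
}
```

A streaming field that comes back as `NO_TTL` is worth alerting on, since it means a write reached the hash without its `HEXPIRE`.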

## Learn more

This example uses the following Redis commands:

* [`HSET`](https://redis.io/docs/latest/commands/hset) to write a feature or a whole
  feature row in one call.
* [`HMGET`](https://redis.io/docs/latest/commands/hmget) to retrieve any subset of
  features for one entity in one round trip.
* [`HGETALL`](https://redis.io/docs/latest/commands/hgetall) for debugging and
  feature-set discovery.
* [`HEXPIRE`](https://redis.io/docs/latest/commands/hexpire) and
  [`HTTL`](https://redis.io/docs/latest/commands/httl) for per-field TTL on streaming
  features (Redis 7.4+).
* [`EXPIRE`](https://redis.io/docs/latest/commands/expire) and
  [`TTL`](https://redis.io/docs/latest/commands/ttl) for the whole-entity TTL aligned
  with the batch materialization cycle.
* Pipelined `HMGET` across many entities for batch scoring with one network
  round trip — see
  [transactions and pipelining](https://redis.io/docs/latest/develop/clients/jedis/transpipe).

See the [Jedis documentation](https://redis.io/docs/latest/develop/clients/jedis) for
the full client reference, and the
[Hashes overview](https://redis.io/docs/latest/develop/data-types/hashes) for the deeper
conceptual model — including the listpack encoding that makes small hashes
particularly compact in memory, which matters at feature-store scale.