PFCOUNT

Syntax text Syntax diagram API methods

PFCOUNT key [key ...]

Client:

pfcount(
    *sources: KeyT  // One or more HyperLogLog keys
) → ResponseT  // The approximated number of unique elements observed via PFADD

PFCOUNT(
    keys: RedisVariadicArgument  // One or more keys of HyperLogLog structures to count
) → NumberReply  // The approximated number of unique elements

pfcount(
    key: byte[]  // The HyperLogLog key
) → long  // The approximated number of unique elements

pfcount(
    keys: byte[]...  // One or more HyperLogLog keys
) → long  // The approximated number of unique elements across all keys

pfcount(
    key: String  // The HyperLogLog key
) → long  // The approximated number of unique elements

pfcount(
    keys: String...  // One or more HyperLogLog keys
) → long  // The approximated number of unique elements across all keys

pfcount(
    keys: K...  // The keys
) → Long  // The approximated number of unique elements observed via PFADD

pfcount(
    keys: K...  // The keys
) → RedisFuture<Long>  // The approximated number of unique elements observed via PFADD

pfcount(
    keys: K...  // The keys
) → Mono<Long>  // The approximated number of unique elements observed via PFADD

PFCount(
    ctx: context.Context,  // Context for the command
    keys: ...string  // One or more HyperLogLog keys
) → *IntCmd  // The approximated number of unique elements

HyperLogLogLength(
    key: RedisKey,  // The HyperLogLog key
    flags: CommandFlags  // The flags to use for this operation
) → long  // The approximated number of unique elements observed via HyperLogLogAdd

HyperLogLogLength(
    keys: RedisKey[],  // The keys of the HyperLogLogs
    flags: CommandFlags  // The flags to use for this operation
) → long  // The approximated number of unique elements observed via HyperLogLogAdd

HyperLogLogLengthAsync(
    key: RedisKey,  // The HyperLogLog key
    flags: CommandFlags  // The flags to use for this operation
) → Task<long>  // The approximated number of unique elements observed via HyperLogLogAdd

HyperLogLogLengthAsync(
    keys: RedisKey[],  // The keys of the HyperLogLogs
    flags: CommandFlags  // The flags to use for this operation
) → Task<long>  // The approximated number of unique elements observed via HyperLogLogAdd

pfcount(
    $keyOrKeys: string[]|string,  // One or more HyperLogLog keys
    ...$keys: string  // Additional keys (optional)
) → int  // The approximated number of unique elements

pfcount(
    key: K  // The HyperLogLog key
) → usize  // The approximated number of unique elements

pfcount(
    key: K  // The HyperLogLog key
) → usize  // The approximated number of unique elements

Available since:: Redis Open Source 2.8.9
Time complexity:: O(1) with a very small average constant time when called with a single key. O(N) with N being the number of keys, and much bigger constant times, when called with multiple keys.
ACL categories:: @read, @hyperloglog, @slow,
Compatibility:: Redis Software and Redis Cloud compatibility

Note:

This command's behavior varies in clustered Redis environments. See the multi-key operations page for more information.

When called with a single key, PFCOUNT returns the approximated cardinality computed by the HyperLogLog data structure stored at the specified key, which is 0 if the key does not exist.

When called with multiple keys, PFCOUNT returns the approximated cardinality of the union of the HyperLogLogs passed by internally merging the HyperLogLogs stored at the provided keys into a temporary HyperLogLog.

The HyperLogLog data structure can be used in order to count unique elements in a set using just a small constant amount of memory, specifically 12k bytes for every HyperLogLog (plus a few bytes for the key itself).

The returned cardinality of the observed set is not exact, but approximated with a standard error of 0.81%.

For example, to take the count of all the unique search queries performed in a day, a program needs to call PFADD every time a query is processed. The estimated number of unique queries can be retrieved with PFCOUNT at any time.

Note: as a side effect of calling this function, it is possible that the HyperLogLog is modified, since the last 8 bytes encode the latest computed cardinality for caching purposes. So PFCOUNT is technically a write command.

Required arguments

key [key ...]

One or more HyperLogLog keys. With multiple keys, the command returns the approximated cardinality of their union.

Examples

Details

Performances

When PFCOUNT is called with a single key, performances are excellent even if in theory constant times to process a dense HyperLogLog are high. This is possible because the PFCOUNT uses caching in order to remember the cardinality previously computed, that rarely changes because most PFADD operations will not update any register. Hundreds of operations per second are possible.

When PFCOUNT is called with multiple keys, an on-the-fly merge of the HyperLogLogs is performed, which is slow, moreover the cardinality of the union can't be cached, so when used with multiple keys PFCOUNT may take a time in the order of magnitude of the millisecond, and should be not abused.

The user should take in mind that single-key and multiple-keys executions of this command are semantically different and have different performances.

HyperLogLog representation

Redis HyperLogLogs are represented using a double representation: the sparse representation suitable for HLLs counting a small number of elements (resulting in a small number of registers set to non-zero value), and a dense representation suitable for higher cardinalities. Redis automatically switches from the sparse to the dense representation when needed.

The sparse representation uses a run-length encoding optimized to store efficiently a big number of registers set to zero. The dense representation is a Redis string of 12288 bytes in order to store 16384 6-bit counters. The need for the double representation comes from the fact that using 12k (which is the dense representation memory requirement) to encode just a few registers for smaller cardinalities is extremely suboptimal.

Both representations are prefixed with a 16 bytes header, that includes a magic, an encoding / version field, and the cached cardinality estimation computed, stored in little endian format (the most significant bit is 1 if the estimation is invalid since the HyperLogLog was updated since the cardinality was computed).

The HyperLogLog, being a Redis string, can be retrieved with GET and restored with SET. Calling PFADD, PFCOUNT or PFMERGE commands with a corrupted HyperLogLog is never a problem, it may return random values but does not affect the stability of the server. Most of the times when corrupting a sparse representation, the server recognizes the corruption and returns an error.

The representation is neutral from the point of view of the processor word size and endianness, so the same representation is used by 32 bit and 64 bit processor, big endian or little endian.

More details about the Redis HyperLogLog implementation can be found in this blog post. The source code of the implementation in the hyperloglog.c file is also easy to read and understand, and includes a full specification for the exact encoding used for the sparse and dense representations.

Redis Software and Redis Cloud compatibility

Redis Software	Redis Cloud	Notes
✅ Standard ✅ Active-Active	✅ Standard ✅ Active-Active

Return information

RESP2 RESP3

Integer reply: the approximated number of unique elements observed via PFADD.

Integer reply: the approximated number of unique elements observed via PFADD