"We would not have been able to scale ChatGPT without Redis."
The real-time context engine for AI
LLMs and AI apps need just the right data at the right time to provide quality responses. Search, gather, and serve the right context for LLMs with the unified platform you already know and love.
Get accurate responses with hybrid search
Enterprises need their search engines to combine filtering and exact matching with vector search in a high-performance, scalable way. Vector-only databases can't keep up, leading to poor answers and costly architectural redesigns.
Improve RAG with the fastest vector database
Give users fast answers with retrieval-augmented generation (RAG) from our benchmark-leading vector database and configure search the way you want.
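The retrieval step at the heart of RAG can be sketched in plain Python. This is a conceptual illustration only: the toy vectors stand in for a real embedding model, and the brute-force similarity scan stands in for Redis vector search.

```python
import math

# Toy document embeddings. In production these come from an embedding
# model, and indexing/search is handled by a vector database.
DOCS = {
    "refunds": [0.9, 0.1, 0.0],
    "shipping": [0.1, 0.8, 0.1],
    "returns": [0.8, 0.2, 0.1],
}

def cosine(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def retrieve(query_vec, k=2):
    """Return the k document keys most similar to the query vector."""
    ranked = sorted(DOCS, key=lambda d: cosine(query_vec, DOCS[d]), reverse=True)
    return ranked[:k]

def build_prompt(question, query_vec):
    """Assemble retrieved context plus the user question into an LLM prompt."""
    context = ", ".join(retrieve(query_vec))
    return f"Context: {context}\nQuestion: {question}"

print(build_prompt("How do refunds work?", [0.85, 0.15, 0.05]))
```

The same pattern applies at scale: embed the question, retrieve the nearest documents, and prepend them as context so the LLM answers from your data rather than from memory alone.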
Recall key memories for agents
Assembling the right context for LLMs takes a thoughtful approach to identifying, summarizing, and retrieving relevant memories to deliver useful outputs. We manage it for you and work with leading third-party frameworks.
Cut LLM call costs with semantic caching
Cache the semantic meaning of frequent LLM queries so apps can answer commonly asked questions faster and at lower inference cost.
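The idea behind semantic caching can be sketched in a few lines of plain Python: before calling the LLM, compare the new prompt's embedding against previously answered prompts and reuse the answer on a near match. This is a toy in-memory sketch; a real implementation would store entries in Redis and use its vector search, and the threshold value here is arbitrary.

```python
import math

def cosine(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b)))

class SemanticCacheSketch:
    """Toy semantic cache: reuse an LLM response when a new prompt's
    embedding is close enough to a previously cached prompt's embedding."""

    def __init__(self, threshold=0.9):
        self.threshold = threshold
        self.entries = []  # list of (embedding, response) pairs

    def check(self, embedding):
        """Return a cached response if a similar prompt was seen, else None."""
        for cached_vec, response in self.entries:
            if cosine(embedding, cached_vec) >= self.threshold:
                return response
        return None

    def store(self, embedding, response):
        """Record a prompt embedding and the LLM response it produced."""
        self.entries.append((embedding, response))

cache = SemanticCacheSketch()
cache.store([1.0, 0.0], "Our store opens at 9am.")
print(cache.check([0.98, 0.05]))  # near-duplicate question: cache hit
print(cache.check([0.0, 1.0]))    # unrelated question: miss, call the LLM
```

A cache hit skips the LLM call entirely, which is where the latency and cost savings come from.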
Serve real-time ML features with feature store
Deliver live features, like user behavior or risk scores, to your models with sub-millisecond latency. Our feature store orchestrates batch, streaming, and on-demand pipelines.
"Using Redis, Bank of America has built fast, high-quality digital experiences for their clients at scale, from use cases like caching and session management, to event streaming and AI infrastructure."
"We're using Redis Cloud for everything persistent in OpenGPTs, including as a vector store for retrieval and a database to store messages and agent configurations. The fact that you can do all of those in one database from Redis is really appealing."
"Better answers and more current real-time information with up to 2.35X better performance with the Xeon 6 and Redis."
Built on Redis
Use the Redis you know and love. No additional contracts or security reviews.
Connects to GenAI ecosystem
Integrate with top GenAI tools so you can build how you want.
Pre-built libraries
Don’t start from scratch. RedisVL automates core functionality for you.
Sample notebooks
Explore our use cases with ecosystem integrations to start building faster.
Companies that trust Redis for AI
Get started
Meet with an expert and start using Redis for AI today.
Frequently asked questions
More questions? See our Docs page.

Why is Redis faster than traditional databases for AI workloads?
Traditional databases often introduce latency due to disk-based storage and complex indexing. Redis, being in-memory, drastically reduces query times and supports real-time AI apps by efficiently handling searches, caching results, and maintaining performance at scale.
How is Redis different from a dedicated vector database?
Unlike dedicated vector databases, Redis offers multi-modal capabilities: handling vector search, real-time caching, feature storage, and pub/sub messaging in a single system. This eliminates the need for multiple tools, reducing complexity and cost.
Which vector index types does Redis support?
Redis supports HNSW (Hierarchical Navigable Small World) for fast approximate nearest neighbor (ANN) search and Flat indexing for exact search. This flexibility allows AI applications to balance speed and accuracy based on their needs.
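The speed/accuracy tradeoff between the two index types can be illustrated in plain Python. Flat indexing is an exhaustive scan that always finds the true nearest neighbor; the "approximate" function below is only a crude stand-in for ANN (real HNSW navigates a layered proximity graph rather than sampling at random), but it shows why inspecting fewer candidates is faster yet may miss the best match.

```python
import math
import random

random.seed(0)

# 1,000 random 8-dimensional vectors standing in for stored embeddings.
VECTORS = [[random.random() for _ in range(8)] for _ in range(1000)]

def dist(a, b):
    """Euclidean distance between two vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def flat_search(query):
    """Exact search (Flat index): scan every vector; guaranteed best match."""
    return min(range(len(VECTORS)), key=lambda i: dist(query, VECTORS[i]))

def approximate_search(query, probes=100):
    """Crude stand-in for ANN search: inspect only a random subset,
    so it does ~10x less work but may miss the true nearest neighbor."""
    candidates = random.sample(range(len(VECTORS)), probes)
    return min(candidates, key=lambda i: dist(query, VECTORS[i]))

query = [0.5] * 8
print(flat_search(query), approximate_search(query))
```

The exact result is never farther from the query than the approximate one; ANN indexes like HNSW aim to make that gap negligible while keeping search sublinear in the number of vectors.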
How does Redis persist data across restarts?
Redis offers RDB (snapshotting) and AOF (Append-Only File) persistence options, ensuring AI-related data remains available even after restarts. Redis on Flex further enables larger data sets to persist cost-effectively.
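Both persistence modes are enabled through `redis.conf` directives; a minimal illustrative combination (the values here are examples, not recommendations):

```
# RDB: snapshot the dataset to disk if at least 1 key changed in 900 seconds
save 900 1

# AOF: log every write, fsync to disk once per second
appendonly yes
appendfsync everysec
```

RDB gives compact point-in-time snapshots, while AOF bounds data loss to roughly the fsync interval; the two can be used together.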
Where can I learn more about building AI apps with Redis?
You can find AI training courses on Redis University. Our AI Docs page explains concepts, lists resources, and includes many how-tos for building GenAI apps like AI assistants with RAG and AI agents.