Resource Center

Events & webinars Blog Videos Glossary Resources Architecture Diagrams Demo Center

Resource Center

Events & webinars Blog Videos Glossary Resources Architecture Diagrams Demo Center

Back to blog

Blog

Announcing LangChain RAG Template Powered by Redis

December 18, 20233 minute read

Redis

The recent launch of LangChain Templates introduces a transformative approach for developers to create and deploy generative AI APIs. LangChain Templates, including the new Redis Retrieval Augmented Generation (RAG) template, provide deployable reference architectures that blend efficiency with adaptability.

AI developers today face a deluge of technology choices between model providers, databases, and development frameworks such as LangChain. Additionally, getting to production requires significant investment beyond Jupyter notebooks and fancy Streamlit demos.

To reduce the friction in deploying APIs, LangChain offers a hub of deployable architectures. These templates encompass tool-specific chains, Large Language Model (LLM)-specific chains, and technique-specific chains, ensuring comprehensive developer options. Central to their deployment is LangServe, which uses FastAPI to transform LLM-based Chains or Agents into operational REST APIs, enhancing accessibility and production-readiness.

Redis partnered with LangChain to produce the Redis RAG template, a package optimized for creating factually consistent, LLM-powered chat applications. By using Redis as the vector database, this template ensures rapid context retrieval and grounded prompt construction, crucial for responsive and precise AI responses.

Getting started with our RAG template

The Redis RAG template serves a REST API for developers to chat with public financial PDF documents such as Nike’s 10k filings. The application uses:

FastAPI and Uvicorn to serve client requests via HTTP
UnstructuredFileLoader to parse the PDF documents into raw text
RecursiveCharacterTextSplitter to split the text into smaller chunks
‘all-MiniLM-L6-v2’ sentence transformer from HuggingFace to embed text chunks into vectors
Redis as the vector database for realtime context retrieval
OpenAI ‘gpt-3.5-turbo-16k’ LLM to generate answers to user queries

To run the RAG application with the template, you will need two things:

a running Redis instance (Redis Cloud or local Redis Stack)
an OpenAI API key

As always, refer to the official project README for the latest details. Here’s a step-by-step guide to build with the template locally:

1. Environment Setup: Set your OpenAI API key and Redis environment variables:

Alternatively, you can set the REDIS_URL environment variable instead of the individual components.

2. Create and activate a Python3.9 virtual environment (best practice). We will use venv:

3. Install the LangChain CLI and Pydantic:

3. Create a new LangChain project:

Running the LangChain CLI command shown above will create a new directory named test-rag.

When prompted to install the template, select the yes option, y. This step will download the rag-redis template contents under the ./test-rag/packages directory and attempt to install Python requirements.

4. Enter the new project directory:

Looking at the directory tree, we should see the following structure:

5. To use the rag-redis package, add the following snippet to your app/server.py file:

6. Ingest source data for demo app:

This may take a few minutes. The ingest.py script executes a pipeline, as visualized below, that loads the source PDF docs, converts text into smaller chunks, creates text embeddings using a HuggingFace sentence transformer model, and loads data into Redis.

7. Serve the FastAPI app with LangServe:

8. Access the API on port 8000. After spinning up, you should see the following output:

Navigate to http://127.0.0.1:8000/docs to see documentation.

Visit http://127.0.0.1:8000/rag-redis/playground to use the testing playground, seen below:

Use the playground to test your API by asking a question. The LangChain application responds with an answer that combines rich context from the Nike company PDF, retrieved from Redis, with the generative abilities of the OpenAI LLM.

Advance AI innovation with LangChain and Redis

Our ongoing partnership with LangChain reflects a commitment to continual innovation in AI. This collaboration fosters the development of tools such as the LangChain RAG template and supports initiatives such as the OpenGPTs project. The partnership also fuels our work in maintaining the Redis <> LangChain integrations, as well Redis’ own AI-native client, redisvl.

Redis is dedicated to equipping AI developers with the latest resources for creating performant and production-ready applications.

Next steps

The LangChain RAG template, powered by Redis’ vector database, simplifies the creation of AI applications. Build with this template and leverage these tools to create AI solutions that drive progress in the field.

Related resources

Powering LangChain

OpenGPTs With Redis Cloud open-source framework for building custom AI agents

Learn more

Vector database

Making it easy to build generative AI applications with Redis Enterprise.

Learn more

RedisVL client

Redis Vector Library (RedisVL) enables Redis as a realtime vector database for LLM Applications.

Learn more

Get started with Redis today

Speak to a Redis expert and learn more about enterprise-grade Redis today.

Try for free Talk to sales