Write SQL Queries for Redis
While Redis does not natively support SQL, RedisVL provides a SQLQuery class that translates SQL-like queries into Redis queries.
The SQLQuery class wraps the sql-redis package. This package is not installed by default, so install it with:
pip install redisvl[sql-redis]
Prerequisites
Before you begin, ensure you have:
- Installed RedisVL with SQL support:
pip install redisvl[sql-redis] - A running Redis instance (Redis 8+ or Redis Cloud)
What You'll Learn
By the end of this guide, you will be able to:
- Write SQL-like queries for Redis using
SQLQuery - Translate SELECT, WHERE, and ORDER BY clauses to Redis queries
- Combine SQL queries with vector search
- Use aggregate functions and grouping
Create an index to search
from redisvl.utils.vectorize import HFTextVectorizer
hf = HFTextVectorizer()
schema = {
"index": {
"name": "user_simple",
"prefix": "user_simple_docs",
"storage_type": "json",
},
"fields": [
{"name": "user", "type": "tag"},
{"name": "region", "type": "tag"},
{"name": "job", "type": "tag"},
{"name": "job_description", "type": "text"},
{"name": "age", "type": "numeric"},
{
"name": "job_embedding",
"type": "vector",
"attrs": {
"dims": len(hf.embed("get embed length")),
"distance_metric": "cosine",
"algorithm": "flat",
"datatype": "float32"
}
}
]
}
/Users/tyler.hutcherson/Documents/AppliedAI/redis-vl-python/.venv/lib/python3.13/site-packages/tqdm/auto.py:21: TqdmWarning: IProgress not found. Please update jupyter and ipywidgets. See https://ipywidgets.readthedocs.io/en/stable/user_install.html
from .autonotebook import tqdm as notebook_tqdm
Create sample dataset
data = [
{
'user': 'john',
'age': 34,
'job': 'software engineer',
'region': 'us-west',
'job_description': 'Designs, develops, and maintains software applications and systems.'
},
{
'user': 'bill',
'age': 54,
'job': 'engineer',
'region': 'us-central',
'job_description': 'Applies scientific and mathematical principles to solve technical problems.'
},
{
'user': 'mary',
'age': 24,
'job': 'doctor',
'region': 'us-central',
'job_description': 'Diagnoses and treats illnesses, injuries, and other medical conditions in the healthcare field.'
},
{
'user': 'joe',
'age': 27,
'job': 'dentist',
'region': 'us-east',
'job_description': 'Provides oral healthcare including diagnosing and treating teeth and gum issues.'
},
{
'user': 'stacy',
'age': 61,
'job': 'project manager',
'region': 'us-west',
'job_description': 'Plans, organizes, and oversees projects from inception to completion.'
}
]
data = [
{
**d,
"job_embedding": hf.embed(f"{d['job_description']=} {d['job']=}"),
}
for d in data
]
Create a SearchIndex
With the schema and sample dataset ready, create a SearchIndex.
Bring your own Redis connection instance
This is ideal in scenarios where you have custom settings on the connection instance or if your application will share a connection pool:
from redisvl.index import SearchIndex
from redis import Redis
client = Redis.from_url("redis://localhost:6379")
index = SearchIndex.from_dict(schema, redis_client=client, validate_on_load=True)
Let the index manage the connection instance
This is ideal for simple cases:
index = SearchIndex.from_dict(schema, redis_url="redis://localhost:6379", validate_on_load=True)
Create the index
Now that we are connected to Redis, we need to run the create command.
index.create(overwrite=True, drop=True)
Load Data to SearchIndex
Load the sample dataset to Redis.
Validate data entries on load
RedisVL uses pydantic validation under the hood to ensure loaded data is valid and confirms to your schema. This setting is optional and can be configured in the SearchIndex class.
keys = index.load(data)
print(keys)
['user_simple_docs:01KHKJGG26AR3VW2RJA381R8YK', 'user_simple_docs:01KHKJGG2R8EZP6H15MG1V4E53', 'user_simple_docs:01KHKJGG369F5R0R51PW2HP8MV', 'user_simple_docs:01KHKJGG3MGVPAZ6XEQVEWXZFC', 'user_simple_docs:01KHKJGG44ZEKJVRQJ0EF72PV7']
Create a SQLQuery Object
First, let's test a simple select statement such as the one below.
from redisvl.query import SQLQuery
sql_str = """
SELECT user, region, job, age
FROM user_simple
WHERE age > 17
"""
sql_query = SQLQuery(sql_str)
Check the created query string
sql_query.redis_query_string(redis_url="redis://localhost:6379")
'FT.SEARCH user_simple "@age:[(17 +inf]" RETURN 4 user region job age'
Executing the query
results = index.query(sql_query)
results
[{'user': 'john',
'region': 'us-west',
'job': 'software engineer',
'age': '34'},
{'user': 'bill', 'region': 'us-central', 'job': 'engineer', 'age': '54'},
{'user': 'mary', 'region': 'us-central', 'job': 'doctor', 'age': '24'},
{'user': 'joe', 'region': 'us-east', 'job': 'dentist', 'age': '27'},
{'user': 'stacy', 'region': 'us-west', 'job': 'project manager', 'age': '61'}]
Additional query support
Conditional operators
sql_str = """
SELECT user, region, job, age
FROM user_simple
WHERE age > 17 and region = 'us-west'
"""
sql_query = SQLQuery(sql_str)
redis_query = sql_query.redis_query_string(redis_url="redis://localhost:6379")
print("Resulting redis query: ", redis_query)
results = index.query(sql_query)
results
Resulting redis query: FT.SEARCH user_simple "@age:[(17 +inf] @region:{us\-west}" RETURN 4 user region job age
[{'user': 'john',
'region': 'us-west',
'job': 'software engineer',
'age': '34'},
{'user': 'stacy', 'region': 'us-west', 'job': 'project manager', 'age': '61'}]
sql_str = """
SELECT user, region, job, age
FROM user_simple
WHERE region = 'us-west' or region = 'us-central'
"""
sql_query = SQLQuery(sql_str)
redis_query = sql_query.redis_query_string(redis_url="redis://localhost:6379")
print("Resulting redis query: ", redis_query)
results = index.query(sql_query)
results
Resulting redis query: FT.SEARCH user_simple "((@region:{us\-west})|(@region:{us\-central}))" RETURN 4 user region job age
[{'user': 'john',
'region': 'us-west',
'job': 'software engineer',
'age': '34'},
{'user': 'bill', 'region': 'us-central', 'job': 'engineer', 'age': '54'},
{'user': 'stacy', 'region': 'us-west', 'job': 'project manager', 'age': '61'},
{'user': 'mary', 'region': 'us-central', 'job': 'doctor', 'age': '24'}]
# job is a tag field therefore this syntax works
sql_str = """
SELECT user, region, job, age
FROM user_simple
WHERE job IN ('software engineer', 'engineer', 'pancake tester')
"""
sql_query = SQLQuery(sql_str)
redis_query = sql_query.redis_query_string(redis_url="redis://localhost:6379")
print("Resulting redis query: ", redis_query)
results = index.query(sql_query)
results
Resulting redis query: FT.SEARCH user_simple "@job:{software engineer|engineer|pancake tester}" RETURN 4 user region job age
[{'user': 'john',
'region': 'us-west',
'job': 'software engineer',
'age': '34'},
{'user': 'bill', 'region': 'us-central', 'job': 'engineer', 'age': '54'}]
Text based searches
See the docs for available text queries in Redis.
For more on exact matching see here
# Prefix
sql_str = """
SELECT user, region, job, job_description, age
FROM user_simple
WHERE job_description = 'sci*'
"""
sql_query = SQLQuery(sql_str)
redis_query = sql_query.redis_query_string(redis_url="redis://localhost:6379")
print("Resulting redis query: ", redis_query)
results = index.query(sql_query)
results
Resulting redis query: FT.SEARCH user_simple "@job_description:sci*" RETURN 5 user region job job_description age
[{'user': 'bill',
'region': 'us-central',
'job': 'engineer',
'job_description': 'Applies scientific and mathematical principles to solve technical problems.',
'age': '54'}]
# Suffix
sql_str = """
SELECT user, region, job, job_description, age
FROM user_simple
WHERE job_description = '*care'
"""
sql_query = SQLQuery(sql_str)
redis_query = sql_query.redis_query_string(redis_url="redis://localhost:6379")
print("Resulting redis query: ", redis_query)
results = index.query(sql_query)
results
Resulting redis query: FT.SEARCH user_simple "@job_description:*care" RETURN 5 user region job job_description age
[{'user': 'mary',
'region': 'us-central',
'job': 'doctor',
'job_description': 'Diagnoses and treats illnesses, injuries, and other medical conditions in the healthcare field.',
'age': '24'},
{'user': 'joe',
'region': 'us-east',
'job': 'dentist',
'job_description': 'Provides oral healthcare including diagnosing and treating teeth and gum issues.',
'age': '27'}]
# Fuzzy
sql_str = """
SELECT user, region, job, job_description, age
FROM user_simple
WHERE job_description = '%diagnose%'
"""
sql_query = SQLQuery(sql_str)
redis_query = sql_query.redis_query_string(redis_url="redis://localhost:6379")
print("Resulting redis query: ", redis_query)
results = index.query(sql_query)
results
Resulting redis query: FT.SEARCH user_simple "@job_description:%diagnose%" RETURN 5 user region job job_description age
[{'user': 'mary',
'region': 'us-central',
'job': 'doctor',
'job_description': 'Diagnoses and treats illnesses, injuries, and other medical conditions in the healthcare field.',
'age': '24'}]
# Phrase no stop words
sql_str = """
SELECT user, region, job, job_description, age
FROM user_simple
WHERE job_description = 'healthcare including'
"""
sql_query = SQLQuery(sql_str)
redis_query = sql_query.redis_query_string(redis_url="redis://localhost:6379")
print("Resulting redis query: ", redis_query)
results = index.query(sql_query)
results
Resulting redis query: FT.SEARCH user_simple "@job_description:"healthcare including"" RETURN 5 user region job job_description age
[{'user': 'joe',
'region': 'us-east',
'job': 'dentist',
'job_description': 'Provides oral healthcare including diagnosing and treating teeth and gum issues.',
'age': '27'}]
# Phrase with stop words currently limitation of core Redis
sql_str = """
SELECT user, region, job, job_description, age
FROM user_simple
WHERE job_description = 'diagnosing and treating'
"""
sql_query = SQLQuery(sql_str)
redis_query = sql_query.redis_query_string(redis_url="redis://localhost:6379")
print("Resulting redis query: ", redis_query)
results = index.query(sql_query)
results
Resulting redis query: FT.SEARCH user_simple "@job_description:"diagnosing treating"" RETURN 5 user region job job_description age
/Users/tyler.hutcherson/Documents/AppliedAI/redis-vl-python/.venv/lib/python3.13/site-packages/sql_redis/translator.py:136: UserWarning: Stopwords ['and'] were removed from phrase search 'diagnosing and treating'. By default, Redis does not index stopwords. To include stopwords in your index, create it with STOPWORDS 0.
return self._query_builder.build_text_condition(
[{'user': 'joe',
'region': 'us-east',
'job': 'dentist',
'job_description': 'Provides oral healthcare including diagnosing and treating teeth and gum issues.',
'age': '27'}]
sql_str = """
SELECT user, region, job, age
FROM user_simple
WHERE age BETWEEN 40 and 60
"""
sql_query = SQLQuery(sql_str)
redis_query = sql_query.redis_query_string(redis_url="redis://localhost:6379")
print("Resulting redis query: ", redis_query)
results = index.query(sql_query)
results
Resulting redis query: FT.SEARCH user_simple "@age:[40 60]" RETURN 4 user region job age
[{'user': 'bill', 'region': 'us-central', 'job': 'engineer', 'age': '54'}]
Aggregations
See docs for redis supported reducer functions: docs.
sql_str = """
SELECT
user,
COUNT(age) as count_age,
COUNT_DISTINCT(age) as count_distinct_age,
MIN(age) as min_age,
MAX(age) as max_age,
AVG(age) as avg_age,
STDEV(age) as std_age,
FIRST_VALUE(age) as fist_value_age,
ARRAY_AGG(age) as to_list_age,
QUANTILE(age, 0.99) as quantile_age
FROM user_simple
GROUP BY region
"""
sql_query = SQLQuery(sql_str)
redis_query = sql_query.redis_query_string(redis_url="redis://localhost:6379")
print("Resulting redis query: ", redis_query)
results = index.query(sql_query)
results
Resulting redis query: FT.AGGREGATE user_simple "*" LOAD 2 age region GROUPBY 1 @region REDUCE COUNT 0 AS count_age REDUCE COUNT_DISTINCT 1 @age AS count_distinct_age REDUCE MIN 1 @age AS min_age REDUCE MAX 1 @age AS max_age REDUCE AVG 1 @age AS avg_age REDUCE STDDEV 1 @age AS std_age REDUCE FIRST_VALUE 1 @age AS fist_value_age REDUCE TOLIST 1 @age AS to_list_age REDUCE QUANTILE 2 @age 0.99 AS quantile_age
[{'region': 'us-west',
'count_age': '2',
'count_distinct_age': '2',
'min_age': '34',
'max_age': '61',
'avg_age': '47.5',
'std_age': '19.091883092',
'fist_value_age': '34',
'to_list_age': [b'34', b'61'],
'quantile_age': '61'},
{'region': 'us-central',
'count_age': '2',
'count_distinct_age': '2',
'min_age': '24',
'max_age': '54',
'avg_age': '39',
'std_age': '21.2132034356',
'fist_value_age': '54',
'to_list_age': [b'24', b'54'],
'quantile_age': '54'},
{'region': 'us-east',
'count_age': '1',
'count_distinct_age': '1',
'min_age': '27',
'max_age': '27',
'avg_age': '27',
'std_age': '0',
'fist_value_age': '27',
'to_list_age': [b'27'],
'quantile_age': '27'}]
Vector search
sql_str = """
SELECT user, job, job_description, cosine_distance(job_embedding, :vec) AS vector_distance
FROM user_simple
ORDER BY vector_distance ASC
"""
vec = hf.embed("looking for someone to use base principles to solve problems", as_buffer=True)
sql_query = SQLQuery(sql_str, params={"vec": vec})
redis_query = sql_query.redis_query_string(redis_url="redis://localhost:6379")
print("Resulting redis query: ", redis_query)
results = index.query(sql_query)
results
Resulting redis query: FT.SEARCH user_simple "*=>[KNN 10 @job_embedding $vector AS vector_distance]" PARAMS 2 vector $vector DIALECT 2 RETURN 4 user job job_description vector_distance SORTBY vector_distance ASC
[{'vector_distance': '0.82351064682',
'user': 'bill',
'job': 'engineer',
'job_description': 'Applies scientific and mathematical principles to solve technical problems.'},
{'vector_distance': '0.965160369873',
'user': 'john',
'job': 'software engineer',
'job_description': 'Designs, develops, and maintains software applications and systems.'},
{'vector_distance': '1.00401353836',
'user': 'mary',
'job': 'doctor',
'job_description': 'Diagnoses and treats illnesses, injuries, and other medical conditions in the healthcare field.'},
{'vector_distance': '1.00626885891',
'user': 'stacy',
'job': 'project manager',
'job_description': 'Plans, organizes, and oversees projects from inception to completion.'},
{'vector_distance': '1.01110625267',
'user': 'joe',
'job': 'dentist',
'job_description': 'Provides oral healthcare including diagnosing and treating teeth and gum issues.'}]
sql_str = """
SELECT user, region, cosine_distance(job_embedding, :vec) AS vector_distance
FROM user_simple
WHERE region = 'us-central'
ORDER BY vector_distance ASC
"""
vec = hf.embed("looking for someone to use base principles to solve problems", as_buffer=True)
sql_query = SQLQuery(sql_str, params={"vec": vec})
redis_query = sql_query.redis_query_string(redis_url="redis://localhost:6379")
print("Resulting redis query: ", redis_query)
results = index.query(sql_query)
results
Resulting redis query: FT.SEARCH user_simple "(@region:{us\-central})=>[KNN 10 @job_embedding $vector AS vector_distance]" PARAMS 2 vector $vector DIALECT 2 RETURN 3 user region vector_distance SORTBY vector_distance ASC
[{'vector_distance': '0.82351064682', 'user': 'bill', 'region': 'us-central'},
{'vector_distance': '1.00401353836', 'user': 'mary', 'region': 'us-central'}]
Next Steps
Now that you understand SQL queries for Redis, explore these related guides:
- Use Advanced Query Types - Learn about TextQuery, HybridQuery, and MultiVectorQuery
- Query and Filter Data - Apply filters using native RedisVL query syntax
- Getting Started - Review the basics of RedisVL indexes
Cleanup
To remove all data from Redis associated with the index, use the .clear() method. This leaves the index in place for future insertions or updates.
To remove everything including the index, use .delete() which removes both the index and the underlying data.
index.delete()