Rerankers

CohereReranker

class CohereReranker(model='rerank-english-v3.0', rank_by=None, limit=5, return_score=True, api_config=None)

Bases: BaseReranker

The CohereReranker class uses Cohere’s API to rerank documents based on an input query.

This reranker is designed to interact with Cohere’s /rerank API, requiring an API key for authentication. The key can be provided directly in the api_config dictionary or through the COHERE_API_KEY environment variable. User must obtain an API key from Cohere’s website (https://dashboard.cohere.com/). Additionally, the cohere python client must be installed with pip install cohere.

from redisvl.utils.rerank import CohereReranker

# set up the Cohere reranker with some configuration
reranker = CohereReranker(rank_by=["content"], limit=2)
# rerank raw search results based on user input/query
results = reranker.rank(
    query="your input query text here",
    docs=[
        {"content": "document 1"},
        {"content": "document 2"},
        {"content": "document 3"}
    ]
)

Initialize the CohereReranker with specified model, ranking criteria, and API configuration.

  • Parameters:
    • model (str) – The identifier for the Cohere model used for reranking. Defaults to ‘rerank-english-v3.0’.
    • rank_by (Optional [ List [ str ] ]) – Optional list of keys specifying the attributes in the documents that should be considered for ranking. None means ranking will rely on the model’s default behavior.
    • limit (int) – The maximum number of results to return after reranking. Must be a positive integer.
    • return_score (bool) – Whether to return scores alongside the reranked results.
    • api_config (Optional [ Dict ] , optional) – Dictionary containing the API key. Defaults to None.
  • Raises:
    • ImportError – If the cohere library is not installed.
    • ValueError – If the API key is not provided.

async arank(query, docs, **kwargs)

Rerank documents based on the provided query using the Cohere rerank API.

This method processes the user’s query and the provided documents to rerank them in a manner that is potentially more relevant to the query’s context.

  • Parameters:
    • query (str) – The user’s search query.
    • docs (Union [ List [ Dict [ str , Any ] ] , List [ str ] ]) – The list of documents to be ranked, either as dictionaries or strings.
  • Returns: The reranked list of documents and optionally associated scores.
  • Return type: Union[Tuple[Union[List[Dict[str, Any]], List[str]], float], List[Dict[str, Any]]]

model_post_init(context, /)

This function is meant to behave like a BaseModel method to initialise private attributes.

It takes context as an argument since that’s what pydantic-core passes when calling it.

  • Parameters:
    • self (BaseModel) – The BaseModel instance.
    • context (Any) – The context.
  • Return type: None

rank(query, docs, **kwargs)

Rerank documents based on the provided query using the Cohere rerank API.

This method processes the user’s query and the provided documents to rerank them in a manner that is potentially more relevant to the query’s context.

  • Parameters:
    • query (str) – The user’s search query.
    • docs (Union [ List [ Dict [ str , Any ] ] , List [ str ] ]) – The list of documents to be ranked, either as dictionaries or strings.
  • Returns: The reranked list of documents and optionally associated scores.
  • Return type: Union[Tuple[Union[List[Dict[str, Any]], List[str]], float], List[Dict[str, Any]]]

model_config: ClassVar[ConfigDict] = {}

Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].

HFCrossEncoderReranker

class HFCrossEncoderReranker(model='cross-encoder/ms-marco-MiniLM-L-6-v2', limit=3, return_score=True, *, rank_by=None)

Bases: BaseReranker

The HFCrossEncoderReranker class uses a cross-encoder models from Hugging Face to rerank documents based on an input query.

This reranker loads a cross-encoder model using the CrossEncoder class from the sentence_transformers library. It requires the sentence_transformers library to be installed.

from redisvl.utils.rerank import HFCrossEncoderReranker

# set up the HFCrossEncoderReranker with a specific model
reranker = HFCrossEncoderReranker(model_name="cross-encoder/ms-marco-MiniLM-L-6-v2", limit=3)
# rerank raw search results based on user input/query
results = reranker.rank(
    query="your input query text here",
    docs=[
        {"content": "document 1"},
        {"content": "document 2"},
        {"content": "document 3"}
    ]
)

Initialize the HFCrossEncoderReranker with a specified model and ranking criteria.

  • Parameters:
    • model (str) – The name or path of the cross-encoder model to use for reranking. Defaults to ‘cross-encoder/ms-marco-MiniLM-L-6-v2’.
    • limit (int) – The maximum number of results to return after reranking. Must be a positive integer.
    • return_score (bool) – Whether to return scores alongside the reranked results.
    • rank_by (List [ str ] | None)

async arank(query, docs, **kwargs)

Asynchronously rerank documents based on the provided query using the loaded cross-encoder model.

This method processes the user’s query and the provided documents to rerank them in a manner that is potentially more relevant to the query’s context.

  • Parameters:
    • query (str) – The user’s search query.
    • docs (Union [ List [ Dict [ str , Any ] ] , List [ str ] ]) – The list of documents to be ranked, either as dictionaries or strings.
  • Returns: The reranked list of documents and optionally associated scores.
  • Return type: Union[Tuple[List[Dict[str, Any]], List[float]], List[Dict[str, Any]]]

model_post_init(context, /)

This function is meant to behave like a BaseModel method to initialise private attributes.

It takes context as an argument since that’s what pydantic-core passes when calling it.

  • Parameters:
    • self (BaseModel) – The BaseModel instance.
    • context (Any) – The context.
  • Return type: None

rank(query, docs, **kwargs)

Rerank documents based on the provided query using the loaded cross-encoder model.

This method processes the user’s query and the provided documents to rerank them in a manner that is potentially more relevant to the query’s context.

  • Parameters:
    • query (str) – The user’s search query.
    • docs (Union [ List [ Dict [ str , Any ] ] , List [ str ] ]) – The list of documents to be ranked, either as dictionaries or strings.
  • Returns: The reranked list of documents and optionally associated scores.
  • Return type: Union[Tuple[List[Dict[str, Any]], List[float]], List[Dict[str, Any]]]

model_config: ClassVar[ConfigDict] = {}

Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].

VoyageAIReranker

class VoyageAIReranker(model, rank_by=None, limit=5, return_score=True, api_config=None)

Bases: BaseReranker

The VoyageAIReranker class uses VoyageAI’s API to rerank documents based on an input query.

This reranker is designed to interact with VoyageAI’s /rerank API, requiring an API key for authentication. The key can be provided directly in the api_config dictionary or through the VOYAGE_API_KEY environment variable. User must obtain an API key from VoyageAI’s website (https://dash.voyageai.com/). Additionally, the voyageai python client must be installed with pip install voyageai.

from redisvl.utils.rerank import VoyageAIReranker

# set up the VoyageAI reranker with some configuration
reranker = VoyageAIReranker(rank_by=["content"], limit=2)
# rerank raw search results based on user input/query
results = reranker.rank(
    query="your input query text here",
    docs=[
        {"content": "document 1"},
        {"content": "document 2"},
        {"content": "document 3"}
    ]
)

Initialize the VoyageAIReranker with specified model, ranking criteria, and API configuration.

  • Parameters:
    • model (str) – The identifier for the VoyageAI model used for reranking.
    • rank_by (Optional [ List [ str ] ]) – Optional list of keys specifying the attributes in the documents that should be considered for ranking. None means ranking will rely on the model’s default behavior.
    • limit (int) – The maximum number of results to return after reranking. Must be a positive integer.
    • return_score (bool) – Whether to return scores alongside the reranked results.
    • api_config (Optional [ Dict ] , optional) – Dictionary containing the API key. Defaults to None.
  • Raises:
    • ImportError – If the voyageai library is not installed.
    • ValueError – If the API key is not provided.

async arank(query, docs, **kwargs)

Rerank documents based on the provided query using the VoyageAI rerank API.

This method processes the user’s query and the provided documents to rerank them in a manner that is potentially more relevant to the query’s context.

  • Parameters:
    • query (str) – The user’s search query.
    • docs (Union [ List [ Dict [ str , Any ] ] , List [ str ] ]) – The list of documents to be ranked, either as dictionaries or strings.
  • Returns: The reranked list of documents and optionally associated scores.
  • Return type: Union[Tuple[Union[List[Dict[str, Any]], List[str]], float], List[Dict[str, Any]]]

model_post_init(context, /)

This function is meant to behave like a BaseModel method to initialise private attributes.

It takes context as an argument since that’s what pydantic-core passes when calling it.

  • Parameters:
    • self (BaseModel) – The BaseModel instance.
    • context (Any) – The context.
  • Return type: None

rank(query, docs, **kwargs)

Rerank documents based on the provided query using the VoyageAI rerank API.

This method processes the user’s query and the provided documents to rerank them in a manner that is potentially more relevant to the query’s context.

  • Parameters:
    • query (str) – The user’s search query.
    • docs (Union [ List [ Dict [ str , Any ] ] , List [ str ] ]) – The list of documents to be ranked, either as dictionaries or strings.
  • Returns: The reranked list of documents and optionally associated scores.
  • Return type: Union[Tuple[Union[List[Dict[str, Any]], List[str]], float], List[Dict[str, Any]]]

model_config: ClassVar[ConfigDict] = {}

Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].

RATE THIS PAGE
Back to top ↑