A Python project that trains a Large Language Model (LLM) to perform listwise reranking for information retrieval.
Reranking is typically done with a cross-encoder, so why use an LLM?
Compared to traditional cross-encoder rerankers, LLMs offer several advantages:
- Listwise reasoning: LLMs can consider the entire candidate list jointly, enabling true listwise ranking instead of scoring each document independently (see the prompt sketch after this list).
- Explainability: LLMs can explain, in natural language, why they ranked the candidates in a given order.
- Task adaptability: The same LLM can be adapted to different domains or ranking criteria (e.g. factuality, diversity, recency).
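
To make the listwise setup concrete, here is a minimal sketch of how a reranking prompt might be assembled; the function name and template are illustrative, not the exact format this project uses:

```python
def build_listwise_prompt(query: str, candidates: list[str]) -> str:
    """Build a single prompt showing the model every candidate at once.

    Hypothetical template: the model is expected to answer with a
    permutation of the identifiers, e.g. "[2] > [1] > [3]".
    """
    lines = [f"Query: {query}", "Rank the following documents by relevance to the query:"]
    for i, doc in enumerate(candidates, start=1):
        lines.append(f"[{i}] {doc}")
    lines.append("Answer with the identifiers in order, most relevant first.")
    return "\n".join(lines)
```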
The pipeline:
- Build a reranking dataset using BM25 as the first-stage retriever (a retrieval sketch follows this list)
- Fine-tune an LLM to rerank candidate documents listwise
- Evaluate reranking performance with standard IR metrics
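
For the first stage, here is a minimal BM25 retrieval sketch using the `rank_bm25` package; whether `dataset.py` uses this exact library is an assumption, and the corpus and query are toy examples:

```python
from rank_bm25 import BM25Okapi  # assumed library; the project may use another BM25 implementation

# Toy corpus standing in for the real document collection.
corpus = [
    "BM25 is a bag-of-words ranking function.",
    "LLMs can rerank candidate documents listwise.",
    "Cross-encoders score each query-document pair independently.",
]
bm25 = BM25Okapi([doc.lower().split() for doc in corpus])

query = "listwise reranking with LLMs"
# Retrieve the top-k candidates that would be handed to the LLM reranker.
top_docs = bm25.get_top_n(query.lower().split(), corpus, n=2)
print(top_docs)
```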
- Python 3.12+
- The project can be installed and run using uv.
- Other dependencies are specified in `pyproject.toml`.

Dependencies can be installed with:

```bash
uv sync
```
@TODO
```bash
cd src
uv run dataset.py
```
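
The exact schema produced by `dataset.py` is not documented here; a plausible, purely hypothetical record pairs a query with its BM25 candidates and the gold ordering:

```python
# Hypothetical record layout; the actual fields in dataset.py may differ.
example = {
    "query": "what is bm25",
    "candidates": [
        "BM25 is a ranking function used by search engines.",
        "LLMs are large neural language models.",
    ],
    "gold_ranking": [0, 1],  # candidate indices, most relevant first
}
```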
```bash
cd src
uv run train.py
```
```bash
cd src
uv run infer.py [--model-type {base,fine-tuned} ...] [--dataset {train,validation}]
```
Example: running inference with the fine-tuned model on the validation dataset:
```bash
uv run infer.py --model-type fine-tuned --dataset validation
```
```bash
cd src
uv run eval.py
```
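
As an illustration of the standard IR metrics mentioned above, here is a self-contained NDCG@k implementation; whether `eval.py` computes NDCG this way is an assumption:

```python
import math

def ndcg_at_k(ranked_relevances: list[int], k: int) -> float:
    """NDCG@k for one query, given graded relevances in predicted order."""
    def dcg(rels: list[int]) -> float:
        return sum(rel / math.log2(i + 2) for i, rel in enumerate(rels[:k]))
    ideal = dcg(sorted(ranked_relevances, reverse=True))
    return dcg(ranked_relevances) / ideal if ideal > 0 else 0.0

# With a single relevant document per query (see the note below), NDCG@k
# reduces to 1 / log2(rank + 1) for the 1-based rank of that document.
print(ndcg_at_k([0, 1, 0], k=3))  # ~0.63: relevant document at rank 2
```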
- Only queries from the original MS MARCO dataset that have exactly one relevant document are used in this project. This may change in the future.
- Implement a logging system.