This repository provides a reference implementation of A*-Decoding for small language models (SLMs). A*-Decoding is an inference-time search algorithm designed to efficiently find high-quality completions from language models, especially on tasks that require complex reasoning and step-by-step solutions.
In addition to A*-Decoding, the repository includes competitive baseline methods such as Best-of-N search, Self-Consistency decoding, and Particle Filtering, enabling direct comparison and benchmarking across different decoding strategies.
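To make the general idea concrete, the sketch below shows PRM-guided best-first search over partial, step-by-step solutions: keep a frontier of partial solutions ordered by a process reward model (PRM) score, repeatedly expand the most promising one, and return the best completed solution. It is a minimal illustration under assumed interfaces (the propose_steps, prm_score, and is_complete callables and the expansion budget are hypothetical), not the repository's actual A*-Decoding implementation.

```python
import heapq
from typing import Callable, List

def best_first_decode(
    question: str,
    propose_steps: Callable[[str, List[str]], List[str]],  # (question, steps so far) -> candidate next steps
    prm_score: Callable[[str, List[str]], float],           # (question, steps so far) -> scalar quality score
    is_complete: Callable[[List[str]], bool],
    max_expansions: int = 100,
) -> List[str]:
    """PRM-guided best-first search over partial step sequences (illustration only)."""
    counter = 0                      # tie-breaker so heapq never compares the step lists
    frontier = [(0.0, counter, [])]  # min-heap over negated PRM scores
    best_steps, best_score = [], float("-inf")

    for _ in range(max_expansions):
        if not frontier:
            break
        neg_score, _, steps = heapq.heappop(frontier)
        if is_complete(steps):
            # Track the best fully completed solution seen so far.
            if -neg_score > best_score:
                best_steps, best_score = steps, -neg_score
            continue
        # Expand the most promising partial solution with candidate next steps.
        for step in propose_steps(question, steps):
            new_steps = steps + [step]
            counter += 1
            heapq.heappush(frontier, (-prm_score(question, new_steps), counter, new_steps))

    return best_steps


if __name__ == "__main__":
    # Toy demo: "steps" are single characters and the "PRM" rewards matching a target string.
    target = "42"
    answer = best_first_decode(
        question="What is 6 * 7?",
        propose_steps=lambda q, steps: list("0123456789"),
        prm_score=lambda q, steps: sum(a == b for a, b in zip(steps, target)) - 0.01 * len(steps),
        is_complete=lambda steps: len(steps) == len(target),
    )
    print("".join(answer))  # -> "42"
```

A full A* formulation would combine an accumulated path cost with the PRM score used as a heuristic; the greedy best-first variant above is kept deliberately small for readability.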
- A*-Decoding: Core implementation of efficient, guided search for language models.
- Best-of-N Search: Selects the highest-scoring answer from N independently sampled completions, where scoring is done with a process reward model (PRM).
- Self-Consistency Decoding: Picks the most frequent final answer across multiple chain-of-thought samples.
- Particle Filtering: Generates multiple trajectories by iteratively sampling next steps, with probabilities weighted by the PRM scores of the current partial solutions (a toy sketch of these baseline selection rules follows this list).
- Evaluation Suite: Tools for chain-of-thought benchmarking, grading, and verifying model outputs on math and reasoning datasets.
- Configurable Experiments: Easily switch between models, datasets, and decoding strategies using JSON configuration files.
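To make the baselines concrete, here is a toy sketch of their selection and resampling rules, assuming the completions have already been generated and scored; the data structures and function names are illustrative and do not reflect the repository's API.

```python
import random
from collections import Counter
from typing import List, Tuple

def best_of_n(scored: List[Tuple[str, float]]) -> str:
    """Best-of-N: keep the completion with the highest PRM score."""
    return max(scored, key=lambda pair: pair[1])[0]

def self_consistency(final_answers: List[str]) -> str:
    """Self-Consistency: majority vote over the final answers of chain-of-thought samples."""
    return Counter(final_answers).most_common(1)[0][0]

def resample_particles(partial_solutions: List[str], prm_scores: List[float], k: int) -> List[str]:
    """Particle Filtering (one step): resample partial solutions in proportion to their PRM scores."""
    return random.choices(partial_solutions, weights=prm_scores, k=k)

# Toy usage.
print(best_of_n([("x = 12", 0.31), ("x = 14", 0.77), ("x = 14", 0.64)]))  # -> "x = 14"
print(self_consistency(["14", "12", "14"]))                               # -> "14"
print(resample_particles(["step A", "step B"], [0.9, 0.1], k=4))
```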
- a_star_decoding/ — Main A*-Decoding implementation.
- best_of_n/ — Best-of-N search baseline.
- self_consistency/ — Self-Consistency decoding baseline.
- pf/ — Particle Filtering baseline.
- configs/ — JSON configuration files for experiments.
- data/ — Datasets and experiment results.
- evaluate/ — Scripts and utilities for evaluation and grading.
- models/ — Model configuration and utility code.
- run_scripts/ — Shell scripts to run experiments.
- constants.py, pyproject.toml, Dockerfile — Project setup and dependencies.
- Clone the repository:
git clone <repo-url>
cd a_star_decoding
- Install dependencies:
- Using Poetry:
poetry install
- Or with pip:
pip install -r requirements.txt
- (Optional) Docker:
docker build -t a_star_decoding .
docker run -it a_star_decoding
All config files can be found in configs/.
- A* Decoding:
./run_scripts/run_a_star.sh HF_TOKEN MATH500 llama1b_qwen7b
- Best-of-N Search:
./run_scripts/run_bon.sh HF_TOKEN MATH500 bon_llama1b
- Self-Consistency:
./run_scripts/run_sco.sh HF_TOKEN CO_API_KEY MATH500 sco_llama1b_commandrp
- Evaluation (a toy grading sketch follows these commands):
./run_scripts/run_bench.sh HF_TOKEN MATH500 meta-llama/Llama-3.2-1B-Instruct
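As a rough picture of what the evaluation step checks on math benchmarks, the toy grader below pulls the final \boxed{...} expression out of a completion and exact-matches it against the reference answer after light normalization. It is an illustrative stand-in, not the grading logic shipped in evaluate/.

```python
import re
from typing import Optional

def extract_boxed(completion: str) -> Optional[str]:
    """Return the contents of the last \\boxed{...} in the completion (nested braces not handled)."""
    matches = re.findall(r"\\boxed\{([^{}]*)\}", completion)
    return matches[-1] if matches else None

def normalize(answer: str) -> str:
    """Light normalization: drop whitespace, surrounding $ signs, and a trailing period."""
    return answer.strip().strip("$").rstrip(".").replace(" ", "")

def is_correct(completion: str, reference: str) -> bool:
    predicted = extract_boxed(completion)
    return predicted is not None and normalize(predicted) == normalize(reference)

# Toy usage.
print(is_correct("... so the final answer is \\boxed{14}.", "14"))  # -> True
print(is_correct("... therefore \\boxed{12}.", "14"))               # -> False
```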
- Datasets are in data/datasets/ (e.g., AIME24, MATH500).
- Experiment results are saved in data/runs/.
- Experiment settings are controlled via JSON files in configs/.
- Modify these files to change models, datasets, or decoding parameters; an illustrative example follows.
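For example, a config tweak might look like the following sketch; the file name and the dataset / n_samples keys are hypothetical placeholders, so check the JSON files in configs/ for the actual schema.

```python
import json
from pathlib import Path

# Hypothetical workflow: the file name and the "dataset" / "n_samples" keys are
# illustrative assumptions; inspect the JSON files in configs/ for the real schema.
config_path = Path("configs/llama1b_qwen7b.json")
config = json.loads(config_path.read_text())

config["dataset"] = "AIME24"   # e.g., switch to a different benchmark
config["n_samples"] = 8        # e.g., shrink the sampling budget

Path("configs/llama1b_qwen7b_aime24.json").write_text(json.dumps(config, indent=2))
```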
If you use this codebase in your research, please cite the relevant papers or this repository. For example:
@misc{chatziveroglou2025adecodingtokenefficientinferencescaling,
title={A*-Decoding: Token-Efficient Inference Scaling},
author={Giannis Chatziveroglou},
year={2025},
eprint={2505.13672},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2505.13672},
}

For questions or contributions, please open an issue or pull request.
