Agentar-Scale-SQL is a novel framework that leverages scalable computation to significantly improve Text-to-SQL performance on challenging benchmarks. By implementing an Orchestrated Test-Time Scaling strategy, our framework synergistically combines three distinct perspectives to bridge the gap between state-of-the-art models and human expert performance.
Figure 1: The proposed Agentar-Scale-SQL framework.| Methods | EX (Dev) | EX (Test) | R-VES (%) |
|---|---|---|---|
| Agentar-Scale-SQL (Ours) | 74.90 | 81.67 | 77.00 |
| AskData + GPT-4o | 76.14 | 80.88 | 76.24 |
| LongData-SQL | 74.32 | 77.53 | 71.89 |
| CHASE-SQL + Gemini | 74.90 | 76.02 | 69.94 |
| JoyDataAgent-SQL | 74.25 | 75.85 | 70.16 |
| TCDataAgent-SQL | 74.12 | 75.74 | - |
| Contextual-SQL | 73.50 | 75.63 | 70.02 |
| XiYan-SQL | 73.34 | 75.63 | 71.41 |
- π
2025.11.27: We are excited to release Agentar-Scale-SQL-Generation-32B on Hugging Face and ModelScope! Simultaneously, we have open-sourced the code for the Light Schema Engine and the Offline Data Preprocessing Pipeline! - π
2025.09.30: Our paper is available on arXiv. - π
2025.09.25: We are proud to announce that we have achieved #1 Rank on the official BIRD leaderboard with 81.67% execution accuracy!
We are committed to continuously improving Agentar-Scale-SQL. Here is our plan for upcoming features and releases.
- Paper
- Publish the paper on arXiv.
- Model Releases
- Release Agentar-Scale-SQL-Generation-32B on Hugging Face and ModelScope.
- Release Agentar-Scale-SQL-Selection-32B on Hugging Face and ModelScope.
- Code Releases
- Release the code for the light schema engine.
- Release the code for the offline data preprocessing pipeline.
- Release the code for task understanding and generating SQL candidates with closed-source models.
- Release the code for generating SQL candidates with the fine-tuned model.
- Release the code for the SQL selection module.
Agentar-Scale-SQL/
βββ ScaleSQL/ # Core source code directory
β βββ workflows/ # Main workflow scripts
β βββ config/ # Configuration files
βββ ddl_schema.sh
βββ requirements.txt # Dependency list
βββ .env # Environment variable
βββ .env.example # Environment variable template
βββ .gitignore
βββ README.md # Current document
βββ nltk_data.zip # For ddl schema generationconda create -n ScaleSQL python=3.10
conda activate ScaleSQL# Install PyTorch (CUDA 12.1)
pip install torch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 --index-url https://download.pytorch.org/whl/cu121pip install -r requirements.txtpip install https://github.com/vllm-project/vllm/releases/download/v0.8.5.post1/vllm-0.8.5.post1+cu121-cp38-abi3-manylinux1_x86_64.whlmodelscope download --model sentence-transformers/all-MiniLM-L6-v2 --local_dir ./ScaleSQL/model/all-MiniLM-L6-v2Modify the configuration file: .ScaleSQL/workflows/config/pipeline_config.yaml.
Note that, we need column meaning file in the evaluation. You can find the file in TA-SQL.
dataset_folder: /temp/bird_test # Change to the actual folder
column_meaning_path: /your_path/column_meaning.json # Change to the actual pathpython -m ScaleSQL.workflows.schema_generation --evaluation_type testOutput example:
.ScaleSQL/dataset/bird_test_light_schema.json
ANONYMIZED_TELEMETRY=False python -m ScaleSQL.workflows.train_skeleton_processOutput path:
/tmp/ScaleSQL/chroma/bird_train_skeleton
ANONYMIZED_TELEMETRY=False python -m ScaleSQL.workflows.database_cell_process --evaluation_type testOutput path:
/tmp/ScaleSQL/chroma/bird_test
bash ddl_schema.shOutput example:
.ScaleSQL/dataset/bird_test_ddl_schema.json
Unlock the power of your business data with natural language. We are excited to introduce Data Agent, our cutting-edge ChatBI product designed to transform complex data into clear, conversational insights.
Simply ask questions in plain English, and let Data Agent handle the complex queries for you. No code, no steep learning curveβjust instant answers.
-
Official Website: For a detailed overview, features, and use cases, please visit our product page: https://antdigital.com/products/DataAgent
-
Early Access & Feedback: If you are interested in trying our product and providing valuable feedback, please feel free to contact us.
@misc{wang2025agentarscalesqladvancingtexttosqlorchestrated,
title={Agentar-Scale-SQL: Advancing Text-to-SQL through Orchestrated Test-Time Scaling},
author={Pengfei Wang and Baolin Sun and Xuemei Dong and Yaxun Dai and Hongwei Yuan and Mengdie Chu and Yingqi Gao and Xiang Qi and Peng Zhang and Ying Yan},
year={2025},
eprint={2509.24403},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2509.24403},
}
