Feat/benches pipeline #21

jfdreis · 2025-05-20T22:05:01Z

No description provided.

jfdreis

⚠️ Performance Alert ⚠️

Possible performance regression was detected for benchmark 'Python Benchmark with pytest-benchmark'.
Benchmark result of this commit is worse than the previous benchmark result exceeding threshold 1.20.

Benchmark suite	Current: `233b835`	Previous: `7b76880`	Ratio
`benchmarks/test_rag.py::test_rag_pedantic`	`0.12853314014631542` iter/sec (`stddev: 0.03930125145461016`)	`0.16664327083853855` iter/sec (`stddev: 0.06262404233228976`)	`1.30`

This comment was automatically generated by workflow using github-action-benchmark.

jfdreis · 2025-05-27T09:07:16Z

Summary:

Add RAG performance benchmark to CI pipeline using pytest-benchmark and github-action-benchmark.

Description:
This introduces a new performance benchmark (benchmarks/test_rag.py) and integrates it into workflow (.github/workflows/ci.yml).
The benchmark measures the execution time of the rag.top_num_chunks_execute function.

Key aspects:

New Benchmark File: Adds benchmarks/test_rag.py containing the benchmark implementation.
CI Integration: Modifies the CI workflow to run this benchmark on every pull request and push to main.
Performance Tracking: Utilizes the github-action-benchmark to track performance changes over time and display results on the gh-pages branch. The benchmark performance graph is here.
Regression Alerting: Configures the benchmark action to fail the CI pipeline and post a comment on the PR if the top_num_chunks_execute function's execution time increases by more than 2% compared to the previous commit.
Non-Clustered Data: This specific benchmark is configured to test the non-clustered case of RAG execution. Later on we will update this to included a clustered case.
Dataset Size: The benchmark runs against a test dataset which contains 1000 paragraphs.

benchmarks/nilrag_quality.py

.github/workflows/ci.yml

manel1874 · 2025-05-28T09:42:27Z

.env.ci

I would put this .env.ci file inside .github/workflows. I believe you only need to adjust the ci file from below.

manel1874 · 2025-05-28T09:43:30Z

.github/workflows/ci.yml

+    
+    - name: Set up environment
+      run: |
+        cp .env.ci .env


In case you move .env.ci into .github/workflows:

Suggested change

cp .env.ci .env

cp .github/workflows/.env.ci .env

src/nilrag/nildb/operations.py

jfdreis force-pushed the feat/benches-pipeline branch from 3195a42 to 8dd122f Compare May 20, 2025 22:32

jfdreis commented May 26, 2025

View reviewed changes

jfdreis force-pushed the feat/benches-pipeline branch 2 times, most recently from 3bc980b to 6a19d9c Compare May 26, 2025 16:22

jfdreis marked this pull request as ready for review May 26, 2025 17:39

jfdreis requested review from a user and manel1874 May 26, 2025 17:41

ghost approved these changes May 27, 2025

View reviewed changes

benchmarks/nilrag_quality.py Show resolved Hide resolved

.github/workflows/ci.yml Outdated Show resolved Hide resolved

manel1874 approved these changes May 28, 2025

View reviewed changes

jfdreis force-pushed the feat/benches-pipeline branch 4 times, most recently from 6cf64a1 to f192e6d Compare May 29, 2025 23:14

feat: create benchmarks pipeline

92e1c05

jfdreis force-pushed the feat/benches-pipeline branch from f95c210 to 92e1c05 Compare May 30, 2025 09:26

jfdreis merged commit 96d8f3e into main May 30, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/benches pipeline #21

Feat/benches pipeline #21

Uh oh!

jfdreis commented May 20, 2025

Uh oh!

jfdreis left a comment •

edited

Loading

Uh oh!

jfdreis commented May 27, 2025

Uh oh!

Uh oh!

Uh oh!

manel1874 May 28, 2025

Uh oh!

manel1874 May 28, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Feat/benches pipeline #21

Feat/benches pipeline #21

Uh oh!

Conversation

jfdreis commented May 20, 2025

Uh oh!

jfdreis left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

⚠️ Performance Alert ⚠️

Uh oh!

jfdreis commented May 27, 2025

Uh oh!

Uh oh!

Uh oh!

manel1874 May 28, 2025

Choose a reason for hiding this comment

Uh oh!

manel1874 May 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jfdreis left a comment •

edited

Loading