
Conversation

@guill (Contributor) commented Jun 2, 2025

When files are very large, returning the entire file can quickly blow out the context, even with a small number of results returned. This PR adds an option, only_chunks (specified in tool_opts), to return only the matching chunks to the CodeCompanion LLM. The LLM can then decide, based on the chunk, whether the file actually seems relevant to the question and whether to request the entire file.

I've only added this to the CodeCompanion backend for now (just because I don't have CopilotChat configured to test it).
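
For reference, this is roughly what the config ends up looking like (sketch only; the surrounding extension setup is omitted and the exact keys may differ):

-- tool_opts sketch: existing options shown for context, only_chunks is the new flag
tool_opts = {
  auto_submit = { ls = false, query = false },
  ls_on_start = false,
  no_duplicate = true,
  only_chunks = true, -- return matching chunks instead of full documents
}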

codecov bot commented Jun 2, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 99.28%. Comparing base (c186db0) to head (8aa380b).
Report is 2 commits behind head on main.

Additional details and impacted files
@@           Coverage Diff           @@
##             main     #168   +/-   ##
=======================================
  Coverage   99.28%   99.28%           
=======================================
  Files          22       22           
  Lines        1534     1534           
=======================================
  Hits         1523     1523           
  Misses         11       11           


@Davidyz (Owner) commented Jun 2, 2025

Hi, thanks for this PR! I haven't done this because of #146. I'll get back to this once I fix that.

Davidyz added the enhancement and feature labels on Jun 2, 2025
@Davidyz (Owner) commented Jun 3, 2025

While we're at it, do you think it makes sense to give the choice to the LLM? We could make this a boolean parameter that the LLM sets in the tool call, so that it can decide whether it wants chunks or full documents. The downside is that the LLM doesn't know the project before the function call, so its decisions might be bad and unreliable.

@guill (Contributor, Author) commented Jun 4, 2025

Yeah, I don't think it makes sense to expose that directly to an LLM -- whether to return the chunk or the document really depends on the size of the full document responses, and it's not something I would expect the LLM to have enough context to make an intelligent decision on.
I suppose we could have a max_response_size argument provided by the LLM and automatically switch from document response to chunks if the total document response is over that size. I'm not sure how self-aware most LLMs are about their own context sizes, though. Maybe max_response_size should instead be exposed through the plugin opts? 🤔
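
Something along these lines, purely as a sketch (the helper name and result shape are made up here):

-- fall back to chunk-only responses when the concatenated documents would
-- exceed the LLM-provided max_response_size; each result is assumed to carry
-- the path/chunk/document fields returned by the query
local function pick_payload(results, max_response_size)
  local total = 0
  for _, result in ipairs(results) do
    total = total + #result.document
  end
  if total > max_response_size then
    return vim.tbl_map(function(r) return r.chunk end, results)
  end
  return vim.tbl_map(function(r) return r.document end, results)
end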

@Davidyz (Owner) commented Jun 4, 2025

I suppose we could have a max_response_size argument provided by the LLM and automatically switch from document response to chunks if the total document response is over that size.

We might be able to utilise CodeCompanion's token count feature for this. The problem is how to determine the max context window so that the token count actually makes sense (for example, a 100k-token conversation is close to saturating a 128k-context LLM, but is still very usable for a 1M-context LLM).

I also have this idea of using an auxiliary LLM as a response rewriter that paraphrases the document content into concise, descriptive paragraphs. Paraphrasing is a simpler task than coding and can hopefully be handled well enough by a small, cheap (or free) model. If it works, it would save tokens for the main LLM, allowing users to have longer conversations with the main coder LLM, and hopefully reduce the cost as well.

Davidyz force-pushed the nvim_return_chunks_pr branch from 53a5390 to 293dac5 on June 4, 2025
Davidyz force-pushed the nvim_return_chunks_pr branch from 293dac5 to 8abdfc2 on June 5, 2025
@Davidyz (Owner) left a comment

Apart from the comments on the changes, I think we should also make max_num and default_num tables (something like {document=10, chunk=100}), so that users can set different default_num and max_num values for document mode and chunk mode. Chunk length and document length can differ by a lot, so it makes sense to set different limits for them.
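
For example (the numbers are just placeholders):

default_num = { document = 10, chunk = 100 },
max_num = { document = 30, chunk = 300 },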

Also, to keep it backward-compatible, we should add a check that converts the old config format to the new one and emits a warning via vim.deprecate (if you're not familiar with the syntax, this is how I do it). I'll remove the backward-compatibility shim before the 0.7.0 release.
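
Roughly like this (sketch only; assumes max_num is the option being migrated):

-- convert the old scalar format to the new per-mode table and warn the user
if type(opts.max_num) == "number" then
  vim.deprecate(
    "max_num = <integer>",
    "max_num = { document = <integer>, chunk = <integer> }",
    "0.7.0",
    "VectorCode"
  )
  opts.max_num = { document = opts.max_num, chunk = opts.max_num }
end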

local args = { "query" }
vim.list_extend(args, action.options.query)
vim.list_extend(args, { "--pipe", "-n", tostring(action.options.count) })
vim.list_extend(args, { "--include", "path", "chunk", "document" })
@Davidyz (Owner) commented:

The chunk and document options in the --include flag are mutually exclusive; you can pass only one of them. path can stay.
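
Something like this instead (sketch; assumes the chunk/document choice is available on action.options):

local include = action.options.only_chunks and "chunk" or "document"
vim.list_extend(args, { "--include", "path", include })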

auto_submit = { ls = false, query = false },
ls_on_start = false,
no_duplicate = true,
only_chunks = false,
@Davidyz (Owner) commented:

Since the chunk and document options can never co-exist, maybe rename this to chunk_mode?

@Davidyz (Owner) commented Jun 10, 2025

I'll check out from this branch to refactor some of the code and prepare the codebase for #179. All your commits will stay. I'll also make the changes that I mentioned here.
