investigating separating out documents from the rest of the message h… #95
base: main
Conversation
…istory and instructions.
anarchivist left a comment
Interesting work - I'm excited to see this proceeding.
willa/chatbot/graph_manager.py
Outdated
"start_index": str(doc.metadata.get('start_index')) if doc.metadata.get('start_index') else '',
"total_pages": str(doc.metadata.get('total_pages')) if doc.metadata.get('total_pages') else '',
Is there a reason why we're returning empty strings here and not None?
I think the model is expecting a string.
And we don't actually need these two values; I just thought they might be helpful to the model. We could drop both if we want.
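As a framework-free sketch of the coercion pattern under discussion (`normalize_metadata` and `as_str` are hypothetical helpers, not code from the PR): note that checking `is not None` instead of truthiness also keeps a legitimate `start_index` of 0 from collapsing to `''`.

```python
def normalize_metadata(metadata):
    """Coerce optional metadata fields to the strings the model expects."""
    def as_str(key):
        value = metadata.get(key)
        # Returning '' for missing values, per the current diff; returning
        # None instead is the alternative raised in review.
        return str(value) if value is not None else ''

    return {
        "start_index": as_str("start_index"),
        "total_pages": as_str("total_pages"),
    }
```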
prompt = get_langfuse_prompt()
system_messages = prompt.invoke({'context': docs_context,
                                 'question': latest_message.content})
system_messages = prompt.invoke({})
What's this doing? Where is the user's question being inserted?
The user's question is passed as a user message like it always is. There actually might be unrelated cleanup work for us to do with that. Right now, we're passing the user query as a user message twice (due to the summary bit) and then again as part of the system message (instruction prompt).
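A framework-free sketch of the duplication described above (`build_messages` is a hypothetical stand-in for `prompt.invoke({'context': ..., 'question': ...})`, not the project's code): the question is interpolated into the system/instruction prompt and also sent again as a user message.

```python
def build_messages(system_template, context, question):
    """Fill the system template with context and question, then append
    the same question as a separate user message."""
    system = system_template.format(context=context, question=question)
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": question},  # question appears a second time here
    ]

msgs = build_messages(
    "Answer using this context: {context}\nQuestion: {question}",
    "retrieved document text",
    "What are the admission criteria?",
)
```

The cleanup suggested in the comment would amount to picking one of these paths for the question rather than all of them.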
- this gets the Cohere-specific response field that includes citations for the response text
9cce4d4 to e999fa8
additional_model_request_fields={"documents": documents},
additional_model_response_field_paths=["/citations"]
)
citations = response.response_metadata.get('additionalModelResponseFields').get('citations') if response.response_metadata else None
What's neat about this is that the citations returned by Cohere can be used to tie specific parts of the response message back to the documents we passed above on line 145.
An example list of citations looks like this:
[
{
"start": 184,
"end": 229,
"text": "admission criteria and standards of practice.",
"document_ids": ["doc_0"]
},
{
"start": 260,
"end": 275,
"text": "different roles",
"document_ids": ["doc_1", "doc_2"]
}
]
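Since each citation carries character offsets (`start`/`end`) into the response text plus the supporting `document_ids`, the spans can be rendered back onto the reply. A minimal sketch (`annotate_citations` is an illustrative helper, not code from this PR):

```python
def annotate_citations(text, citations):
    """Insert [doc_ids] markers after each cited span.

    Work right to left so earlier character offsets stay valid as
    markers are inserted.
    """
    for c in sorted(citations, key=lambda c: c["end"], reverse=True):
        marker = "[" + ", ".join(c["document_ids"]) + "]"
        text = text[:c["end"]] + marker + text[c["end"]:]
    return text

reply = "Roles differ by site."
cited = annotate_citations(reply, [
    {"start": 0, "end": 5, "text": "Roles", "document_ids": ["doc_1"]},
    {"start": 16, "end": 20, "text": "site", "document_ids": ["doc_2", "doc_3"]},
])
# cited == "Roles[doc_1] differ by site[doc_2, doc_3]."
```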
- temporarily add raw citations to response.