(fix): Regenerate the system prompt to force the system not to reveal internal details by alvaro-mazcu · Pull Request #195 · Tanzania-AI-Community/twiga

alvaro-mazcu · 2026-01-28T11:27:59Z

Twiga keeps revealing internal tools names. This new system prompt forces the chatbot to avoid revealing these details. I want you guys to test Twiga with this new prompt. I have been testing it thoroughly and it has worked.

cc @jurmy24 @fredygerman

… internal details

app/assets/strings/english.yml

Ben-Temming · 2026-01-31T19:19:19Z

app/services/messaging_service.py

+            if self._are_the_tools_names_mentioned(llm_content):
+                self.logger.warning(
+                    "Tool name leakage detected in LLM response; sending fallback message."
+                )
+                await whatsapp_client.send_message(
+                    user.wa_id, strings.get_string(StringCategory.ERROR, "tool_leakage")
+                )
+                record_messages_generated("tool_names_mentioned_error")
+                return JSONResponse(content={"status": "ok"}, status_code=200)
+
+            self.logger.debug(
+                f"Sending message to {user.wa_id}: {llm_responses[-1].content}"
+            )


Nice, but something is not working right, I get the warning in the terminal

WARNING: 2026-01-31 19:11:31 - app.services.messaging_service - Tool name leakage detected in LLM response; sending fallback message.

But still see the original LLM message with the tools in the chat instead of the error message.

Ben-Temming · 2026-01-31T19:22:08Z

app/services/messaging_service.py


+        if llm_responses:
            # Update the database with the responses
            await db.create_new_messages(llm_responses)


Does this not mean that the response with tools is saved to the database and fetched for the history? Would it not make sense to have the tool check before saving to the database so that the history will reflect the expected response?

Yes, that's correct. But I believe that this is out of the scope of this PR. We will talk about this on Thursday and we design the exact message history, as I believe this would suffer a refactor

(fix): Regenerate the system prompt to force the system not to reveal…

2edc77b

… internal details

alvaro-mazcu requested review from fredygerman and jurmy24 January 28, 2026 11:27

alvaro-mazcu self-assigned this Jan 28, 2026

alvaro-mazcu added 4 commits January 30, 2026 19:17

Merge branch 'development' into fix/avoid_showing_tool_names_in_messages

a7ff478

(feat): Update agent prompt

bfa7cad

(feat): Drop duplicate answer to user

6df710e

(feat): Send Error message if tools are leaked

14d7bea

alvaro-mazcu requested a review from Ben-Temming January 30, 2026 18:53

Ben-Temming reviewed Jan 31, 2026

View reviewed changes

app/assets/strings/english.yml Outdated Show resolved Hide resolved

Ben-Temming reviewed Jan 31, 2026

View reviewed changes

alvaro-mazcu added 3 commits February 3, 2026 12:52

(feat): Update Error message when tool name is leaked

f6b8bac

(fix): rename old variable

c215994

(fix): reorganise error messages

4c2e60a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(fix): Regenerate the system prompt to force the system not to reveal internal details#195

(fix): Regenerate the system prompt to force the system not to reveal internal details#195
alvaro-mazcu wants to merge 8 commits intodevelopmentfrom
fix/avoid_showing_tool_names_in_messages

alvaro-mazcu commented Jan 28, 2026

Uh oh!

Uh oh!

Ben-Temming Jan 31, 2026 •

edited

Loading

Uh oh!

Ben-Temming Jan 31, 2026

Uh oh!

alvaro-mazcu Feb 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

alvaro-mazcu commented Jan 28, 2026

Uh oh!

Uh oh!

Ben-Temming Jan 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Ben-Temming Jan 31, 2026

Choose a reason for hiding this comment

Uh oh!

alvaro-mazcu Feb 3, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Ben-Temming Jan 31, 2026 •

edited

Loading