
Conversation

@inikep (Collaborator) commented Dec 5, 2025

  1. Refactors the JSON serialization logic within `AuditJsonHandler` to guarantee strict compliance with the JSON standard by eliminating trailing commas and centralizing field-separation control.

The previous approach of appending `", "` after every value handler was inconsistent and required error-prone comma-removal logic in `EndObject()`.

This change adopts a safer, state-driven approach (sketched after this list):

  • Centralized Comma Management: The responsibility for adding the comma separator is moved entirely from the value handlers (`Int`, `String`, etc.) to the `Key()` handler.
  • State-Driven Separation: The new `m_is_first_field` state flag, set in `StartObject()` and checked/updated in `Key()`, ensures a comma is prepended only when necessary (i.e., not before the first field), thereby naturally preventing trailing commas within objects.
  • Inter-Event Separation: Confirmed that the `,\n` separator is correctly appended between top-level audit event objects in the array.
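For illustration, here is a minimal single-level sketch of the new separator logic. The buffer name and output formatting are hypothetical; only `m_is_first_field`, `StartObject()`, and `Key()` mirror the actual change:

```cpp
#include <rapidjson/reader.h>
#include <string>

// Sketch: state-driven field separation for a flat JSON object.
// m_buffer is illustrative; the real handler writes to the UDF result.
struct AuditJsonHandler
    : rapidjson::BaseReaderHandler<rapidjson::UTF8<>, AuditJsonHandler> {
  std::string m_buffer;
  bool m_is_first_field = false;

  bool StartObject() {
    m_buffer += '{';
    m_is_first_field = true;  // the next Key() must not emit a separator
    return true;
  }

  bool Key(const char *str, rapidjson::SizeType len, bool /*copy*/) {
    if (!m_is_first_field) m_buffer += ", ";  // Key() alone owns the comma
    m_is_first_field = false;
    m_buffer.append("\"").append(str, len).append("\": ");
    return true;
  }

  bool String(const char *str, rapidjson::SizeType len, bool /*copy*/) {
    m_buffer.append("\"").append(str, len).append("\"");  // no trailing ", "
    return true;
  }

  bool EndObject(rapidjson::SizeType /*member_count*/) {
    m_buffer += '}';  // nothing to strip: no trailing comma was ever written
    return true;
  }
};
```

Since the value handlers no longer emit separators, `EndObject()` needs no comma-removal logic at all.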
  2. The `audit_log_read()` UDF was failing to respect the `max_array_length` parameter, returning all records instead of the specified limit. This was caused by the read loop not checking the `is_batch_end` flag.

Additionally, attempting to read the remaining records in a subsequent call caused an infinite loop or parsing errors, because a new `rapidjson::Reader` was created for each call, losing the internal state required to resume parsing mid-stream (e.g., handling the comma separator between array elements).

The fix involves (see the sketch after this list):

  • Respecting the `is_batch_end` flag in the `AuditLogReader::read` loop to stop processing when the limit is reached.
  • Storing the `rapidjson::Reader` instance within `AuditLogReaderContext` to preserve parsing state across multiple `audit_log_read()` calls.
  • Adding error checking for `reader->HasParseError()` to prevent infinite loops on malformed data or state mismatches.
  • Updating the `udf_audit_log_read_validate_output` test case to verify correct behavior for `max_array_length`.
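A sketch of the fixed read loop, assuming rapidjson's iterative-parsing API (`IterativeParseInit()`/`IterativeParseNext()`) and a hypothetical `is_batch_end` flag on the handler; the actual members of `AuditLogReaderContext` may differ:

```cpp
#include <rapidjson/reader.h>

#include <memory>

// Hypothetical slice of AuditLogReaderContext: the Reader lives here, not on
// the stack of the read function, so its parse state survives between calls.
struct AuditLogReaderContext {
  std::unique_ptr<rapidjson::Reader> reader;
};

// Returns false on a parse error. `Stream` is any rapidjson input stream;
// `handler.is_batch_end` stands in for the flag that is set once
// max_array_length records have been produced.
template <typename Stream, typename Handler>
bool read_batch(AuditLogReaderContext *ctx, Stream &is, Handler &handler) {
  if (!ctx->reader) {
    ctx->reader = std::make_unique<rapidjson::Reader>();
    ctx->reader->IterativeParseInit();  // once per log, not once per UDF call
  }
  while (!ctx->reader->IterativeParseComplete()) {
    if (!ctx->reader->IterativeParseNext<rapidjson::kParseDefaultFlags>(is, handler))
      break;  // parse failure or handler abort: fall through to error check
    if (handler.is_batch_end)
      return true;  // limit reached; the preserved Reader resumes next call
  }
  // Without this check, malformed input or a state mismatch would loop forever.
  return !ctx->reader->HasParseError();
}
```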

@inikep requested a review from dlenev on December 5, 2025 08:29
@dlenev (Contributor) left a comment

Hello Przemek!

I have a few questions/suggestions about this patch. Please see below.
Otherwise code changes look fine to me.

Do you plan to squash the second commit into the first one before pushing?
If not, then I think it needs to reference some Jira ticket and have its own [8.0] tag in the title.

…tion in audit_log_read()/AuditJsonHandler

@inikep (Collaborator, Author) commented Dec 18, 2025

> Do you plan to squash the second commit into the first one before pushing? If not, then I think it needs to reference some Jira ticket and have its own [8.0] tag in the title.

Manish reported a similar issue, so I decided to create a separate Jira ticket: https://perconadev.atlassian.net/browse/PS-10387

…d pagination issues

@dlenev (Contributor) left a comment

LGTM.

@inikep merged commit 74dd764 into percona:8.0 on Dec 30, 2025
22 of 23 checks passed
@inikep deleted the PS-10347-8.0 branch on December 30, 2025 09:24