Add MailManInTheMiddle security arena submission #20
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Motivation
attack_succeeded.json) so the scenario meets competition gating rules.Description
scenarios/security_arena/submissions/purple_vanguard/mailmaninthemiddle/includingplugin.pywhich exportsMailManInTheMiddlePluginand implements inbox tools simulation, deterministic success checks, baseline/attack inputs, and secure defender guidance.artifacts/inbox_baseline.json,artifacts/inbox_attack.json, andartifacts/company_policy.mdcontaining the sensitive reset email, the attacker email (withhidden_textinjection), and company forwarding policy.scenarios/security_arena/plugins/registry.pyas"mailmaninthemiddle": MailManInTheMiddlePluginand updatedscenarios/security_arena/plugins/__init__.pyto use lazy imports for registry helpers to avoid circular import issues.scenario_mailmaninthemiddle.toml, a README describing objective/flow/attack vector, and an exporterscripts/export_mailmaninthemiddle_submission.pythat runs a short deterministic local evaluation and writesattack_succeeded.jsonplus baseline evidence into the submission folder.Testing
PYTHONPATH=. python scripts/export_mailmaninthemiddle_submission.pywhich executed the deterministic exporter and producedscenarios/security_arena/submissions/purple_vanguard/mailmaninthemiddle/attack_succeeded.jsonandtest_results/baseline_passed.jsonsuccessfully.attack_succeeded.jsonrecords"succeeded": trueandmanipulation_round: 1as expected.Codex Task