anthony-maio/README.md

Hello, I'm Anthony Maio.

I have been a Staff Software Engineer shipping enterprise production code for 20 years. I have taken a career sabbatical to pivot fully into independent AI safety research, focusing on agentic systems, scalable oversight, and token-efficient inter-agent communication, because I believe this work is more important and valuable.


Research themes

  • Scalable oversight & weak-verifier failure modes
    Measuring when “weaker” evaluators (including humans + smaller models) fail to detect persuasive but incorrect reasoning.

  • Evaluation-awareness / audit-shielding in agentic workflows
    How systems behave differently under “benchmark-shaped” prompts vs. realistic, high-trust production contexts.

  • Coherence-seeking & long-horizon agents
    Architectures for continuity, intervention, and monitoring “epistemic stress” in long-lived systems.

  • Inter-agent communication efficiency
    Protocol design that targets tokenization economics rather than character-count compression.
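To make the last theme concrete, here is a toy sketch of why fewer characters does not always mean fewer tokens. It is not SLIPCore's actual protocol or any real tokenizer: `count_tokens` and `TOY_VOCAB` are hypothetical names, and the greedy longest-match scheme is a simplified stand-in for the subword tokenizers that LLMs use.

```python
def count_tokens(text, vocab):
    """Count tokens under a toy greedy longest-match tokenizer.

    Any character not covered by a vocabulary entry costs one token,
    loosely mimicking how rare strings fragment under subword tokenizers.
    """
    count = 0
    i = 0
    while i < len(text):
        match_len = 1  # fallback: a single character is one token
        for piece in vocab:
            # Prefer the longest vocabulary entry that matches at position i.
            if len(piece) > match_len and text.startswith(piece, i):
                match_len = len(piece)
        i += match_len
        count += 1
    return count


# Hypothetical vocabulary: common words (with a leading space) are single tokens.
TOY_VOCAB = {"request", " review", " please"}

verbose = "request review"   # more characters, all in the vocabulary
abbreviated = "rqst rvw"     # fewer characters, none in the vocabulary

for message in (verbose, abbreviated):
    print(f"{message!r}: {len(message)} chars, {count_tokens(message, TOY_VOCAB)} tokens")
```

Under this toy vocabulary the abbreviated message is shorter in characters but costs more tokens, because none of its fragments are vocabulary entries. Real tokenizers behave differently in detail, so actual costs should be measured with the target model's tokenizer.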


Papers & preprints (PDF)

(Full index + descriptions: https://making-minds.com/research/)


Open-source & artifacts

Slipstream / SLIPCore (semantic quantization for agent coordination)

Evolutionary Adversarial Pipeline (EAP) for Bloom

Work-in-progress contribution to Bloom to evolve prompts away from benchmark artifacts and probe evaluation-awareness.


What I’m working on now

  • Red-team → blue-team pipelines for agentic deployments (prompt evolution + heterogeneous verification).
  • Protocol + security work around semantic quantization / coordination channels (efficiency and detectability).
  • Reproducible evaluations for oversight failures (CMED-style trap suites + automation).

Collaboration / hiring

If you’re building agentic systems and want help with:

  • evaluation harnesses for deceptive / persuasive error detection,
  • multi-model oversight swarms,
  • or production-grade agent communication protocols,

reach out: anthony@making-minds.ai

Pinned

  1. slipcore

    SLIPCore - Streamlined Interagent Protocol for LLM agent communication

    Python 1

  2. claude-api-desktop

    A modern, feature-rich desktop client for the Anthropic Claude API with streaming support, extended context capabilities, and an intuitive graphical interface.

    Python 1

  3. google-sheets-timezone-converter

    The Timezone Converter add-on provides a powerful custom function for Google Sheets that converts datetime values between different time zones. It supports the full IANA timezone database and dayli…

    JavaScript

  4. hass-adhoc-sql-execution

    Home Assistant custom component for executing custom SQL queries on a MariaDB/MySQL database via a service call.

    Python 1