AIProbe: Uncovering Systemic and Environment Errors in Autonomous Systems Using Differential Testing

Full paper: "Uncovering Systemic and Environment Errors in Autonomous Systems Using Differential Testing", Rahil P. Mehta^*, Yashwanthi Anand^*, Manish Motwani and Sandhya Saisubramanian.

AIProbe is a black-box testing framework for identifying and attributing execution anomalies in autonomous agents. It systematically generates diverse environment-task configurations and uses a search-based oracle planner to determine whether each detected anomaly stems from the agent's model or from the environment itself.

AIProbe is designed to:

Detect execution anomalies
Attribute failures to agent errors or environment errors
Work across discrete/continuous and single/multi-agent domains
Operate in black-box settings with no access to the agent’s internal model or reward function
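
The attribution idea described above can be illustrated with a minimal sketch. It is not AIProbe's implementation: the breadth-first search stands in for the search-based oracle planner, the corridor environment is a toy, and all names (`oracle_plan`, `attribute`, `corridor`) are hypothetical. The decision rule is also an assumption: a failure is blamed on the agent when the planner can still solve the task, and on the environment when the planner cannot.

```python
from collections import deque


def oracle_plan(start, is_goal, successors, max_depth=50):
    """Breadth-first search used here as a stand-in oracle planner.

    Returns a list of actions reaching a goal state, or None if no plan
    of at most max_depth actions exists.
    """
    frontier = deque([(start, [])])
    seen = {start}
    while frontier:
        state, path = frontier.popleft()
        if is_goal(state):
            return path
        if len(path) >= max_depth:
            continue
        for action, nxt in successors(state):
            if nxt not in seen:
                seen.add(nxt)
                frontier.append((nxt, path + [action]))
    return None


def attribute(agent_succeeded, plan):
    """Assumed rule: compare the agent's outcome with the planner's."""
    if agent_succeeded:
        return "no anomaly"
    # The task is solvable (a plan exists) but the agent failed anyway.
    if plan is not None:
        return "agent error"
    # Even the planner cannot solve this configuration.
    return "environment error"


def corridor(blocked, length=5):
    """Toy 1-D corridor: move left or right, blocked cells are impassable."""
    def successors(pos):
        for action, step in (("left", -1), ("right", 1)):
            nxt = pos + step
            if 0 <= nxt < length and nxt not in blocked:
                yield action, nxt
    return successors


def at_goal(pos):
    return pos == 4


solvable = oracle_plan(0, at_goal, corridor(blocked=set()))
unsolvable = oracle_plan(0, at_goal, corridor(blocked={2}))
print(attribute(False, solvable))    # agent failed on a solvable task
print(attribute(False, unsolvable))  # nobody can solve this task
```

Because only the agent's observed outcome and the planner's result are compared, the sketch needs no access to the agent's internal model or reward function, which matches the black-box setting.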

Dependencies

Dependencies include (but are not limited to):

gymnasium
numpy
scipy
matplotlib
tqdm
torch (for PPO models)
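
As a convenience, the listed packages can be checked before running anything. This is a small sketch rather than a script shipped with the repository; the helper name `missing_packages` is hypothetical, and it assumes each package's import name matches the name listed above.

```python
import importlib.util

# Package names taken from the dependency list above.
REQUIRED = ["gymnasium", "numpy", "scipy", "matplotlib", "tqdm", "torch"]


def missing_packages(names):
    """Return the names that cannot be located as importable modules."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_packages(REQUIRED)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All listed dependencies are importable.")
```

The check only confirms that a module can be found; it does not verify versions or the additional per-domain requirements.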

Each domain we evaluate has additional requirements. To install them, cd into src/<DOMAIN_NAME> and follow the environment installation instructions there.

Repository Contents

The source code for generating environment-task configurations and for evaluating the planner and agent on those configurations is contained in src/<DOMAIN_NAME>. The domains include ACAS Xu, Coop Navi, Bipedal Walker, Flappy Bird, and Lava.

Details of each environment and the instructions to run AIProbe on these domains are provided within the respective folders.

License

This work is released under the Creative Commons Zero v1.0 Universal (CC0-1.0) license.

ANSWER-OSU/AIProbe
