Skip to content

Conversation

@DBay-ani
Copy link

@DBay-ani DBay-ani commented Jul 3, 2019

Committing code to use the feedback from safety monitors in a variety of
ways (excluding use of CPO), In particular, the monitor is worked into
the environment and used to provide:
*activation of a safety fallback policy
*modification of the reward to encorporate the monitor signal
*provide additional features to the observations received by the
controller.

DBay-ani added 2 commits July 2, 2019 21:55
ways (excluding use of CPO), In particular, the monitor is worked into
the environment and used to provide:
*activation of a safety fallback policy
*modification of the reward to encorporate the monitor signal
*provide additional features to the observations received by the
controller.
@DBay-ani DBay-ani force-pushed the monitorEncorporation_features_fallbackControllerInTrainingTesting_andRewardEncorporation branch from a108c08 to 99b1ffe Compare July 3, 2019 16:55
DBay-ani added 5 commits July 9, 2019 21:34
…rollerInTrainingTesting_andRewardEncorporation
…e training in envs/monitorEncorporated_env.py train/train_monitorEncorporated_straight_planner.py . It is still very much a toy, but it matches the non-toy CPO safety constraints Edward is using . Also, made some trivial adjustments in the envs/monitorEncorporated_env.py to allow the quantitative monitor subformulas to the action - this leverages 95% of infulstructure already there, a very trivial change. I think I took it out before intentionally since I thought in matched the monitor use-cases better.... we might be abusing terminology to call this stuff a monitor - maybe...
…hat was noted in the previous commit log for train/train_monitorEncorporated_straight_planner.py , the fallback controller and quantitative monitor used are toy-ish, but they match the non-toy work Edward is performing with CPO.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant