Skip to content

fix: make webhooks more resilient #316

@GHkrishna

Description

@GHkrishna

Seems like we are missing resiliency while emitting webhooks. As discussed with @sairanjit webhooks being missed can cause some inconsistency of states in platform and agent. Need to check the impact of this in terms of state recovery in case of failures (like are there any hard fetches being made to get synced with the latest state even if the webhooks fail to update states)

Right now we have:

  • no way to retry webhooks
  • Error handling/queuing of webhooks, to be tried later.
    • Statefull vs stateless ?
    • How long do we hold onto them
  • Can the retry logic be abused by a third party consumer by for e.g. purposefully failing webhooks, etc? [This might include understanding trusted and UN-trusted entities and their direct interaction with agent via webhooks/websockets]

Similarly, need to look into websockets.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementThis enhances the overall experience/code quality/flows, etctriageNeed discussion

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions