Enable strict node scheduling for cluster app instances. #5508

andrewd-zededa · 2025-12-23T22:33:17Z

Description

Currently, VM app instances are given a default node affinity of 'preferred', which attempts to schedule the instance on the DesignatedNodeID but allows it to fail over to another healthy node.

This PR extends DesignatedNodeID with an affinity type so that a required node affinity can be requested. With required affinity, the app instance runs only if the DesignatedNodeID is available and healthy; this also disables failover for the app instance.
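To illustrate the difference between the two affinity types, here is a minimal sketch of how each could map onto a Kubernetes nodeAffinity stanza. The `AffinityType` enum, the `nodeAffinity` helper, and matching on the `kubernetes.io/hostname` label are assumptions made for illustration; they are not taken from this PR's diff, and the real eve-api names may differ. Only the Kubernetes field names (`requiredDuringSchedulingIgnoredDuringExecution`, `preferredDuringSchedulingIgnoredDuringExecution`) are standard.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// AffinityType is a hypothetical stand-in for the affinity type this PR adds
// alongside DesignatedNodeID; the real eve-api enum names may differ.
type AffinityType int

const (
	AffinityPreferred AffinityType = iota // try the designated node, allow failover
	AffinityRequired                      // run only on the designated node
)

// nodeAffinity returns a Kubernetes-style nodeAffinity stanza as a generic map.
// Matching on the kubernetes.io/hostname label is an assumption of this sketch.
func nodeAffinity(designatedNodeID string, t AffinityType) map[string]any {
	match := map[string]any{
		"matchExpressions": []any{
			map[string]any{
				"key":      "kubernetes.io/hostname",
				"operator": "In",
				"values":   []any{designatedNodeID},
			},
		},
	}
	if t == AffinityRequired {
		// Hard constraint: the scheduler will not place the pod elsewhere.
		return map[string]any{
			"requiredDuringSchedulingIgnoredDuringExecution": map[string]any{
				"nodeSelectorTerms": []any{match},
			},
		}
	}
	// Soft constraint: the scheduler favors the node but may pick another.
	return map[string]any{
		"preferredDuringSchedulingIgnoredDuringExecution": []any{
			map[string]any{"weight": 100, "preference": match},
		},
	}
}

func main() {
	for _, t := range []AffinityType{AffinityPreferred, AffinityRequired} {
		out, err := json.MarshalIndent(nodeAffinity("node-1", t), "", "  ")
		if err != nil {
			panic(err)
		}
		fmt.Println(string(out))
	}
}
```

The required form leaves the scheduler no alternative node, which is why failover is effectively disabled for such an app instance.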

PR dependencies

EVE-API: lf-edge/eve-api#130

How to test and validate this PR

Deploy 3 EVE HV=k nodes.
Configure an EVE-API EdgeNodeCluster config for the nodes.
Deploy a VM app instance with Affinity=Required.
Initiate a failure of the node hosting the app instance, e.g. disable the cluster network.
The app should not fail over to another node.
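The expected outcome of these steps can be modeled with a small sketch. It is a simplified, hypothetical placement model (the `Node` type and `placeApp` function are invented for illustration and are not part of EVE or Kubernetes); it only captures the rule that a required affinity never falls back to another node, while a preferred affinity may.

```go
package main

import "fmt"

// Node is a hypothetical, simplified view of a cluster node.
type Node struct {
	ID      string
	Healthy bool
}

// placeApp returns the node the app would run on and whether it can run at all.
// With required=true the app runs only on a healthy designated node.
// With required=false it falls back to the first other healthy node.
func placeApp(nodes []Node, designatedNodeID string, required bool) (string, bool) {
	for _, n := range nodes {
		if n.ID == designatedNodeID && n.Healthy {
			return n.ID, true
		}
	}
	if required {
		// Strict scheduling: no failover to other cluster nodes.
		return "", false
	}
	for _, n := range nodes {
		if n.Healthy {
			return n.ID, true
		}
	}
	return "", false
}

func main() {
	// node-1 hosts the app and has failed, as in the test steps above.
	nodes := []Node{{"node-1", false}, {"node-2", true}, {"node-3", true}}

	id, ok := placeApp(nodes, "node-1", true)
	fmt.Printf("required:  node=%q scheduled=%v\n", id, ok)

	id, ok = placeApp(nodes, "node-1", false)
	fmt.Printf("preferred: node=%q scheduled=%v\n", id, ok)
}
```

In this model the required case reports the app as unscheduled while the designated node is down, which matches the behavior the test is meant to confirm.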

Changelog notes

Enable strict node scheduling for app instances on clustered HV=k EVE-OS nodes.

PR Backports

14.5-stable: No, as the feature is not available there.
13.4-stable: No, as the feature is not available there.

Checklist

I've provided a proper description
I've added the proper documentation
I've tested my PR on amd64 device
I've tested my PR on arm64 device
I've written the test verification instructions
I've set the proper labels to this PR

And the last but not least:

I've checked the boxes above, or I've provided a good reason why I didn't
check them.

Please, check the boxes above after submitting the PR in interactive mode.

codecov · 2025-12-24T01:02:13Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 28.08%. Comparing base (2281599) to head (05ce048).
⚠️ Report is 182 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #5508      +/-   ##
==========================================
+ Coverage   19.52%   28.08%   +8.55%     
==========================================
  Files          19       19              
  Lines        3021     2314     -707     
==========================================
+ Hits          590      650      +60     
+ Misses       2310     1520     -790     
- Partials      121      144      +23


andrewd-zededa · 2026-01-05T18:10:04Z

Replaced the placeholder commit with the real vendored eve-api which recently merged.

App will only run if scheduling succeeds for the requested node ID. Set DesignatedNodeID and AffinityType Required. With strict node scheduling, the app will not fail over to other cluster nodes.

Signed-off-by: Andrew Durbin <andrewd@zededa.com>

go get github.com/lf-edge/eve-api/go@4149b9d
go mod tidy
go mod vendor

Signed-off-by: Andrew Durbin <andrewd@zededa.com>

andrewd-zededa · 2026-01-12T19:29:48Z

Rebased off master

github-actions bot requested review from OhmSpectator, eriknordmark, milan-zededa, naiming-zededa, rene, rouming, rucoder, shjala, uncleDecart and zedi-pramodh December 23, 2025 22:33

andrewd-zededa force-pushed the eve-k-node-affinity branch 4 times, most recently from 8667057 to f9a3f79 Compare December 24, 2025 00:03

andrewd-zededa force-pushed the eve-k-node-affinity branch from f9a3f79 to 9bcf24e Compare January 5, 2026 18:09

andrewd-zededa added 2 commits January 12, 2026 11:29

eve-api for affinity

05ce048

andrewd-zededa force-pushed the eve-k-node-affinity branch from 9bcf24e to 05ce048 Compare January 12, 2026 19:29
