
[ML] Add per allocation and per deployment memory metadata fields to …#6

Open
MitchLewis930 wants to merge 1 commit into pr_016_before from pr_016_after

Conversation

@MitchLewis930

@MitchLewis930 MitchLewis930 commented Jan 30, 2026

PR_016


Note

Medium Risk
Touches ML deployment task serialization (new transport version) and changes the memory estimation formula used for allocation/stats, which could affect deployment sizing and autoscaling decisions.

Overview
Adds support for model-provided memory requirements in ML deployments. StartTrainedModelDeploymentAction.TaskParams now carries per_deployment_memory_bytes and per_allocation_memory_bytes, serializes them behind a new transport version (V_8_500_064), and includes them in toXContent/parsing.

Updates the required native memory estimation. estimateMemoryUsageBytes(...) now takes the new metadata plus number_of_allocations and computes max(240MB + 2*model_size, per_deployment + per_allocation*allocations + model_size) (with ELSER v1 still pinned to a fixed value), and TransportGetTrainedModelsStatsAction wires this into the required_native_memory_bytes stats calculation.
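The revised estimate described above can be sketched as follows. This is a minimal illustration of the formula, not the actual Elasticsearch source: the method name matches the PR, but the signature, the helper class, and the omission of the ELSER v1 special case are assumptions.

```java
// Hedged sketch of the revised memory estimate described in the PR.
// Assumed signature; ELSER v1's pinned fixed value is omitted for brevity.
class MemoryEstimate {
    static final long BASELINE_BYTES = 240L * 1024 * 1024; // legacy 240MB overhead

    static long estimateMemoryUsageBytes(long modelSizeBytes,
                                         long perDeploymentMemoryBytes,
                                         long perAllocationMemoryBytes,
                                         int numberOfAllocations) {
        // Legacy estimate: 240MB + 2 * model size.
        long legacy = BASELINE_BYTES + 2 * modelSizeBytes;
        // Refined estimate built from the new metadata fields.
        long refined = perDeploymentMemoryBytes
                + perAllocationMemoryBytes * (long) numberOfAllocations
                + modelSizeBytes;
        // The larger of the two keeps the estimate backward compatible:
        // models without metadata (zeros) fall back to the legacy value.
        return Math.max(legacy, refined);
    }
}
```

When the metadata fields are zero, refined collapses to model_size alone, so the legacy term dominates and behavior is unchanged for existing models.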

Deployment start/updates propagate the metadata from TrainedModelConfig into task params across assignment/task update paths, and tests/QA add coverage for the new estimation behavior and request helpers.

Written by Cursor Bugbot for commit 2e13a9f. This will update automatically on new commits.

…the trained models config (elastic#98139)

To improve the required memory estimation of NLP models, this PR introduces two new metadata fields: per_deployment_memory_bytes and per_allocation_memory_bytes.

- per_deployment_memory_bytes is the memory required to load the model into the deployment.
- per_allocation_memory_bytes is the temporary additional memory used during inference by each allocation.

This PR extends the memory usage estimation logic while ensuring backward compatibility.

In a follow-up PR, I will adjust the assignment planner to use the refined memory usage information.
@cursor cursor bot left a comment


Cursor Bugbot has reviewed your changes and found 1 potential issue.

Bugbot Autofix is OFF. To automatically fix reported issues with Cloud Agents, enable Autofix in the Cursor dashboard.

  parentTaskId,
- modelSizeStatsListener
+ modelSizeStatsListener,
+ numberOfAllocations

Global allocation sum used for per-model memory estimation

High Severity

The numberOfAllocations is calculated by summing allocations across ALL deployments, but this single total is then used to calculate memory estimates for EACH individual model. When a model has perAllocationMemoryBytes set, the memory formula uses perAllocationMemoryBytes * numberOfAllocations, so using the global sum instead of each model's specific allocation count produces incorrect memory estimates. For example, if Model A has 2 allocations and Model B has 3 allocations, both models would incorrectly use 5 allocations in their memory calculation.
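The overestimate the reviewer describes can be shown with the numbers from the comment. This is a hypothetical illustration, not the actual code: the estimate helper and the 10MB figure are assumptions made solely to demonstrate the effect of using the global allocation sum.

```java
// Hypothetical illustration of the reported bug (numbers assumed).
class GlobalSumBug {
    // Simplified per-allocation portion of the memory estimate.
    static long perAllocationCost(long perAllocationMemoryBytes, int numberOfAllocations) {
        return perAllocationMemoryBytes * (long) numberOfAllocations;
    }

    public static void main(String[] args) {
        long perAllocBytes = 10L * 1024 * 1024; // assume 10MB per allocation
        int modelA = 2, modelB = 3;             // allocations per model
        int globalSum = modelA + modelB;        // 5, as the buggy code computes

        // Buggy: Model A's estimate uses the global sum of 5 allocations.
        long buggy = perAllocationCost(perAllocBytes, globalSum);
        // Correct: Model A should use its own 2 allocations.
        long correct = perAllocationCost(perAllocBytes, modelA);

        // The buggy estimate is 30MB too high for Model A alone.
        System.out.println((buggy - correct) / (1024 * 1024) + "MB overestimate");
    }
}
```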

Additional Locations (1)
