Skip to content

Conversation

@mayrstefan
Copy link
Contributor

General information

It looks like NVIDIA is deprecating the power_state XML-Element in it's nvidia-smi output. We need to allow the deprecation message as a valid input when parsing the the nvidia_smi section.

Bug reports

See report in https://forum.checkmk.com/t/parsing-of-section-nvidia-smi-failed/56724)

From the crash file:

  • OS: Ubuntu 22.04.5 LTS
  • Checkmk version: 2.4.0p10
  • Detailed steps to reproduce the bug use the attachment from the forum post:
  1. create dummy host
  2. use a data source program for this host with cat nvidia_smi_plugin_out.txt

Looks like nvsmi_device_v13.dtd allows a deprecation message for the power_state element: <power_state>Requested functionality has been deprecated</power_state>

This is unexpected for Checkmk which causes a crash when parsing the section.

Proposed changes

  • What is the expected behavior: accept the deprecation message and proceed parsing the section
  • What is the observed behavior: it crashes
  • Is this a new problem? What made you submit this PR (new firmware, new device, changed device behavior)? New version of nvidia-smi cli

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants