SWAPI: Fix I-ALiRT error resulting from new sweep table number #2485

hafarooki · 2025-12-05T21:49:17Z

Change Summary

Overview

I was shown in an email that I-ALiRT is broken because this portion of the code is failing to handle NaNs. I also found that an interval where the same solution kept being returned was having that issue because the zeros were not being excluded from curve_fit. I am working on another potential PR to improve this fitting procedure, but this PR should fix those particular bugs. However, I haven't tested it properly (I did validate that the code runs with a custom unit test but the test is not well-written and not included in the PR) and would like help doing that. Also, there is a potential issue with how I did it: If ALL of the counts are excluded by the mask, there will be NO samples passed to curve fit.

New Dependencies

N/A

New Files

N/A

Deleted Files

N/A

Updated Files

Just updated process_swapi.py to exclude zeros (as otherwise the initial guess is left as the solution and the covariance is infinite) and nan/infinite points (as otherwise there is an error that breaks I-ALiRT)

updated file 1
- description of change 1 in file 1
- description of change 2 in file 2
updated file 2
- descipriton of change 1 in file 2

Testing

I think there should be a test or two added to ensure the new behavior is correct. I haven't gotten around to creating such tests yet as I am not sure how the software team would prefer them to be implemented and I understand that this is urgent..

hafarooki · 2025-12-05T21:49:27Z

@laspsandoval

hafarooki · 2025-12-05T21:51:30Z

Here is the test I wrote that shows that the code does at least run (I guess you can also just use the existing tests with simulated data...):

@pytest.mark.external_test_data
def test_process_real_packet():
    from pathlib import Path
    packets_directory = Path('/home/hafarooki/projects/imap_processing/swapi-test/packets')
    
    xtce_ialirt_path = (
        imap_module_directory / "ialirt" / "packet_definitions" / "ialirt.xml"
    )
    
    calibration_file = pd.read_csv(
        f"{imap_module_directory}/tests/ialirt/data/l0/swapi_ialirt_energy_steps.csv"
    )

    lines = []

    for packet_path in packets_directory.iterdir():    
        sc_xarray_data = packet_file_to_datasets(
            packet_path, xtce_ialirt_path, use_derived_value=False
        )[478]
        lines.append(packet_path.name)
        
        lines.append('\tswapi_coin_cnt# sum: ' + ' '.join(f'{str(key).split("_")[-1]}:{sc_xarray_data[key].sum().item()}'
                            for key in sc_xarray_data.keys()
                            if str(key).startswith('swapi_coin_cnt')))
        lines.append('\ttotal count: ' + str(sum(sc_xarray_data[key].sum().item()
                                        for key in sc_xarray_data.keys()
                                        if str(key).startswith('swapi_coin_cnt'))))
    
        swapi_product = process_swapi_ialirt(sc_xarray_data, calibration_file)
        for i in range(len(swapi_product)):
            density = swapi_product[i]['swapi_pseudo_proton_density']
            temperature = swapi_product[i]['swapi_pseudo_proton_temperature']
            speed = swapi_product[i]['swapi_pseudo_proton_speed']
            lines.append(f'\ti={i} ({swapi_product[i]["met_in_utc"]})')
            lines.append(f'\t\tpseudo moments: density={density}, temperature={temperature}, speed={speed}')

    output_file = packets_directory.parent / 'output.txt'

    output_file.write_text('\n'.join(lines))
    
    assert False, 'need to add assertions comparing with omni'
    ```

laspsandoval

Take a look at my comments.

laspsandoval · 2025-12-05T21:58:56Z

imap_processing/ialirt/l0/process_swapi.py

                60000 * (initial_speed_guess / 400) ** 2,
            ]
        )
+        five_point_range = range(max_index - 2, max_index + 2 + 1)


Can we keep the range the same as before?

The previous range was incorrect -- I should have put this in a separate PR. It was using 6 points instead of 5 points.

I included this here by accident -- should I put this in a separate PR or just update the name/description of this PR?

This might take more time if we change the range. Bishwas will need to review and we will need to fix the test. Could we leave it for now and update it in the next PR?

Change to this:

five_point_range = range(max_index - 3, max_index + 3)

Its okay to leave the energy range like this now.

Ok. We will need to update the test that is failing:
test_optimize_parameters

In Figure 4 of Lee et al. 2025, the I-ALiRT paper, it looks like they actually used 6 points to fit.
https://link.springer.com/article/10.1007/s11214-025-01244-9

imap_processing/ialirt/l0/process_swapi.py

Bishwasls

Its not critical to change the energy bins used for fitting.

Bishwasls · 2025-12-05T22:24:25Z

imap_processing/ialirt/l0/process_swapi.py

                60000 * (initial_speed_guess / 400) ** 2,
            ]
        )
+        five_point_range = range(max_index - 2, max_index + 2 + 1)


Its okay to leave the energy range like this now.

hafarooki · 2025-12-07T03:02:55Z

I reverted to using 6 points for fitting. Locally, I confirmed that the failing test once again passes.

hafarooki · 2025-12-07T03:09:45Z

One thing that should be discussed is how we handle cases where less than 3 points are available for fitting. With less than 3 points, the 3-parameter Maxwellian model cannot be constrained. Some sort of fill value should be used instead?

bishwassth

This version looks good to me.

bishwassth · 2025-12-07T03:27:22Z

One thing that should be discussed is how we handle cases where less than 3 points are available for fitting. With less than 3 points, the 3-parameter Maxwellian model cannot be constrained. Some sort of fill value should be used instead?

That's a good point. Did you see any case where less than three points are available? I always thought we will have at least three points available for fitting, but may be not due to compression of the counts in the I-ALiRT packets.

hafarooki · 2025-12-07T04:43:58Z

So far I have seen a case with only 3 points available but not one with less than 3. However would be good to include the edge case in case it ever happens so that the I-ALiRT service is not interrupted again.

greglucas

Do you have some example tests so that we can add some assertions on what you want to happen in the various cases.

Something like the following pseudo code to test just this curve fitting function.

x = np.arange(63)
# Add a nan to our x that should be ignored in the fits
x[50] = np.nan
output = optimize_pseudo_parameters(x, ...)
assert output ...

imap_processing/ialirt/l0/process_swapi.py

greglucas · 2025-12-08T12:59:21Z

Here is a link to the failing test you should be able to run locally:
https://github.com/IMAP-Science-Operations-Center/imap_processing/actions/runs/19998051126/job/57349485125?pr=2485#step:7:2712

Co-authored-by: Greg Lucas <greg.m.lucas@gmail.com>

… into fix-nan-handling

hafarooki · 2025-12-08T14:35:14Z

Something strange I just realized: Why is there even a nan/infinite value in the xdata (as the stack trace in the original email that raised this issue shows) in the first place?

In process_swapi.py, there is this bit of code:

    # Find the sweep's energy data for the latest time, where sweep_id == 2
    subset = calibration_lut_table[
        (calibration_lut_table["timestamp"] == calibration_lut_table["timestamp"].max())
        & (calibration_lut_table["Sweep #"] == 2)
    ]
    if subset.empty:
        energy_passbands = np.full(NUM_IALIRT_ENERGY_STEPS, np.nan, dtype=np.float64)
    else:
        subset = subset.sort_values(["timestamp", "ESA Step #"])
        energy_passbands = (
            subset["Energy"][:NUM_IALIRT_ENERGY_STEPS].to_numpy().astype(float)
        )

Note in particular that when the data subset is empty, nans are provided as energy.

Is this appropriate behavior? Maybe instead of masking out nans in optimize_pseudo_parameters, we should not give it nans in the first place.

I am not sure what that line of code is meant to be doing exactly---under what circumstances would subset be empty? Clearly, this did end up happening in the real application. I think this implies that calibration_lut_table does not contain any entries for its last timestamp at Sweep # = 2. But I do not know under what circumstances it would be expected to happen.

This reverts commit 194115e.

greglucas · 2025-12-08T15:50:29Z

Something strange I just realized: Why is there even a nan/infinite value in the xdata (as the stack trace in the original email that raised this issue shows) in the first place?

Note in particular that when the data subset is empty, nans are provided as energy.

Is this appropriate behavior? Maybe instead of masking out nans in optimize_pseudo_parameters, we should not give it nans in the first place.

@hafarooki I agree with you here. If that is the case, we should probably just exit/return early because there is no point in even continuing on after that subset.empty call.

Another thing that is slightly confusing to me and might help is to make optimize_pseudo_parameters() function only take in one sweep at a time, it doesn't need to do any looping internally, then it returns only the 3 values it found. We move the loop over sweeps outside of this and populate the dictionary of return values during the loop in the main function rather than the internal function.

greglucas · 2025-12-08T15:51:24Z

Why do we hardcode sweep # == 2? We have actually changed to sweep 3 now, so is there a way for I-ALiRT to know which sweep # to lookup or do we have to hardcode 3 now and make sure we are in-sync manually.

hafarooki · 2025-12-08T20:53:38Z

Good catch! After consulting with the engineering team, I concluded that we can use swapi_version from the packet file to determine which table to use. For now, I just used the first one in the packet, but there are technically 60, one for each second, so we may have to account for potentially multiple in one file. For now this should be good enough

imap_processing/ialirt/l0/process_swapi.py

laspsandoval · 2025-12-09T18:00:39Z

pre-commit.ci autofix

for more information, see https://pre-commit.ci

imap_processing/ialirt/l0/process_swapi.py

…ng into fix-nan-handling

greglucas · 2025-12-10T16:56:56Z

I'm sorry, but I'm not going to review this. You have touched too many things since the last time. Please keep updates minimal!

greglucas

This is touching L2 code now too. Please remove that

hafarooki · 2025-12-10T18:51:10Z

I agree, the scope of this PR has changed because the issue turned out to be different from the original one, and we have identified some other issues along the way. I will split this into multiple PRs once we finalize the changes to be made.

ejzirn · 2025-12-11T00:41:00Z

A lot has happened today for the SWAPI team in terms of how to process the I-ALiRT packets to get our pseudo moments. Because of this, I think we should pause with the PRs for now.

mask zeros and nans from swapi pseudo parameter fit

e8e3052

add assertion

c2f0729

laspsandoval requested a review from bishwassth December 5, 2025 21:52

laspsandoval assigned hafarooki Dec 5, 2025

laspsandoval self-requested a review December 5, 2025 21:52

laspsandoval added the bug Something isn't working label Dec 5, 2025

laspsandoval added this to the December 2025 milestone Dec 5, 2025

laspsandoval reviewed Dec 5, 2025

View reviewed changes

Bishwasls reviewed Dec 5, 2025

View reviewed changes

Revert to using 6 points for fitting

6519994

hafarooki requested a review from laspsandoval December 7, 2025 03:08

bishwassth approved these changes Dec 7, 2025

View reviewed changes

greglucas reviewed Dec 8, 2025

View reviewed changes

imap_processing/ialirt/l0/process_swapi.py Outdated Show resolved Hide resolved

imap_processing/ialirt/l0/process_swapi.py Outdated Show resolved Hide resolved

imap_processing/ialirt/l0/process_swapi.py Outdated Show resolved Hide resolved

hafarooki and others added 4 commits December 8, 2025 08:01

Update imap_processing/ialirt/l0/process_swapi.py

042d29e

Co-authored-by: Greg Lucas <greg.m.lucas@gmail.com>

use nan as solution if less than 3 points available for fitting

4b75b36

Merge branch fix-nan-handling of github.com:hafarooki/imap_processing…

5b02024

… into fix-nan-handling

remove unnecessary check on xdata

194115e

Revert "remove unnecessary check on xdata"

0e8ac25

This reverts commit 194115e.

hafarooki requested a review from greglucas December 8, 2025 15:11

Add unit tests

91a3d69

dynmically determine the sweep number

898da2e

greglucas reviewed Dec 9, 2025

View reviewed changes

imap_processing/ialirt/l0/process_swapi.py Outdated Show resolved Hide resolved

Add tests for swapi_product structure and behavior

7825f93

pre-commit-ci bot and others added 2 commits December 9, 2025 18:01

[pre-commit.ci] auto fixes from pre-commit.com hooks

08d4c01

for more information, see https://pre-commit.ci

use a common function to select the first 63 energy passbands

9d8bf15

hafarooki commented Dec 9, 2025

View reviewed changes

imap_processing/ialirt/l0/process_swapi.py Outdated Show resolved Hide resolved

hafarooki added 2 commits December 9, 2025 16:12

Merge branch 'fix-nan-handling' of github.com:hafarooki/imap_processi…

bf12e4f

…ng into fix-nan-handling

apply formatting changes

0342310

hafarooki requested review from bishwassth and greglucas December 9, 2025 21:20

hafarooki changed the title ~~mask zeros and nans from swapi pseudo parameter fit~~ SWAPI: Fix I-ALiRT error resulting from new sweep table number Dec 9, 2025

hafarooki added 3 commits December 9, 2025 16:37

remove unneeded nan/infinite checks

255b0f4

remove unneeded test

edb6f1e

fix if statement

8920022

greglucas requested changes Dec 10, 2025

View reviewed changes

laspsandoval marked this pull request as draft December 10, 2025 17:45

laspsandoval requested a review from ejzirn December 10, 2025 23:21

This was referenced Dec 11, 2025

BUG - I-ALiRT crash due to change in sweep table version number #2506

Open

BUG - exclusion of zeros in SWAPI I-ALiRT fitting #2507

Open

SWAPI: Fix I-ALiRT error resulting from new sweep table number #2485

Are you sure you want to change the base?

SWAPI: Fix I-ALiRT error resulting from new sweep table number #2485

Uh oh!

Conversation

hafarooki commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Change Summary

Overview

New Dependencies

New Files

Deleted Files

Updated Files

Testing

Uh oh!

hafarooki commented Dec 5, 2025

Uh oh!

hafarooki commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

laspsandoval left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Bishwasls left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hafarooki commented Dec 7, 2025

Uh oh!

hafarooki commented Dec 7, 2025

Uh oh!

bishwassth left a comment

Choose a reason for hiding this comment

Uh oh!

bishwassth commented Dec 7, 2025

Uh oh!

hafarooki commented Dec 7, 2025

Uh oh!

greglucas left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

greglucas commented Dec 8, 2025

Uh oh!

hafarooki commented Dec 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

greglucas commented Dec 8, 2025

Uh oh!

greglucas commented Dec 8, 2025

Uh oh!

hafarooki commented Dec 8, 2025

Uh oh!

Uh oh!

laspsandoval commented Dec 9, 2025

Uh oh!

Uh oh!

greglucas commented Dec 10, 2025

Uh oh!

greglucas left a comment

hafarooki commented Dec 5, 2025 •

edited

Loading

hafarooki commented Dec 5, 2025 •

edited

Loading

hafarooki commented Dec 8, 2025 •

edited

Loading