Fix: Checkpoint upgrade #334

kozlov721 · 2025-12-16T03:24:10Z

Purpose

Fixes various issues with upgrading checkpoints from 0.3.11 to 0.4.x

Specification

Removes DebugLoader from saved config
Fixed bugs in checkpoint loading
Fixed incorrect execution order generation
Fixed a bug in PPLCNet regarding wrong activations
Improved the upgrade command

Dependencies & Potential Impact

None / not applicable

Deployment Plan

None / not applicable

Testing & Validation

None / not applicable

klemen1999 · 2025-12-19T08:51:41Z

luxonis_train/__main__.py

    @param opts: A list of optional CLI overrides of the config file.
    """
-    create_model(config, opts, weights=weights).infer(
+    create_model(config, opts, weights=weights, debug_mode=True).infer(


Is this expected to have debug_mode=True?

luxonis_train/config/config.py

luxonis_train/core/core.py

klemen1999 · 2025-12-19T09:07:31Z

luxonis_train/lightning/luxonis_lightning.py

+        old_order = ckpt.get("execution_order")
+        new_order = get_model_execution_order(self)

        for node_name, node in self.nodes.items():


Is there a way to make this whole for loop a bit more readable and not so deeply nested? I realize there are several fallback steps but can we introduce some helper functions and add small comments on each fallback step so that this code is easier to understand and maintene?
Maybe we can try putting it into a LLM asking it clean it up and make it more readable - but we need to have some test cases to make sure we don't actually change any logic

Yeah I'll look into making this more readable

klemen1999 · 2025-12-19T09:16:06Z

luxonis_train/lightning/utils.py


    for name, module in model.named_modules():
-        if name and list(module.parameters()):
+        if list(module.parameters()) and not list(module.children()):


Are execution orders generated and saved so far in the checkpoints still generally correct or we can't use them because of the bug we had?
And what was the bug?

The bug here is that the module names are saved among the parameters like this:

class Foo: conv1 = ... conv2 = ... class Bar: conv3 = ...

This would generate [Foo, Foo.conv1, Foo.conv2, Bar, Bar.conv3]

While this refactor:

class Foo: conv1 = ... class Bar: conv2 = ... conv3 = ...

would generate [Foo, Foo.conv1, Bar, Bar.conv2, Bar.conv3].

This makes the order not match due to the inclusion of the parent module.

I don't quite understand this example tbh. You show here 2 different networks no?

Yeah I wasn't very detauled. These would be two implemenations of the same network using 2 modules where the layers are just run sequentially.

Ok so if I understand correct we have the parents in the checkpoints that are currently saved whereas now we'll only have leaf nodes?
What does that mean in context of backcompatibility, reusing current execution orders,etc? For loading older weights can we filter out these parent nodes first and then try to match them?

The compatibility between 0.3.11 and 0.4.2 and so on should still work.

So the process of re-exporting to 0.3.11 and then again re-exporting with 0.4.2 should work. (or using the luxonis_train upgrade ckpt in 0.4.2)

luxonis_train/nodes/base_node.py

luxonis_train/nodes/blocks/blocks.py

kozlov721 added 21 commits November 18, 2025 12:07

partial ckpt upgrade

a1f77c2

fix version

870d41f

fix versions

56d013c

debug-true

448bf6a

safer cfg dump

cd37c53

fixes

b11c08e

upgrade

eefae67

fixes

1dd6050

updated execution order

0c8b2ed

updates

e43e661

updated debug loader

856ccb1

updated metadata parsing

eb9bf14

updated fomo upgrade

547f246

don't override custom attach index

0066ad0

fix filter task names

1b258d2

fix dump

30eda1b

fix instance seg loading

beccb17

removed wrong logs

7c8f499

fix debug loader in config

765c638

fix ocr

7aabb6f

Merge branch 'main' into fix/checkpoint-upgrade

8216b74

kozlov721 requested a review from a team as a code owner December 16, 2025 03:24

kozlov721 requested review from conorsim, klemen1999 and tersekmatija and removed request for a team December 16, 2025 03:24

github-actions bot assigned kozlov721 Dec 16, 2025

github-actions bot added fix Fixing a bug CLI Changes affecting the CLI labels Dec 16, 2025

updated execution order

f8d36d8

fix order

9b08fd0

klemen1999 reviewed Dec 19, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix: Checkpoint upgrade #334

Fix: Checkpoint upgrade #334

Uh oh!

kozlov721 commented Dec 16, 2025 •

edited

Loading

Uh oh!

klemen1999 Dec 19, 2025

Uh oh!

Uh oh!

Uh oh!

klemen1999 Dec 19, 2025

Uh oh!

kozlov721 Dec 19, 2025

Uh oh!

klemen1999 Dec 19, 2025

Uh oh!

kozlov721 Jan 8, 2026

Uh oh!

klemen1999 Jan 8, 2026

Uh oh!

kozlov721 Jan 8, 2026

Uh oh!

klemen1999 Jan 8, 2026 •

edited

Loading

Uh oh!

kozlov721 Jan 8, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix: Checkpoint upgrade #334

Are you sure you want to change the base?

Fix: Checkpoint upgrade #334

Uh oh!

Conversation

kozlov721 commented Dec 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Specification

Dependencies & Potential Impact

Deployment Plan

Testing & Validation

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

klemen1999 Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

kozlov721 commented Dec 16, 2025 •

edited

Loading

klemen1999 Jan 8, 2026 •

edited

Loading