@mturnshek
The seed wasn't being properly managed for latents generated on the accelerator, and stable timestep selection (200, 400, 600, 800) wasn't handled by the flux sampling function.

Before: [screenshot: Screenshot from 2025-11-12 15-14-12]

After: [screenshot: Screenshot from 2025-11-12 15-13-43]
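
For context, a minimal sketch of the idea (not the actual diff; assumes PyTorch, and make_validation_latents / STABLE_TIMESTEPS are illustrative names): a dedicated torch.Generator isolates the validation seed from the training RNG, and a fixed set of timesteps replaces per-check random sampling.

import torch

def make_validation_latents(shape, seed: int, device: str = "cuda") -> torch.Tensor:
    # A dedicated Generator keeps validation noise reproducible regardless
    # of how training has advanced the global RNG state.
    generator = torch.Generator(device=device).manual_seed(seed)
    return torch.randn(shape, generator=generator, device=device)

# Fixed timesteps give a stable validation loss instead of re-sampling
# random timesteps at every validation check.
STABLE_TIMESTEPS = torch.tensor([200, 400, 600, 800])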

@mturnshek (Author)

I've noticed something else related to loss in sd-scripts that confuses me. This class:

from typing import List


class LossRecorder:
    def __init__(self) -> None:
        self.loss_list: List[float] = []
        self.loss_total: float = 0.0

    def add(self, *, epoch: int, step: int, loss: float) -> None:
        if epoch == 0:
            # First epoch: grow the list by one entry per step.
            self.loss_list.append(loss)
        else:
            # Later epochs: overwrite the entry for this step, first
            # subtracting the old value so loss_total stays consistent.
            while len(self.loss_list) <= step:
                self.loss_list.append(0.0)
            self.loss_total -= self.loss_list[step]
            self.loss_list[step] = loss
        self.loss_total += loss

    @property
    def moving_average(self) -> float:
        losses = len(self.loss_list)
        if losses == 0:
            return 0.0
        return self.loss_total / losses

We use the moving_average property for logging loss/validation/step_average. However, I don't understand why we would use it for validation loss in particular: it is counterproductive for past validation loss to affect the current validation loss.
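
To illustrate with a toy run of the class above (the loss values are made up): a second validation pass still averages over entries written by the first pass.

rec = LossRecorder()
for step, loss in enumerate([1.0, 1.0]):  # first validation pass
    rec.add(epoch=0, step=step, loss=loss)
print(rec.moving_average)                 # 1.0
rec.add(epoch=1, step=0, loss=0.5)        # second pass, step 0 so far
print(rec.moving_average)                 # 0.75 -- still includes step 1 from pass 1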

On my personal fork I have added:

def reset(self) -> None:
    self.loss_list = []
    self.loss_total = 0.0

and call it between validation checks as a stopgap solution, but there is certainly a more robust way to do it.
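
One more robust option, sketched here (ValidationLossRecorder is hypothetical, not part of sd-scripts), would be a dedicated per-check accumulator that can never mix validation passes:

class ValidationLossRecorder:
    """Accumulates losses for a single validation pass only."""

    def __init__(self) -> None:
        self.loss_total: float = 0.0
        self.count: int = 0

    def add(self, loss: float) -> None:
        self.loss_total += loss
        self.count += 1

    def average_and_reset(self) -> float:
        # Mean over this validation pass only; clears state so the
        # next pass starts fresh.
        avg = self.loss_total / self.count if self.count else 0.0
        self.loss_total, self.count = 0.0, 0
        return avg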
