Skip to content

Unstable Training #29

@wenjunli-0

Description

@wenjunli-0

When I switch to the areal v0.3.3 and run the asearcher_local experiment, I got very unstable training and training cannot continue due to very high rollout time costs.

Below are the hyperparameters I used to run the experiment and the logs. (The blue curves are relatively stable and the green and grey curves are unstable and are created after I switched to areal v0.3.3)
epochs=10, 4p1t1+4p1t1, batch_size=128, max_concurrent_rollouts=48, mem_fraction_static=0.80

Image Image Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions