Skip to content

Resume training no longer works in V25? #3203

Answered by phil291
phil291 asked this question in Q&A
Discussion options

You must be logged in to vote

Digged deeper, resume training does work in V25.

Proof:

The resumed training (blue in graphs) does not do a LR-warmup, avg-loss is continued on previous training (orange).

Data-set included 68 images, max step set to 6800 on beginning.
Resumed training-cfg was identical to initial, except the new max step 20400 (6800x3) and adding the last state-directory.

Coclusion:
Bug in kohya_ss scripts, as stated in above link.
Resuming requires a max step or max epoch bigger then already achieved in "last state".
Training will stop regularly at step 13600 when new "max steps/epochs" minus already trained steps/epochs of "last state" (20400-6800) is reached, so only the difference is trained.

The i…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@phil291
Comment options

Answer selected by phil291
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
1 participant