Migrate train_on_inputs to sft-specific params #297
Conversation
src/together/cli/api/finetune.py
Outdated
formatting only
Force-pushed from 8ef8769 to 1ce13d6
)
train_on_inputs = "auto"

if dpo_beta is not None and training_method != "dpo":
Another option might be just a warning. What do you think?
Good question! I don't have a strong opinion here, but I'm leaning towards the error because a non-None value means the user intentionally set it. If they intentionally set it, it might be because they assumed DPO would be active, so we should cancel and let them know that it isn't, rather than silently continuing with an SFT job. WDYT @artek0chumak?
It's an immediate error, not something from the FT API or from a job. I think it's fine to leave it as an error.
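To make the conclusion concrete, here is a minimal sketch of the immediate client-side check being discussed. The helper name is hypothetical and not part of the library; only dpo_beta and training_method come from the diff above.

```python
# Hypothetical sketch of the immediate, client-side validation discussed above;
# validate_dpo_beta is an illustrative name, not a function from the library.
def validate_dpo_beta(dpo_beta: float | None, training_method: str) -> None:
    if dpo_beta is not None and training_method != "dpo":
        # Fail fast before submitting the job, so the user learns right away
        # that DPO is not active instead of silently running an SFT job.
        raise ValueError(
            "dpo_beta is only supported with training_method='dpo', "
            f"but training_method='{training_method}' was selected"
        )
```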
Don't forget the version bump: https://github.com/togethercomputer/together-python/blob/main/pyproject.toml#L15
Force-pushed from 01ab30b to 8331552
Co-authored-by: Artem Chumachenko <[email protected]>
Force-pushed from 8331552 to 77601a9
This PR adjusts the behavior of train_on_inputs.
If the train type is SFT, we include this parameter in the TrainingMethod and default it to "auto".
If the train type is DPO, we default to None and raise if the parameter is supplied.
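A minimal sketch of that behavior, under the assumption of illustrative names; build_training_method and the returned dicts are not the library's actual API, they only mirror the rules described above.

```python
from typing import Literal, Optional, Union

TrainOnInputs = Union[bool, Literal["auto"]]

def build_training_method(
    train_type: str,
    train_on_inputs: Optional[TrainOnInputs] = None,
) -> dict:
    if train_type == "sft":
        # SFT: train_on_inputs is part of the training method, defaulting to "auto".
        return {
            "method": "sft",
            "train_on_inputs": "auto" if train_on_inputs is None else train_on_inputs,
        }
    if train_type == "dpo":
        # DPO: train_on_inputs is not applicable, so an explicit value is rejected.
        if train_on_inputs is not None:
            raise ValueError("train_on_inputs is only supported for SFT training")
        return {"method": "dpo"}
    raise ValueError(f"unknown train_type: {train_type!r}")
```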