Skip to content

Regarding the OOM issues with fine-tuning Flan-T5-xl #58

Open
@Alsodream

Description

@Alsodream

I tried to do full fine-tuning on the Flan-T5-XL, but I have always faced the issue of OOM. I used 5 A5000 cards, each with 24GB, which should be acceptable in theory. However, I still have OOM. Do I have to use Deepspeed. In the explanation, I saw the word 'DS unload'. 'yes' means that Deepspeed was not used, right? Do any friends also conduct similar experiments? Can you tell me the reason?
1

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions