Open
Description
thx for your hard work
Is there a configuration setting corresponding to the last two rows of Table 8 in the paper? My training results are close to those of "Moto w/o Motion Token." How can I implement the configuration for "Moto w/ Motion Token"? The code for the attention-mask design seems quite complex.
Metadata
Metadata
Assignees
Labels
No labels