-
Notifications
You must be signed in to change notification settings - Fork 424
Pull requests: pytorch/torchtitan
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
refactor ParallelDims and CheckpointManager
CLA Signed
This label is managed by the Meta Open Source bot.
#1384
opened Jul 12, 2025 by
tianyu-l
Loading…
Add option for selective op AC to filter mm shapes based on fqn
CLA Signed
This label is managed by the Meta Open Source bot.
#1380
opened Jul 11, 2025 by
soulitzer
Loading…
add float8 support
CLA Signed
This label is managed by the Meta Open Source bot.
#1378
opened Jul 10, 2025 by
bdhirsh
Loading…
[llama3] add configurations for Llama 3 1B and 3B models
CLA Signed
This label is managed by the Meta Open Source bot.
#1376
opened Jul 9, 2025 by
idoh
Loading…
Add option to exclude low flop mms from every-other-mm sac policy
CLA Signed
This label is managed by the Meta Open Source bot.
#1372
opened Jul 8, 2025 by
soulitzer
Loading…
[benchmark] add h200 bench
CLA Signed
This label is managed by the Meta Open Source bot.
#1361
opened Jul 2, 2025 by
asaiacai
Loading…
Add support for saving HF format tensors with DCP
CLA Signed
This label is managed by the Meta Open Source bot.
#1351
opened Jun 27, 2025 by
ankitageorge
Loading…
[WIP] Document MX FP8 recipe
CLA Signed
This label is managed by the Meta Open Source bot.
#1350
opened Jun 27, 2025 by
lessw2020
Loading…
Autoparallel support for DP-only, DP+TP, or TP-only
CLA Signed
This label is managed by the Meta Open Source bot.
#1349
opened Jun 27, 2025 by
wconstab
Loading…
[WIP] Enable causal block mask for sdpa
CLA Signed
This label is managed by the Meta Open Source bot.
[DSV3] Add PP support for DSV3
CLA Signed
This label is managed by the Meta Open Source bot.
#1345
opened Jun 26, 2025 by
H-Huang
Loading…
[kernels][blackwell] add cutlass/cute group gemm forward for blackwell
CLA Signed
This label is managed by the Meta Open Source bot.
#1327
opened Jun 22, 2025 by
lessw2020
Loading…
Support finetuning from a pretrained model
CLA Signed
This label is managed by the Meta Open Source bot.
#1321
opened Jun 20, 2025 by
vwxyzjn
Loading…
[not for land] testing out float8 128_1_128_128 blockwise scaling
CLA Signed
This label is managed by the Meta Open Source bot.
#1317
opened Jun 18, 2025 by
vkuzo
Loading…
Do not submit: Multinode training seems to be working
CLA Signed
This label is managed by the Meta Open Source bot.
#1314
opened Jun 17, 2025 by
ahmadsharif1
•
Draft
Do not submit: Multinode is working with multiple controllers
CLA Signed
This label is managed by the Meta Open Source bot.
#1313
opened Jun 17, 2025 by
ahmadsharif1
•
Draft
[llama4][auxiliary-loss-free load balancing] update expert_bias without backward hooks
CLA Signed
This label is managed by the Meta Open Source bot.
#1304
opened Jun 16, 2025 by
hann-wang
Loading…
Finetune from pre-trained models
CLA Signed
This label is managed by the Meta Open Source bot.
#1300
opened Jun 15, 2025 by
vwxyzjn
Loading…
[not for land] Use new AC
CLA Signed
This label is managed by the Meta Open Source bot.
#1294
opened Jun 13, 2025 by
soulitzer
Loading…
WIP: Try to use monarch to run torchtitan.
CLA Signed
This label is managed by the Meta Open Source bot.
#1288
opened Jun 12, 2025 by
ahmadsharif1
•
Draft
DO NOT SUBMIT: WIP: Try to use monarch to run torchtitan.
CLA Signed
This label is managed by the Meta Open Source bot.
#1286
opened Jun 12, 2025 by
ahmadsharif1
•
Draft
[deepseek][kernels][blackwell] Cutlass blackwell grouped gemm using cute dsl (forward,backward)
CLA Signed
This label is managed by the Meta Open Source bot.
#1276
opened Jun 8, 2025 by
lessw2020
Loading…
[deepseek][blackwell] add Cutlass cute dsl blackwell dense based looping group gemm
CLA Signed
This label is managed by the Meta Open Source bot.
#1274
opened Jun 8, 2025 by
lessw2020
Loading…
[llama4] enable expert parallel on the same device mesh as tp (tp2ep)
CLA Signed
This label is managed by the Meta Open Source bot.
#1269
opened Jun 6, 2025 by
hann-wang
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.