Pull requests: pytorch/torchtune
Add support for distributed checkpointing of HF safetensors with DCP CLA Signed (This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.)
#2851 opened Jun 25, 2025 by ankitageorge
1 of 8 tasks
Enable activation offloading for XPU CLA Signed
#2847 opened Jun 24, 2025 by zxd1997066
4 of 13 tasks
[Gemma2] Use nn.SDPA via MultiHeadAttention CLA Signed
[DO NOT MERGE] fix main CLA Signed
#2832 opened Jun 16, 2025 by joecummings
[DONT MERGE] Debug branch for the Qwen3 + full model compile CLA Signed
#2831 opened Jun 16, 2025 by anijain2305
skip compiling opt step instead of erroring if opt_in_bwd=True CLA Signed
#2827 opened Jun 13, 2025 by felipemello1
1 of 4 tasks
raise error if fsdp_cpu_offload + opt_in_bwd CLA Signed
#2826 opened Jun 13, 2025 by felipemello1
1 of 4 tasks
Fix #2809, modify attention CLA Signed
#2822 opened Jun 12, 2025 by krammnic
2 of 13 tasks
[WIP] Qwen3 MoE support CLA Signed
#2820 opened Jun 12, 2025 by intervitens • Draft
6 tasks
[RFC] on-the-fly packing CLA Signed
#2819 opened Jun 12, 2025 by felipemello1
[WIP] Integrate OptimizerInBackward into SFT distributed recipe CLA Signed
#2818 opened Jun 11, 2025 by joecummings • Draft
Test alignment of shared methods in recipes CLA Signed
#2807 opened Jun 9, 2025 by Andrei-Aksionov • Draft
Integrate Muon optimizer (2725) CLA Signed
#2803 opened Jun 8, 2025 by Saurabh750
1 of 13 tasks
Fix command in config CLA Signed
#2796 opened Jun 6, 2025 by krammnic
1 of 13 tasks
[WIP] Proper tool calling support in torchtune CLA Signed
#2794 opened Jun 6, 2025 by krammnic
2 of 13 tasks
[RFC] Reward modeling CLA Signed
#2788 opened Jun 5, 2025 by krammnic
[RFC] Iterable Dataset CLA Signed
#2785 opened Jun 4, 2025 by felipemello1
Ungate FP8 + TP CLA Signed
[NOT FOR REVIEW] Full knowledge distillation recipe TP + FP8 CLA Signed
[WIP] DSV3
#2764 opened May 27, 2025 by SalmanMohammadi • Draft
2 of 7 tasks
Add LRScheduler.state_dict() to checkpoints CLA Signed
#2762 opened May 23, 2025 by omkar-334
2 of 13 tasks
[WIP][DEBUG] llama4 debugging CLA Signed
#2756 opened May 21, 2025 by IvanKobzarev
Fixing counting number of batches for accumulation through epoch CLA Signed
#2745 opened May 17, 2025 by wesbz
7 of 13 tasks
Add feature ligerceloss CLA Signed
#2741 opened May 16, 2025 by mananchawla2005
7 of 9 tasks
[WIP][Don't Review] Ray updates CLA Signed
#2724 opened May 12, 2025 by pbontrager • Draft
13 tasks