🤗Accelerate

Topic	Replies	Views	Activity
About the 🤗Accelerate category	1	2460	February 20, 2022
FastLoRA v4.2 — Fine-tuning library that never crashes (pip installable)	0	50	March 20, 2026
BitsandBytes conflict with Accelerate	7	1515	March 6, 2026
FSDP Auto Wrap does not work using `accelerate` in Multi-GPU Setup	2	508	December 31, 2025
Accelerate + Gemma2 + FSDP	4	297	December 28, 2025
Error on fsdp2 trainer with fsdp_cpu_ram_efficient_loading = True	1	194	December 19, 2025
Confusion with sequence parallelism	3	179	December 16, 2025
Accelerate consumes more memory	4	295	November 19, 2025
Bug when using gradient accumulation with accelerate	1	88	November 7, 2025
Load_state() with custom objects and scheduler	1	104	November 1, 2025
What is the best way to save the state of a model and optimizer when the model has 2 LoRAs?	3	480	October 30, 2025
How does Accelerate ensure uniqueness of data samples across GPUs?	3	1068	October 30, 2025
Loss spike when resuming from FSDP SHARDED_STATE_DICT checkpoint (possible optimizer-state mismatch)	4	340	October 13, 2025
Do I need to divide the loss by num_processes when I set split_batches True?	1	51	September 14, 2025
Perform knowledge distillation using accelerate	1	522	August 14, 2025
How to Setup Deferred Init with Accelerate + DeepSpeed?	6	345	August 11, 2025
How to get the grad norm of a deepspeed-zero3 model after accelerator.prepare()	2	838	July 23, 2025
Problem with full-finetuning on cluster	1	82	June 25, 2025
Transformers Trainer + Accelerate FSDP: How do I load my model from a checkpoint?	3	16856	June 22, 2025
NCCL Timeout Accelerate Load From Checkpoint	2	2879	June 20, 2025
Not seeing memory benefit to accelerate/FSDP2	2	385	June 18, 2025
DistributedSampler with Accelerate	1	129	June 10, 2025
Where can I find the full list of parameters for the Accelerate yaml config?	3	152	June 5, 2025
Synchronizing State, Trainer and Accelerate	2	132	May 22, 2025
[RuntimeError] DPOTrainer - "element 0 of tensors does not require grad and does not have a grad_fn" on 8x A100 GPUs	1	132	May 20, 2025
Reproduce SFTTrainer with Accelerate and PyTorch	0	212	May 18, 2025
11B model gets OOM after using deepspeed zero 3 setting with 8 32G V100	2	1533	April 26, 2025
Multi-gpu inference llama-3.2 vision with QLoRA	4	275	April 25, 2025
How to work with meta tensors?	1	2833	April 16, 2025
Issues with Dataset Loading and Checkpoint Saving using FSDP with HuggingFace Trainer on SLURM Multi-Node Setup	1	353	April 7, 2025