Can you provide a sample code for training with DeepSpeed ZeRO3?
#10
by
SupercarryNg
- opened
When I try to use DeepSpeed ZeRO3 to do some fine-tuning training for a specific task, I can't specify the device map=“sequential”, how do I set it up correctly?
Thanks a lot!