热点
"多GPU训练" 相关文章
Fine-tune FLAN-T5 XL/XXL using DeepSpeed and Hugging Face Transformers
philschmid RSS feed 2025-09-30T11:12:50.000000Z
Combine Amazon SageMaker and DeepSpeed to fine-tune FLAN-T5 XXL
philschmid RSS feed 2025-09-30T11:12:48.000000Z
Fine-tune Falcon 180B with DeepSpeed ZeRO, LoRA and Flash Attention
philschmid RSS feed 2025-09-30T11:11:54.000000Z