Microsoft Developer Community Blog

Training and Inference of LLMs with PyTorch Fully Sharded Data Parallel and Better Transformer

vilcek
Microsoft
Jun 14, 2023
In this blog we show how to perform efficient and optimized distributed training and inference of large language models using PyTorch’s Fully Sharded Data Parallel and Better Transformer implementati...
Updated Jun 15, 2023
Version 3.0