Depending on how your network is structured, you can swap in DeepSpeed as a drop-in replacement in a couple of hours. This will likely speed up your training a LOT (>3x in my case) vs. using vanilla PyTorch.
Faster training means faster experimentation, which means a tighter tuning cycle, which means better artwork!
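
To give a rough idea of what the drop-in replacement looks like, here is a minimal sketch, not my exact setup. It assumes a plain `torch.nn.Module`, a dataloader yielding `(inputs, targets)` pairs, and a `loss_fn`. The helper names `make_ds_config` and `train` and all the config values are made up for illustration. Options such as fp16 or ZeRO are left out to keep it small, so don't expect this bare config to reproduce the speedup on its own.

```python
# Sketch: replacing a vanilla PyTorch training loop with a DeepSpeed engine.
# make_ds_config and train are hypothetical helpers; values are placeholders.

def make_ds_config(batch_size=8, lr=1e-4):
    """Build a small DeepSpeed config dict (illustrative values only)."""
    return {
        "train_micro_batch_size_per_gpu": batch_size,
        "optimizer": {"type": "Adam", "params": {"lr": lr}},
    }


def train(model, dataloader, loss_fn, epochs=1):
    """Train `model` through a DeepSpeed engine instead of a raw optimizer."""
    import deepspeed  # imported lazily so the config helper works without it

    # deepspeed.initialize wraps the model and builds the optimizer from config.
    engine, _optimizer, _, _ = deepspeed.initialize(
        model=model,
        model_parameters=model.parameters(),
        config=make_ds_config(),
    )
    for _ in range(epochs):
        for inputs, targets in dataloader:
            inputs = inputs.to(engine.device)
            targets = targets.to(engine.device)
            loss = loss_fn(engine(inputs), targets)
            engine.backward(loss)  # replaces loss.backward()
            engine.step()          # replaces optimizer.step() and zero_grad()
    return engine
```

The main change from a vanilla loop is that `engine.backward(loss)` and `engine.step()` stand in for the usual `loss.backward()` / `optimizer.step()` pair. A script like this is typically started with the `deepspeed` launcher rather than plain `python`. How much rewiring you need beyond this depends on how your network is structured.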