Tag: Multi-GPU Large Model Speed Optimization