Abstract: Characterizing and predicting the training performance of modern machine learning (ML) workloads on compute systems with compute and communication spread between CPUs, GPUs, and network ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results