Accelerated PyTorch inference with torch.compile on AWS Graviton processors
AWS Machine Learning
JULY 2, 2024
We benchmarked 45 models using the scripts from the TorchBench repo and measured a 1.35x latency improvement (geomean across the 45 models). In a second benchmark of 33 models, the performance improvement was around 2x (geomean across the 33 models).
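The geomean figures quoted above summarize per-model speedups with a geometric mean. Here is a minimal sketch of that calculation, assuming the speedup for each model is its baseline latency divided by its torch.compile latency. The model names and latency values are hypothetical placeholders, not results from the benchmark, and the helper `geomean_speedup` is an illustrative name rather than part of TorchBench.

```python
from statistics import geometric_mean

# Hypothetical per-model latencies in milliseconds (placeholders, not measured results).
eager_ms = {"model_a": 120.0, "model_b": 80.0, "model_c": 45.0}
compiled_ms = {"model_a": 60.0, "model_b": 80.0, "model_c": 30.0}


def geomean_speedup(baseline, optimized):
    """Return the geometric mean of per-model speedups (baseline / optimized)."""
    ratios = [baseline[name] / optimized[name] for name in baseline]
    return geometric_mean(ratios)


speedup = geomean_speedup(eager_ms, compiled_ms)
print(f"geomean speedup: {speedup:.2f}x")
```

A geometric mean suits ratios like these because a single model with an unusually large speedup shifts it less than it would shift an arithmetic mean.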