Unlock cost savings with the new scale down to zero feature in SageMaker Inference
AWS Machine Learning
DECEMBER 2, 2024
Today at AWS re:Invent 2024, we are excited to announce a new feature for Amazon SageMaker inference endpoints: the ability to scale SageMaker inference endpoints to zero instances. These sub-minute metrics can help trigger scale-out actions more precisely, reducing the number of NoCapacityInvocationFailures your users might experience.
Let's personalize your content