Relative Content

Tag Archive for amazon-web-servicesasynchronousinstanceendpointautoscaling

Scaling out a Asynchronous SageMaker Endpoint

I’ve deployed a Asynchronous SageMaker Endpoint and I want it to scale out (to 0 instances) when nothing is requested for a period of times and to scale in when something is requested (to <=1 instances)