Amazon ECS supports attaching Amazon Elastic Inference accelerators to your containers, making deep learning inference workloads more cost-effective to run. Amazon Elastic Inference lets you attach just the right amount of GPU-powered acceleration to any Amazon EC2 instance, Amazon SageMaker instance, or Amazon ECS task, reducing the cost of running deep learning inference by up to 75%.
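
As a rough illustration of what attaching an accelerator to a container can look like, here is a minimal sketch of an ECS task definition built as a Python dictionary. It assumes the task-level `inferenceAccelerators` field and the container-level `resourceRequirements` entry of type `InferenceAccelerator` from the ECS task definition format. The family name, container name, image URI, memory size, device name, and the `build_task_definition` helper are hypothetical placeholders, not values from the announcement.

```python
import json


def build_task_definition(family, image, device_name="device_1",
                          device_type="eia1.medium"):
    """Build a task definition dict that declares one Elastic Inference
    accelerator and links it to a single container.

    All default values here are illustrative assumptions.
    """
    return {
        "family": family,
        # Declare the accelerator at the task level.
        "inferenceAccelerators": [
            {"deviceName": device_name, "deviceType": device_type}
        ],
        "containerDefinitions": [
            {
                "name": "inference",  # hypothetical container name
                "image": image,       # hypothetical image URI
                "memory": 1024,
                "essential": True,
                # Link the container to the accelerator by its device name.
                "resourceRequirements": [
                    {"type": "InferenceAccelerator", "value": device_name}
                ],
            }
        ],
    }


task_def = build_task_definition(
    "ei-inference-demo",
    "example.com/my-inference-image:latest",
)
print(json.dumps(task_def, indent=2))
```

With AWS credentials configured, a dictionary shaped like this could be passed to `boto3.client("ecs").register_task_definition(**task_def)`. The accelerator size, launch type, and networking settings would need to match your own environment.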

from Recent Announcements https://aws.amazon.com/about-aws/whats-new/2019/09/amazon-elastic-inference-now-available-in-amazon-ecs-tasks/