Accelerate Amazon SageMaker inference with C6i Intel-based Amazon EC2 instances
AWS Machine Learning
MARCH 20, 2023
To access the code and documentation, refer to the GitHub repo. Given a document as an input, the model will answer simple questions based on the learning and contexts from the input document. The container gets pushed into Amazon ECR and a C6i based endpoint is created to serve FP32 and INT8 models. Refer to invoke-INT8.py
Let's personalize your content