Accelerate Amazon SageMaker inference with C6i Intel-based Amazon EC2 instances
AWS Machine Learning
MARCH 20, 2023
For more information, refer to Lower Numerical Precision Deep Learning Inference and Training. Use the supplied Python scripts for quantization. Run the provided Python test scripts to invoke the SageMaker endpoint for both INT8 and FP32 versions. py scripts for testing. Refer to invoke-INT8.py py and invoke-FP32.py
Let's personalize your content