
Accelerate Amazon SageMaker inference with C6i Intel-based Amazon EC2 instances

AWS Machine Learning

Refer to the appendix for instance details and benchmark data. Use the supplied Python scripts for quantization, and run the provided Python test scripts to invoke the SageMaker endpoint for both the INT8 and FP32 versions of the model. Quantizing the model in PyTorch requires only a few APIs from the Intel Extension for PyTorch.
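The post relies on the Intel Extension for PyTorch for the actual conversion; as a minimal sketch of the arithmetic underneath, here is affine INT8 quantization in plain Python (the values and range are illustrative, not from the post's benchmarks):

```python
# Affine INT8 quantization: map FP32 values to int8 with a scale and
# zero-point, then dequantize to observe the bounded rounding error.

def quantize_params(values, qmin=-128, qmax=127):
    """Compute scale and zero-point covering the observed FP32 range."""
    lo, hi = min(values), max(values)
    lo, hi = min(lo, 0.0), max(hi, 0.0)   # range must include zero
    scale = (hi - lo) / (qmax - qmin)
    zero_point = round(qmin - lo / scale)
    return scale, zero_point

def quantize(values, scale, zero_point, qmin=-128, qmax=127):
    return [max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values]

def dequantize(qvalues, scale, zero_point):
    return [(q - zero_point) * scale for q in qvalues]

weights = [-0.62, 0.0, 0.35, 1.27]
scale, zp = quantize_params(weights)
q = quantize(weights, scale, zp)
recovered = dequantize(q, scale, zp)
max_err = max(abs(a - b) for a, b in zip(weights, recovered))
```

INT8 inference trades this small per-value error for roughly 4x smaller weights and faster integer arithmetic on instances such as C6i.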


Amazon Comprehend announces lower annotation limits for custom entity recognition

AWS Machine Learning

For example, you can immediately start detecting entities such as people, places, commercial items, dates, and quantities via the Amazon Comprehend console, AWS Command Line Interface, or Amazon Comprehend APIs. In this post, we walk you through the benchmarking process and the results we obtained while working on subsampled datasets.
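A sketch of consuming an entity-detection result, assuming a response shaped like Comprehend's DetectEntities output (the sample response below is illustrative, not real API output; a live call would use `boto3.client("comprehend").detect_entities(Text=..., LanguageCode="en")`):

```python
# Filter a DetectEntities-style response by confidence score.
sample_response = {
    "Entities": [
        {"Text": "Seattle", "Type": "LOCATION", "Score": 0.99,
         "BeginOffset": 0, "EndOffset": 7},
        {"Text": "next Tuesday", "Type": "DATE", "Score": 0.97,
         "BeginOffset": 25, "EndOffset": 37},
        {"Text": "thing", "Type": "OTHER", "Score": 0.41,
         "BeginOffset": 50, "EndOffset": 55},
    ]
}

def confident_entities(response, threshold=0.9):
    """Keep only entities whose confidence clears the threshold."""
    return [
        (e["Type"], e["Text"])
        for e in response["Entities"]
        if e["Score"] >= threshold
    ]

print(confident_entities(sample_response))
# → [('LOCATION', 'Seattle'), ('DATE', 'next Tuesday')]
```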

article thumbnail

FMOps/LLMOps: Operationalize generative AI and differences with MLOps

AWS Machine Learning

These teams are as follows: Advanced analytics team (data lake and data mesh) – Data engineers are responsible for preparing and ingesting data from multiple sources, building ETL (extract, transform, and load) pipelines to curate and catalog the data, and preparing the necessary historical data for the ML use cases.


Build a robust text-based toxicity predictor

AWS Machine Learning

In real-world toxicity detection applications, toxicity filtering is mostly used in security-relevant industries like gaming platforms, where models are constantly being challenged by social engineering and adversarial attacks. Social engineers can exploit this characteristic of NLP models to bypass toxicity filtering systems.
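A toy illustration of the attack surface described above, showing how a naive keyword-based filter is evaded by small character perturbations (the blocklist, words, and homoglyph map are hypothetical; real systems and attacks are far more sophisticated):

```python
# A naive exact-match toxicity filter versus a homoglyph perturbation:
# swapping Latin letters for visually similar Cyrillic ones changes the
# token's bytes without changing how it reads to a human.

BLOCKLIST = {"idiot", "stupid"}

def naive_filter(text):
    """Flag text containing an exact blocklisted token."""
    return any(tok in BLOCKLIST for tok in text.lower().split())

def perturb(word):
    """Swap some Latin letters for Cyrillic homoglyphs."""
    homoglyphs = {"i": "\u0456", "o": "\u043e"}  # і, о
    return "".join(homoglyphs.get(c, c) for c in word)

original = "you are an idiot"
evasive = "you are an " + perturb("idiot")

print(naive_filter(original))  # True  — caught
print(naive_filter(evasive))   # False — bypasses the exact-match filter
```

Robust predictors are trained and evaluated against exactly this kind of perturbed input rather than clean text alone.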


Train gigantic models with near-linear scaling using sharded data parallelism on Amazon SageMaker

AWS Machine Learning

Data scientists and machine learning engineers are constantly looking for the best way to optimize their training compute, yet struggle with the communication overhead that grows along with the overall cluster size. To get started, follow Modify a PyTorch Training Script to adapt the SMP APIs in your training script.
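A back-of-the-envelope sketch of why sharding helps, assuming mixed-precision Adam with roughly 16 bytes of model state per parameter (fp16 weights and gradients plus fp32 master weights, momentum, and variance, per the ZeRO analysis); the 30B-parameter and 64-GPU figures are illustrative, not from the post:

```python
# Per-GPU model-state memory with and without sharding: sharded data
# parallelism splits the replicated model state across the data-parallel
# group instead of keeping a full copy on every GPU.

def model_state_gib(num_params, shard_degree=1, bytes_per_param=16):
    """Per-GPU model-state memory in GiB at a given sharding degree."""
    return num_params * bytes_per_param / shard_degree / 2**30

params_30b = 30e9
print(f"unsharded: {model_state_gib(params_30b):.0f} GiB/GPU")
print(f"sharded across 64 GPUs: {model_state_gib(params_30b, 64):.1f} GiB/GPU")
```

The unsharded figure (~447 GiB) exceeds any single accelerator's memory, while the sharded figure fits comfortably; the cost is the extra communication that the library's near-linear scaling work addresses.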


New performance improvements in Amazon SageMaker model parallel library

AWS Machine Learning

Finally, we’ll benchmark the performance of 13B-, 50B-, and 100B-parameter auto-regressive models and wrap up with future work. A ready-to-use training script for the GPT-2 model can be found in train_gpt_simple.py. To train a different model type, follow the API document to learn how to apply the SMP APIs.
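For intuition about model sizes like those benchmarked, here is a rough GPT-style parameter estimator using the standard approximation of ~12·h² parameters per transformer layer (4·h² for attention projections, 8·h² for the MLP) plus embeddings; biases and layer norms are ignored, and the 40-layer/5120-hidden config below is a hypothetical 13B-scale shape, not one from the post:

```python
# Rough parameter count for an auto-regressive (GPT-style) transformer.

def gpt_params(layers, hidden, vocab_size=50257, seq_len=1024):
    per_layer = 12 * hidden**2                  # attention + MLP weights
    embeddings = (vocab_size + seq_len) * hidden  # token + position
    return layers * per_layer + embeddings

# Sanity check: GPT-2 small (12 layers, hidden 768) lands near the
# well-known ~124M figure.
print(f"{gpt_params(12, 768) / 1e6:.0f}M")    # ~124M
# Hypothetical 13B-scale config (40 layers, hidden 5120):
print(f"{gpt_params(40, 5120) / 1e9:.1f}B")   # ~12.8B
```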


How to extend the functionality of AWS Trainium with custom operators

AWS Machine Learning

Trainium support for custom operators

Trainium (and AWS Inferentia2) supports CustomOps in software through the Neuron SDK and accelerates them in hardware using the GPSIMD engine (General Purpose Single Instruction Multiple Data engine). The scalar and vector engines are highly parallelized and optimized for floating-point operations.
