Accountability, APIs, Benchmark and Construction

Build a secure enterprise application with Generative AI and RAG using Amazon SageMaker JumpStart

AWS Machine Learning

SEPTEMBER 6, 2023

These SageMaker endpoints are consumed in the Amplify React application through Amazon API Gateway and AWS Lambda functions. To protect the application and APIs from inadvertent access, Amazon Cognito is integrated into Amplify React, API Gateway, and Lambda functions. You access the React application from your computer.

Enterprise

Enterprise APIs Real estate Construction
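The request path described in the excerpt above (React client, API Gateway, Lambda, SageMaker endpoint) can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the post's code: `make_handler`, `invoke_model`, and the `prompt`/`answer` field names are hypothetical, and the model call is injected so the sketch runs without AWS access. Authentication (Amazon Cognito in the post) is assumed to be enforced before the handler runs.

```python
import json
from typing import Any, Callable, Dict


def make_handler(invoke_model: Callable[[str], str]) -> Callable[[Dict[str, Any], Any], Dict[str, Any]]:
    """Build a Lambda handler for an API Gateway proxy event.

    `invoke_model` is a hypothetical callable that sends the prompt to the
    model endpoint and returns the generated text.
    """

    def handler(event: Dict[str, Any], context: Any) -> Dict[str, Any]:
        # API Gateway proxy integration passes the request body as a JSON string.
        try:
            body = json.loads(event.get("body") or "{}")
        except json.JSONDecodeError:
            return {"statusCode": 400, "body": json.dumps({"error": "invalid JSON"})}

        prompt = body.get("prompt") if isinstance(body, dict) else None
        if not prompt:
            return {"statusCode": 400, "body": json.dumps({"error": "missing prompt"})}

        # Forward the prompt to the model and wrap the reply in a proxy response.
        return {"statusCode": 200, "body": json.dumps({"answer": invoke_model(prompt)})}

    return handler


# Stub used here in place of a real endpoint call (assumption for illustration).
echo_handler = make_handler(lambda prompt: "echo: " + prompt)
```

Injecting the model call keeps the handler testable locally; in a deployed function the callable would wrap a SageMaker Runtime client.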

Build a multilingual automatic translation pipeline with Amazon Translate Active Custom Translation

AWS Machine Learning

JUNE 15, 2023

We demonstrate how to use the AWS Management Console and Amazon Translate public API to deliver automatic machine batch translation, and analyze the translations between two language pairs: English and Chinese, and English and Spanish. In this post, we present a solution that D2L.ai

APIs

APIs Benchmark Best practices Engineering

Trending Sources

How Patsnap used GPT-2 inference on Amazon SageMaker with low latency and cost

AWS Machine Learning

JULY 24, 2023

A recent initiative is to simplify the difficulty of constructing search expressions by autofilling patent search queries using state-of-the-art text generation models. In this section, we show how to build your own container, deploy your own GPT-2 model, and test with the SageMaker endpoint API. model_fp16.onnx gpt2 and predictor.py

APIs

APIs Engineering Construction Benchmark

FMOps/LLMOps: Operationalize generative AI and differences with MLOps

AWS Machine Learning

SEPTEMBER 1, 2023

Each business unit has its own set of development (automated model training and building), preproduction (automatic testing), and production (model deployment and serving) accounts to productionize ML use cases, which retrieve data from a centralized or decentralized data lake or data mesh, respectively.

Engineering

Engineering Accountability Construction APIs

Face-off Probability, part of NHL Edge IQ: Predicting face-off winners in real time during televised games

AWS Machine Learning

OCTOBER 5, 2022

We also share the key technical challenges that were solved during construction of the Face-off Probability model. To make an informed decision, we performed a series of benchmarks to verify SageMaker latency and scalability, and validated that average latency was less than 100 milliseconds under the load, which was within our expectations.

Calibration

Calibration Engineering Automotive Analytics
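A latency check of the kind mentioned in the excerpt above could look roughly like this. It is a minimal sketch, not the authors' benchmark: `measure_latencies`, `summarize`, and the stand-in `fake_invoke` are hypothetical names, requests are issued sequentially rather than under concurrent load, and a real test would call the SageMaker endpoint in place of the stub. The 100 ms threshold is the figure quoted in the excerpt.

```python
import statistics
import time
from typing import Callable, Dict, List


def measure_latencies(invoke: Callable[[], object], n_requests: int) -> List[float]:
    """Call `invoke` n_requests times and return per-call latency in milliseconds."""
    latencies_ms = []
    for _ in range(n_requests):
        start = time.perf_counter()
        invoke()
        latencies_ms.append((time.perf_counter() - start) * 1000.0)
    return latencies_ms


def summarize(latencies_ms: List[float], threshold_ms: float = 100.0) -> Dict[str, object]:
    """Report average and maximum latency and whether the average is under the threshold."""
    average = statistics.mean(latencies_ms)
    return {
        "average_ms": average,
        "max_ms": max(latencies_ms),
        "within_threshold": average < threshold_ms,
    }


def fake_invoke() -> None:
    """Hypothetical stand-in for an endpoint call; sleeps about 1 ms."""
    time.sleep(0.001)


if __name__ == "__main__":
    print(summarize(measure_latencies(fake_invoke, n_requests=20)))
```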

Evaluate large language models for quality and responsibility

AWS Machine Learning

NOVEMBER 30, 2023

Customers have to leave their development environment to use academic tools and benchmarking sites, which require highly specialized knowledge. We surveyed existing open-source evaluation frameworks and designed the FMEval evaluation API with extensibility in mind.

Construction

Construction Metrics industry standards APIs

Fast-track graph ML with GraphStorm: A new way to solve problems on enterprise-scale graphs

AWS Machine Learning

JUNE 9, 2023

With GraphStorm, you can build solutions that directly take into account the structure of relationships or interactions between billions of entities, which are inherently embedded in most real-world data, including fraud detection scenarios, recommendations, community detection, and search/retrieval problems.

Enterprise

Enterprise Construction Engineering Metrics

Reduce inference time for BERT models using neural architecture search and SageMaker Automated Model Tuning

AWS Machine Learning

JANUARY 19, 2024

We use the Recognizing Textual Entailment dataset from the GLUE benchmarking suite. Choose Request increase at account-level. The requested quota approval may take some time to complete depending on the account permissions. Then we construct the request metadata and record the start time to be used for load testing.

Metrics

Metrics Scripts Benchmark Enterprise

Model Hosting Patterns in SageMaker: Best practices in testing and updating models on SageMaker

AWS Machine Learning

NOVEMBER 9, 2022

Your application simply needs to include an API call with the target model to this endpoint to achieve low-latency, high-throughput inference. To deploy, use the endpoint_from_production_variant construct to create the endpoint. Deepali Rajale is an AI/ML Specialist Technical Account Manager at Amazon Web Services.

Best practices

Best practices Construction Metrics Enterprise
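The "API call with the target model" from the excerpt above can be illustrated as follows, assuming the endpoint is a SageMaker multi-model endpoint. This is a sketch only: `build_invoke_request`, the endpoint name, the model artifact name, and the payload shape are hypothetical. The keyword names match the SageMaker Runtime `invoke_endpoint` parameters; the actual call is left as a comment because it needs AWS credentials.

```python
import json
from typing import Any, Dict


def build_invoke_request(endpoint_name: str, target_model: str, payload: Dict[str, Any]) -> Dict[str, Any]:
    """Assemble keyword arguments for a SageMaker Runtime invoke_endpoint call."""
    return {
        "EndpointName": endpoint_name,
        # On a multi-model endpoint, TargetModel selects which model artifact serves the request.
        "TargetModel": target_model,
        "ContentType": "application/json",
        "Body": json.dumps(payload),
    }


# Hypothetical names and payload, for illustration only.
request = build_invoke_request("my-endpoint", "model-a.tar.gz", {"inputs": [1.0, 2.0]})

# With AWS credentials configured, the request could be sent like this:
#   import boto3
#   runtime = boto3.client("sagemaker-runtime")
#   response = runtime.invoke_endpoint(**request)
#   result = response["Body"].read()
```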

Customer Contact Central
