Remove APIs Remove Benchmark Remove Construction Remove Consulting
article thumbnail

How Patsnap used GPT-2 inference on Amazon SageMaker with low latency and cost

AWS Machine Learning

A recent initiative is to simplify the difficulty of constructing search expressions by autofilling patent search queries using state-of-the-art text generation models. In this section, we show how to build your own container, deploy your own GPT-2 model, and test with the SageMaker endpoint API. model_fp16.onnx gpt2 and predictor.py

APIs 66
article thumbnail

Fast-track graph ML with GraphStorm: A new way to solve problems on enterprise-scale graphs

AWS Machine Learning

The pre-trained GNN embeddings show a 24% improvement on a shopper activity prediction task over a state-of-the-art BERT- based baseline; it also exceeds benchmark performance in other ads applications.” Basically, by using the API of this layer, you can focus on the model development without worrying about how to scale the model training.