Implement smart document search index with Amazon Textract and Amazon OpenSearch
AWS Machine Learning
SEPTEMBER 8, 2023
Documents in PDF, TIFF, JPEG or PNG format are put in an Amazon Simple Storage Service ( Amazon S3 ) bucket and subsequently indexed into OpenSearch using this Step Functions workflow. The Amazon SQS MessageRetentionPeriod is set to 14 days. The threshold of 550 is based on the Textract Service quota of 600 in the us-east-1 region.
Let's personalize your content