Implement smart document search index with Amazon Textract and Amazon OpenSearch
AWS Machine Learning
SEPTEMBER 8, 2023
Documents in PDF, TIFF, JPEG, or PNG format are put in an Amazon Simple Storage Service (Amazon S3) bucket and subsequently indexed into OpenSearch using this Step Functions workflow. Because the TextractAsync task can produce multiple paginated output files, the TextractAsyncToJSON2 process combines them into one JSON file.
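To make the combining step concrete, here is a minimal Python sketch of how several paginated Textract output files could be merged into a single JSON document. It is not the actual TextractAsyncToJSON2 implementation. It assumes each paginated file has already been loaded as a dict with the standard Textract response keys `Blocks` and `DocumentMetadata`, and the function names `combine_textract_pages` and `extract_lines` are hypothetical helpers introduced for illustration.

```python
import json
from typing import Any, Dict, Iterable, List


def combine_textract_pages(pages: Iterable[Dict[str, Any]]) -> Dict[str, Any]:
    """Merge paginated Textract-style responses into one document.

    Hypothetical helper: assumes every input dict follows the Textract
    response shape ("DocumentMetadata" plus a "Blocks" list).
    """
    combined_blocks: List[Dict[str, Any]] = []
    metadata: Dict[str, Any] = {}
    for page in pages:
        # Each paginated file carries its own slice of the Blocks list.
        combined_blocks.extend(page.get("Blocks", []))
        # DocumentMetadata repeats in every file; keep the first one seen.
        if not metadata and "DocumentMetadata" in page:
            metadata = page["DocumentMetadata"]
    return {"DocumentMetadata": metadata, "Blocks": combined_blocks}


def extract_lines(document: Dict[str, Any]) -> List[str]:
    """Return the text of every LINE block, in order, ready for indexing."""
    return [
        block["Text"]
        for block in document["Blocks"]
        if block.get("BlockType") == "LINE" and "Text" in block
    ]


if __name__ == "__main__":
    # Two made-up paginated outputs standing in for files read from S3.
    part_1 = {
        "DocumentMetadata": {"Pages": 2},
        "Blocks": [{"BlockType": "LINE", "Text": "Invoice 001", "Page": 1}],
    }
    part_2 = {
        "DocumentMetadata": {"Pages": 2},
        "Blocks": [{"BlockType": "LINE", "Text": "Total due", "Page": 2}],
    }
    merged = combine_textract_pages([part_1, part_2])
    print(json.dumps(merged, indent=2))
    print(extract_lines(merged))
```

In a real workflow the input dicts would come from the paginated files in the S3 output location, and the extracted lines would form the body of the document sent to the OpenSearch index.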